RESULT 1

US-10-478-914-64 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64

; LENGTH: 2120

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-64 

Query Match 44.6%; Score 2120.5; DB 1; Length 2120; 

Best Local Similarity 36.4%; Pred. No. 3.7e-115; 
Matches 514; Conservative 0; Mismatches 264; Indels 635; 
Gaps 73; 

Qy 1 GGCACGAGCG AGTTCCTGTCTC TCTGCCAACGCC 

GCCCGG 40 

I I I I I II I I I I I I I I II I I I I I I I I I 

Db 83 

GGCGCGGACGTGCCCGGTGCCTGGCGCGTGGTAGCAGGCGCCCGGTGCCCCGGCCGGCGA 142 

Qy 41 ATGGCTTCCC AAAACC GCGACCCAGCCG 

CCACTAGCGT 78



Db 143 AGACCATGGCGTTCATGGTGAAGACCATGGTGGGCGG- 

CCAGCTGAAGAACCTCACTGGG 201 



Qy 79 CGCC GCCGCCCG — TAAAGGAGCTG AGCCGAG- 

CGGGG- 113 



Db 202 

AGCCTGGGAGGCGGCGAGGATAAGGGAGATGGGGACAAGTCGGCAGCCGAAGCTCAGGGC 261

Qy 114 — GCGCCGCCCGGGGTCCGGTGGGCAAAAG — GCTAC 

AGC AGGAGC - T 156 

I I I I I I I I I I I I I I I I I I I I II 

Db 262 

ATGAGCCGGGAGGAGTACGAGGAGTATCAGAAGCAACTCGTGGAAGAGAAGATGGAGCGG 321 

Qy 157 GATGACC CTCATGATGTCTGGCGATAAAGGGATTTCTGC 

CTTCC C 201 

I I I I I III I III II I I I I I 



Db 322 

GATGCACAGTTCACACAGAGGAAGGCAGAGCGGGCCACACTGCGGAGCCACTTCCGAGAC 381

Qy 2 02 T G AAT C AG AC AAC C T T T T C AAATG GGTAGGGACC — 

ATCCA 240

I I I I I I I I I I I I I I I I I I I 

Db 382 

AAATACCGGCTACCCAAGAACGAGACAGATGAGAGCCAGATCCAGATGGCAGGTGGAGAC 441

Qy 2 41 -TGGAGC AGCTGGAACAGTATAT GAAGAC- 

CTGAGGTATAAG 280 

I I I I I I I I I I I I II II I I I I I I I I I I II 

Db 442 

GTGGAGCTGCCCCGGGAGCTGGCCAAGATGATCGAGGAGGACACAGAGGAGGAGGAGGAG 501 

Qy 281 CTC— TC GCTAGAGTTCC CCAGTGGCT- 

ACCCTT 311 

III II I I I I I I I I I I I I I I I I I I 

Db 502 AAGGCCTCAGTCCTTGGGCAGCTGGCCAGCCTTCCTGGCTTGAACCTG- 

GGCTCACTCAA 560

Qy 312 — ACAATGCGC CCACAGTGAAGTTCCTCACGCC— CTGC 

TATCACCCCAA 357 

I I I I I I I I I I I I II I I I I I I I III I I I I I 

I I 

Db 561 

GGACAAGGCCCAGGCCACACTGGGGGATCTCAAGCAATCAGCTGAGAAGTGTCACGTCAT 620

Qy 358 CGTGGACA CCCAGGGTAACATAT — GCCTGGACATCC-TGAAGGAAAAGTG 

GT 407 

III II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 621 - 

GTGACCACTTCCCCGGGGTTACCCACTGGGCTGGGCCCCCATGAGGGCTAAGAGTGTGT 679

Qy 408 C TGCCCTGT- 

ATGATGTC 424

I I I I I I I I I 

I 

Db 680 

CAACTTCCAGGGACCCATACTCCATTTGGGGCTTTGTTTCCCTTGCCCCATCCTAGTTCC 739 



Qy 42 5 AGGACCATTCTGCTCTCCAT — CCAGAGCCTT 

CTAGGA 460

I I I I I I I I I I I I I I II I I I I I II 

I I I 

Db 740 AAGACCTTTC — 

CCATCCATGCCCCAAGCCTATCTTCTGGTTTCTTCCTCTCCGCTGGGA 797

Qy 461 GAA CCCAACATTGATAGTCCCTTGA ACACACATGCT— GCC 

GAG 502 

II I I I I I I I I I I I I I I I I I III III 

I I I 

Db 798 

GTAAAGTCCCCATCTTCACTCTACCCTTCAGGACCCTCCCCACCAGCTCAGCCTGTGGAG 857

Qy 503 — CTC TGGAAAAACCCC AC AGC T T T T AAG 

AAGTACCTGCA 540 

III I I I I I I I I I III I I I I I 

I I I I I I 

Db 858 

GCCTCCCAAGATTGTAGGAATAGGCCCATCCCTCTCTGGCCATGGCCCCAAGTTCCTGCA 917 

Qy 541 AG AAAC C T AC T C AAAG CAGGTCACC 

AGCC 569 

I I I I I I I I I I I I I I I I 

I 

Db 918 

CACAGGAGCACCCACAGAGAGACACACACAGGACACAAAACCCCTGGCACGTTCAGAGAC 977

Qy 570 AGGAGCC-CTGAC CCAGGC 

TGC 590 

II I I I I I III II III 

I I 

Db 978 

AGAAGCCACAGACACATCCCGGCACAGACAGACACACACGAGGCCAGCTCCCTTGCGTGT 1037
Qy 591 

CCAGCCTGTCCT 602 

I I I I I I III 

Db 1038 

CCAGCCCCTCCAGACACCACCACTCAGAAACTCTGAGAGAGAGCATGGGCAGACACCCTC 1097

Qy 603 TGTGT 

CGTCTTTTTAATTTTT 623 

I I I I I I I I I I I I I I 

Db 1098 

AGCAGACAGGAGGCCTGAGTTCCAGTCTCCACCTTTATTGTTCTTGAAAGCCCCTGCTCT 1157

Qy 624 CCTTAGAT GGTCTGTCCT TTTTGTG- 

AT 650 

I I I I I I I I I I I I I I I I I I 

I 

Db 1158 

CTCTGAGCCTTATTTCATCATCTGTAAAATGGGAATGTCCTGAATGACTTCTAAGGCTCT 1217 

Qy 651 TTCTGTATAGGACT CTTTAT 

CTTGAG 676

I I I I I I I I I I I II I I I I I 

Db 1218 

TTCTGGCTTGAACTGTCAGAGCCAAGCCCACATCCCTCCTTGGGCAGGGCAGCAGCTGCT 1277



Qy 6 77 CTGTGGTATTT 

TTGTTTTGTTTTT 700 

I I I I I I I I II 

Db 1278 

GCCACAGCCTCCAGCGGCTGCCACTGTGGGCTCTGGGAGCCGGAGCGATGCTGTGTGAGA 1337

Qy 701 GTCT TTTAAATTAA 

GCCTCGGTTGAG 726 



Db 1338 

GGCAGAGTGCCAAGGATGAAGCTGGCACTGAACAGTAAGCGGCTCCAGGCCTCCTCTGGG 1397

Qy 72 7 CCCTTG TAT AT T AAAT AAAT G 

CATTTTTG 755 



Db 1398 

CCCAGGGCCCAGCCAATTTCTGTTCTGTTCCTGTAGAACGCTCTCTGGATTCCATAGCTG 1457

Qy 756 TCCTTTTTTAAAAAAAAAAAAAAAAAAA 783 

I I I I I I I I I I I I I I I I I I 

Db 1458 GAATCTCCTCTCTTAGCTCAGTGAAAAATAAAA 1490 



RESULT 2 
US-10-478-914-4 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4

; LENGTH: 3210

; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-4 

Query Match 44.3%; Score 2106; DB 1; Length 3210; 

Best Local Similarity 36.8%; Pred. No. 3.6e-113; 
Matches 478; Conservative 0; Mismatches 290; Indels 530; 
Gaps 63; 

Qy 1 GGCACGAGCG AGTTCCTGTCTCTCTGCCAAC 

GCCGCCCGGATG 43 



Db 680 

GGCCCATCCGCAACCTCACGTTCCAGGACCTTCACCTGCACCATGGAGGCCACCAGGCTG 739 



Qy 4 4 GCTTCCCAAAACCGCGACCCAGC CGCCACTAGC GTC- 

GCCGCCGCCCGTAAA 9 4 

I II I I I I I I I I II I I I I I I II I I I I I I I I I I I 

Db 740 CCAACAC- 

CAGCCACGACCTGGCTCAGCGCCACGGGCTGGAGTCGGCCTCCGACCACCAT 79 8 

Qy 95 GG AGCTGAGCC 

GAGCGG 111 

II I I I I I I I I I I I I 

Db 799 

GGCAACTTCTCCATCACCATGCGCAACCTGACCCTGCTGGATAGCGGCCTCTACTGCTGC 85 8 

Qy 112 GGGCGCCGCC CGGGGTCCGGTGGGCAAAAG- 

GCTA 145 

I I I I I I I I I I I I I I I I I I I 

I I I 

Db 859 

CTGGTGGTGGAGATCAGGCACCACCACTCGGAGCACAGGGTCCATGGTGCCATGGAGCTG 918 

Qy 146 CAG CAGGAGCTGATG-ACCCTC ATGATGT 

CTG 176 

III I I I I I I I I I I I I I I I I I I I 

I 

Db 919 

CAGGTGCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGTACCCATCCTCCTCCCAG 9 7 8 

Qy 177 GCGA 

TAAAGGGATTTCTG 194 

III I I I I I I I I I 

Db 979 

GAGAGTGAAAGTAAGGGACCAACCTCTTGCCCCTTTTGGGTTCTCTGTTTTCTTCTGTCC 103 8 

Qy 195 CCT TCCCTG 

AATCAGA 210 

III I I I II I I I I I 

Db 1039 

TCATCCTGCACCCAGACCCTGTTTGGAACTCTGGCCTCATCACCCCAAGCCCTCAGAA.ee 109 8 

Qy 211 CAACCTTTTCAAATG-GGTAGGGAC 

CATCCATGGAGCAG — 2 48 

I I I I I I I I II II I I I I I I I I I I I 

Db 1099 

CCCCGGTCCTCCTCCTTTTCTGCTGCTGCACATCCCTTCTGCTTCCTCCTTGGTGCAATC 1158 

Qy 2 49 — CTGGAACAGTATATGAAGAC CTGAGGTATAAGCT 

CTCGCTAGA 291 

I I I I II I I I I I I I I I I I I I I I 

I I 

Db 1159 

CCCAGAAGCCCACTCTCCTTCCATCTGCTCTGGAGTCTCTGCTCCTCTTGACTCTCTGGA 1218 

Qy 292 GTTCC CCAGTGGCTA C CCT 

TACAATGC 318 

I I I I I I I I I I I I III III 

I 

Db 1219 

GTGGCTGTGCCTTGGCAGTGACCTTTGGCCAGGGCAAGTGCCTCATGACAGGTACTGGGT 12 7 8 



Qy 319 GCCC— 

ACAGTGAAGTTCCTCACGCCCTGCTATCACCC 35 4 

I I I I III I I I I I I I I II I I I I I I 

Db 1279 

GCCCCAGGCAGCTAAGTGCCGCCCTGCCCACCAGCCCCCTATGGCTTGGGAAGGCTGGGG 133 8 

Qy 355 — CAACGTGGACACCCAGG-GTAAC 

ATATGCCTGGA 387 



Db 1339 

GTCCTCTTGGCCAACAGGGTGAAACCCCATCTCTACTAAAAACACAAAAATTAGCCAGG- 139 7 

Qy 388 CAT CCTGAAGG 

AAAAGTGGTCTG 410 

III I I I I I I I I I I I I I I 

Db 139 8 CATGGTGGTGCGCGCCTGTAGTCCCAGCTACTCAAGAGG- 

CTGAGGCAAGAGAATCGCTT 1456 

Qy 411 CCC TGTATGATGT CAGGA 

CCATTCTGCTCTCCATCC 446 

III I I I I I I III I I I I I I I I I I 

Db 145 7 GAACCCGGGAGTCAGAGGTTGCAGTGAGCCGAGATCGCGCCA — 

CTGCACTCCAGCCTGC 1514 

Qy 447 AGAG C C T T C T AGG AG AAC C C AAC AT TGATAGTCCCTT G AA- 

CAC 489 



Db 1515 

TGATAGAGCAAGACTCCATCTCAAAAAAAACAACCCAAAATTTGCCTGGCATGGTGGCAG 157 4 

Qy 49 0 ACATGCTG CCGAGCT CTGGAAAAACCCCACAGCTTTT- 

AA 528 



Db 15 75 GCAT- 

CTGTAATCCCAGCTACTCGGGAGGCTGAGACATGAGAGCTGCTTGAACCTGGGAG 1633 

Qy 52 9 GAAGTACCTGCAAGAAACCTA CTCAAAGCAGGTCACCAGCCAG- 

GAG 574 



Db 1634 

GCAGAAGTTGCAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGTGACAGAGCGAG 1693 

Qy 575 CC-CTGACCCAG 

GCTGC 590 



Db 1694 

ACTCTGTCCCAAAAAATCAAAAAAATCACTTTTGGTAGAGATGCACTCTCGCTATGTTGC 1753 

Qy 591 CCAGCCTGTCCTTGTG-TC GTCTTTTTAATTTTTCCTTAGATGGTCTGTCCT 

TT 643 



Db 175 4 CCAGGCTGGTCTTGAACTCCTGGGCTCAATTGATCTTCCCACCTTGACC — 

TCCAAAGTG 1811 



Qy 6 44 TTGTGATTTCT-GTATAGGACTCTTTATCTTGAGCTGTGGTATTTTTGT- 

TTT 694 



Db 1812 

CTGGGATTACAGGTGTGAGCCACCATGCCTAGCCTCAGGGAATTCTTATAAGAACTCTAT 18 71 

Qy 695 GTTTTTGTCTT TTAAATTAAGCCTCGGT 

TGAGCCCTTGTATATTAAATAA 744 

I I I I I II I I I I I I I I I I I I I 

Db 18 72 GAAGTAGGCATCACCATCTTCTCTGTATCCATGGAAAGAGAGGCCTAG 

AGATGT 1925 

Qy 745 ATGCATTTTTGTCCTTTTTTAAAAAAAAAAAAAAAAAA 782 

I I I I I I I I I I I I I I II 

Db 1926 ATGCTAACTTGCCCAAGCTCACATAGCCCAGGGTAGCA 1963 



RESULT 3 

US-10-478-914-61 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61

; LENGTH: 2437

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-61 

Query Match 44.2%; Score 2098.5; DB 1; Length 2437;

Best Local Similarity 37.1%; Pred. No. 2.1e-113; 
Matches 522; Conservative 0; Mismatches 253; Indels 631; 
Gaps 78; 

Qy 1 GGCAC— GAGCGAGTT CCTGTCTCTCT GCCAACGCCGCCC— 

GGAT 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 321 GTCACCTGCGAGAGTTCGAGCCCGACGCAATTGGTGGTGGAGCCAA- 

GCAGCCCAAGGCA 3 79 

Qy 43 G GCTTCC CAAAACCGCG 

ACC C 63 



Db 380 

GTGTTTGTGCTGAGCTGCCTGCTGAAAGGCCGGACCGTGGAGATCCTACGGGACATCATC 439 



Qy 6 4 AGCCGC CACTAGC GTCGC 

CGCCGCCCGTAAA 9 4 

I I I I I I I I I I I I I I I I I I I I 

Db 440 

TGCCGCAGTCACTTCCAGTATTGTGTGGTGGTCACAACCGTGAGCCACGCTGTCCACCTC 499 

Qy 95 GGAGCTGA GCCGAGCGGGGGCGCCGCCCGGGGT 

CCGGTGGGC 136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 500 

ACAGCTAATCATGTCCCAGCGGCGGCAGCGGCCGAGATGGAGGGGCAGCAGCCGGTGTTC 559 

Qy 13 7 AAAAGGCTACAGCAGGAGCT GAT G AC C C T C AT GAT G T 

CTGGCGA 180 

I III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 560 

GAGCAGCTGGAGGAGAAGCTGTGTGAATGGATGGGCAACATGAACTACACGGCCGAGGTG 619 

Qy 181 TAAAGGGATTTC-TGC CTTCCC 

TGAATCAGAC 211 

I I I I I I I I I I I I I I I I I I I I 

Db 620 

TTCCATGTCCCGTTATTGCTTGCCCCTGTTGCTCCCCACTTTGCCTTGACTCCAGCTTTT 6 79 

Qy 212 AACCTTTT-CAAATG GGTAGGGACCATCC ATGGAG — 

CAGCTGGAA 254 

I I I I I I I I I I II II I I I I I I I I I I 

I I I 

Db 680 

GCATCCCTTTTCCCACTGCTACCCCAGGATGTGCACCTCCTTAATAGCGCCCGACCGGAC 739 

Qy 255 CAGTATATGAAG ACCTGAGGTATAAG — CTCTCGCTA- 

GAGTTCCCCAGTG 3 02 

II II I I I I I I II I I I I I I I I I I I I I I I I 
Db 740 AAGAGGAAGCTGGGAAGCCTGGGTGATGTGGACTC- 
CACTACGCTAACCCCAGAGCTGCT 79 8 

Qy 3 03 GCTACCCTTACAATGC GCC 

CACAGTG 32 8 

I I I I I I I I I III I I I I I 

Db 799 

GCTGCAGATCAGATGCCTAGTGTCAGGCCTCAGTTCTCTGTGTGAACATTTAGGAGTACG 85 8 

Qy 329 AAGTTCCTCACGCC CTGC- 

TATC 350 

I I I I I I I I I I I I I I III 

Db 859 

GGAGGAGTGTTTTGCTGTAGGTTCCTTAAGTCAGGTCATCGCTGCGGATCTGGCCAATTA 918 

Qy 351 -ACCCCAAC 

GTGGA 363 

I I I I I 

I I I I I 

Db 919 

TGCCCCTGCAAAGAACAGGAAGAAGACTGCTGCAGGCAGGGCATCAGTGGTTTTTGTGGA 9 7 8 

Qy 36 4 C ACCCAGGGTAACATA — TGCC — TGGACATCCTGAAG 

GAAAAG— 4 03 

I I I I I I I I III II I I I I I I I I I I I I I I I I I 



Db 979 

CAGAACCCTGGATCTCACAGGAGCAGTTGGACATCATGGAGACAACTTAGTAGAGAAGAT 103 8 

Qy 404 

TGGT 407 

I I I I 

Db 1039 

CATTTCAGCACTTCCCCAGCTCCCAGGCCACACAAATGATGTGATGGTTAACATGATAGC 109 8 

Qy 408 CTGCCCT GTATGATGT 

CAGG 42 7 

I I I I I I I I I I I I I I I I I 

Db 1099 

GCTCACTGCACTCCATACTGAGGAGGAAAATTATAATGTGGTTGCACCAGGCTGTCTTTC 115 8 

Qy 42 8 ACCATTCTG CTCTCCATCCAGAGCCTTCTAGGA 

GAACCCAA 46 8 

I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 1159 

ACAATTCAGTGACACCACAGCCAAAGCCCTATGGGAAGCTTTACTGAACACTAAGCACAA 1218 

Qy 469 CATTGATAG T C C C T T G AAC AC - AC AT G 

CTGCC 499 



Db 1219 

AGAGGCAGTGATGGAAGTTCGGAGACATCTAGTGGAAGCGGCAAGCAGAGAAAACCTGCC 12 7 8 

Qy 50 0 GAGC TCTG GAAAAACCCC ACAGCT — TTTAAGAAGTAC 

CTG 538 

II III II I I I I I I I I I I I I III 

I I 

Db 1279 

AATCAAGATGAGTATGGGGAGAGTCACACCGGGACAGCTCATGTCCTATATTCAGCTCTT 133 8 

Qy 539 CAAGAAACCTACTCAAAGC AGG TCACCAGCCAGGA 

GCCC 577 



Db 1339 CAAG- 

AACAACCTCAAAGCTCTAATGAATCATTGTGGCCTCCTCCAGCTTGGACTGGCCA 139 7 

Qy 578 TGACCCAGGC — TG CCCAGCCTGTCCTTGT — 

GTCGTCTTTTTAATTTTT 623 

III I II I I I I I II I I II II I I I I I I I I I 

Db 139 8 C AG C T C AAAC G T T G AAAC AC C C AC AG AC T G - 

CCAAGTGGGACAACTTTCTGGCTTTTGAA 1456 

Qy 62 4 CCTTAGATGGTC 

TGTCCTTTTTG 646 

I II I III I I I I I I I I I 

Db 1457 

AGGCTCCTTCTTCAGAGCATTGGGGAGTCAGCAATGTCCGTTGTGTTAAATCAGCTGCTG 1516 

Qy 6 47 TGATTT CTGTAT AGGACTCTTTATCT — TGAGCTG- 

TG 681 



Db 1517 

CCCATGATTAAGCCTGTAACCCAGAGAACCAACGAGGACTACAGCCCTGAGGAACTGCTG 15 76 



Qy 6 82 GTATTTTTGTTTTGTTTTTGTCTTTTAAAT — TAAGC-CTCGGTTG 

AGCCCT 73 0 

I III I I I I I I I I I I I I I I I I I I I I I 

Db 1577 

ATCCTTCTCATATATATTTATTCTGTCACT GGAGAGC TC ACGGT AGAC AAAG AC CTGTGT 1636 

Qy 731 TGTATATTAAATAAATGCATT 

TTTG 755 



Db 1637 

GAAGCAGAAGAAAAAGTCAAGAAAGCATTGGCTCAGGTCTTCTGTGAGGAATCTGGATTG 1696 

Qy 756 T — CCTTTTTTAAAAAAAAAAAAAAA 779 

I I I I I I I I I I I I I I I 
Db 1697 TCACCTTTGCTGCAAAAAATTACGGA 1722



RESULT 4 

US-10-478-914-4/C 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4

; LENGTH: 3210

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-4 

Query Match 43.0%; Score 2041; DB 1; Length 3210; 

Best Local Similarity 35.1%; Pred. No. 4.5e-109; 
Matches 508; Conservative 0; Mismatches 249; Indels 692; 
Gaps 82; 

Qy 2 GCACGAGCGAGTTCCTGTCTCTCTGCCAACGC CGCCCGGAT GGC — 

TTCCCAAA 53 

III I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 15 02 GCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGACTCCCGGGTTCAAGCGATTCTC- 

TT 1444 

Qy 5 4 ACCGCGACC CAGCCG CCACTAGCGT-CGCCGCC— 

GCC 8 8 

III II I I I I III III I I I I I III 

Db 1443 

GCCTCAGCCTCTTGAGTAGCTGGGACTACAGGCGCGCACCACCATGCCTGGCTAATTTTT 13 8 4 



Qy 89 CGTAAAGGAG CTG — 

AGCCGAGCGGGGGCGCCGCC 121 

I I I I I I III I I I I I I I I I I I I 

Db 1383 

GTGTTTTTAGTAGAGATGGGGTTTCACCCTGTTGGCCAAGAGGACCCCCAGCCTTCCCAA 132 4 

Qy 122 CGGGGTCCGGTGGGCAAAAGGCTAC AGCAGGAGCTGATG- 

ACCC 16 4 

I I I I I I I I I I I I I I I I I I I I I III I I I I I 

Db 1323 GCCATAGGGGGCTGGTGGGC — AGGGCGGCACTTAGCTG — 

CCTGGGGCACCCAGTACCT 126 8 

Qy 165 -TCATGATG-TCTGGC GATAAAGGGATTTCTGCCTT — CCCTGAATC 

AG 209 

I I I I I I I I I I I I I I I I I I I I I I I III I 

I I 

Db 126 7 GTCATGAGGCACTTGCCCTGGCCAAAGG — 

TCACTGCCAAGGCACAGCCACTCCAGAGAG 1210 

Qy 210 ACAA CCTTTTCAAATGGGTAG GGAC 

CATC 23 8 

III II I I I I I I I III 

I 

Db 1209 

TCAAGAGGAGCAGAGACTCCAGAGCAGATGGAAGGAGAGTGGGCTTCTGGGGATTGCA-C 1151 

Qy 239 CATGG AGCA GCTGGAACAGTAT AT G AAG AC C T GAG G - 

TATAA 2 79 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1150 

CAAGGAGGAAGCAGAAGGGATGTGCAGCAGCAGAAAAGGAGGAGGACCGGGGGGTTCTGA 1091 
Qy 280 GCTCTC 

GCTAGAGTTC 295 

III I I I I I I I I 

Db 1090 

GGGCTTGGGGTGATGAGGCCAGAGTTCCAAACAGGGTCTGGGTGCAGGATGAGGACAGAA 1031 

Qy 296 CCC— AGTGGCTA 

CCCTTAC 313 

III I I I I I I I I I I I I I 

Db 1030 

GAAAACAGAGAACCCAAAAGGGGCAAGAGGTTGGTCCCTTACTTTCACTCTCCTGGGAGG 9 71 

Qy 314 AATGCGCC-CACAG 

TGAAGTTCCT 336 

I I I I I I I I I III 

I I I 

Db 970 

AGGATGGGTACACCACACAGTTGGATGGTGCATCTTTGCCTGTCTGCACCTGCAGCTCCA 911 

Qy 33 7 CAC GCCCTG CT-ATCACCCCAAC 

GT 360 

III I I I I I I I I I I I I I I I 

I I 

Db 910 

TGGCACCATGGACCCTGTGCTCCGAGTGGTGGTGCCTGATCTCCACCACCAGGCAGCAGT 851 

Qy 361 GGACACC CAGGGTAA CAT ATGCC— 



Db 850 

AGAGGCCGCTATCCAGCAGGGTCAGGTTGCGCATGGTGATGGAGAAGTTGCCATGGTGGT 791 

Qy 388 C ATCC 

TGAAGGAAAAGT 40 4 



Db 790 

CGGAGGCCGACTCCAGCCCGTGGCGCTGAGCCAGGTCGTGGCTGGTGTTGGCAGCCTGGT 731 

Qy 405 GGTCTGC CCTGTAT GATG 

TCAG 42 6 

I I I I I I I I I I I I I I 



Db 730 

GGCCTCCATGGTGCAGGTGAAGGTCCTGGAACGTGAGGTTGCGGATGGGCCGGCGCTCTG 6 71 

Qy 42 7 GACCATTCTGCTCTCCATCC — AGAGC 

C T T C T AGG AG AAC C C AAC AT 471 

I I I I I I I I II I I I I I I I I I I I I I I I I 

I 

Db 670 

AGCAGGTCTGCACCTCGCCCCTCGAGCTGCGGTACCACGTCTTGTAGAAGGTCACATCGT 611 

Qy 4 72 T GAT AG T C C C T T G AAC AC A CATG CTGCCG 

AGCTCTGGAAAAA 513 

I I I I I I I I I I III I I I I I I I I I I 

Db 610 

GCCCTTTGTCCACAGGGCCCAAGAGCCTGCAGGTGAGGGTGACGTTCTG 562 

Qy 514 CCCCACAG C T T T T AAGAAG T AC CTG- 

CAAGAAACC 547 

I I I I I I I I I I I I I I I I I I I 

I I I 

Db 561 

CCCCTCGGGACAGACATACAGGGAATACGGCGTGGCGACCTTGAAGGCTGCCACCGGACC 5 02 

Qy 548 TACTCAAAG CAGGTCACCAGCCAG GAGC CCTG 

ACCCA 58 4 

I I I I I I I I I I I I I I I I I III I I I I 

I I I I 

Db 501 

TGCTCAGAGAGAGGAGAGCCCTGTCACCTGACTGATCCAGTGCTACTCCTGGCAACCCCA 4 42 

Qy 585 G GCTGCCCAGC 

CT 597 

I I I I I I I I I 

I I 

Db 441 

GAGCGGGGAAGAACCTTAGCCTCTGCTACTTCCCAGCCACAGGCCAGGGGCATGTCAGCT 3 82 

Qy 59 8 GTCCT TGTGTC-GTCTTTTTAATTT 

TTCCTTAGATG 632 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 381 

GACCTCCCTAAGACTGCCTTCTGCATTTGTTCACAAACCCCACTCCATCCTAGGCTGTTG 32 2 



Qy 633 GTC TGTCCTTTTTG 

T GAT T T C T G T AT AGG AC 663 

III III II I I I I I I I I I I I 

Db 321 

TAGGGGGAGTCACAGAAGCTTCCCCAACTAGGGGTGCTCCTGGATGCTGGACAGGGCAGG 2 62 
Qy 664 

TCTTTATCTTGAGCTGTGGTATTTTTGTTTTGTTTTTGTCTTTTAAATTAAG 715 

III I I I I I I I II I I I I I I I I II II 

Db 261 GAGAGACTTGG AGGGCAGTGATGCAGTCG- 

GGTGTGTCCCTCTTCCCAGTAGAGTAT 2 06 

Qy 716 CCT CGGTTG 

AG 726 

III I I I I I 

I I 

Db 205 

ATGCAAAACCTGTTGGGTGGGAGGCAGGTTGGGGAGCAGGAAGGCTGTTTCGAACAGCAG 146 

Qy 72 7 C-CCTTGTATAT TAAATAAATGCATT 

TTTGTCCTTTTTTAAAAAAAA 772 

I I I I I I I I I II III I I I I I I I I I 

I 

Db 145 

CTCCTTGTCCTTGGCCGCTCTGTAGGGGCCATCAACACCTCTGCCCCTTCCTACTCCTAG 8 6 

Qy 773 AAAAAAAAA 781 

I III 
Db 85 GACTGAACA 77



RESULT 5 

US-10-478-914-64/C 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64

; LENGTH: 2120
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-64 

Query Match 42.6%; Score 2024.5; DB 1; Length 2120; 

Best Local Similarity 41.8%; Pred. No. 4.2e-109; 
Matches 472; Conservative 0; Mismatches 297; Indels 359; 
Gaps 64; 



Qy 1 GGCAC 

GAGCGAGTTCCTGTCTCTCTGCCAAC 31 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1399 GGCCCAGAGGAGGCCTGGAGCCGCTTACTGT- 

TCAGTGCCAGCTTCATCCTTGGCACTCT 1341 

Qy 32 GCCGCCCGGATGGCTTC CCAAAACCGCGA-CCCA 

GCCGCCACTAGCGTC 79 

I I I I I I I I I I II II I I I I I I I I I I I I I I I I 

Db 1340 

GCCTCTCACACAGCATCGCTCCGGCTCCCAGAGCCCACAGTGGCAGCCGCTGGAGGCTGT 12 81 

Qy 8 0 GCCGCC — GCCCGTAAAG GAGCTGAGC 

CGAGC— 109 

I I I I I I I I I II I I I I II I I I I 

Db 1280 

GGCAGCAGCTGCTGCCCTGCCCAAGGAGGGATGTGGGCTTGGCTCTGACAGTTCAAGCCA 12 21 

Qy 110 GGGGGCGCC GCC CGGGG— TCC GGTG — 

GGCAAAAGGCT — 144 

I I I I I II III III I I I I I I I I I I I 

Db 1220 

G AAAG AGC C T T AG AAG T C AT T C AGG AC AT TCC CAT T T T AC AGAT GAT G AAAT AAGGC T C A 1161 

Qy 145 ACAGCAGGAGC TGATGAC 

CCTCATGATGTCTGGCG 179 

I I I I I I I I I I I I I I I I I I I III 

I 

Db 1160 

GAGAGAGCAGGGGCTTTCAAGAACAATAAAGGTGGAGACTGGAACTCAGGCCTCCTGTCT 1101 

Qy 180 ATAAAGGGATTTCTGCC — TTCCCTGAATCAGACAACCT 

TTTCAAATGGGT 228 

I I I I I I I I I I I I I I I I I I I I II III 

I I I 

Db 1100 GCTGAGGG- 

TGTCTGCCCATGCTCTCTCTCAGAGTTTCTGAGTGGTGGTGTCTGGAGGGG 10 42 

Qy 229 AGGGACCATCCATGGAGC 

AGCTGGAA 254 

I I I I I I I I I I I I I I I I 

Db 1041 

CTGGACACGCAAGGGAGCTGGCCTCGTGTGTGTCTGTCTGTGCCGGGATGTGTCTGTGGC 9 82 

Qy 2 55 — CAGTATATGAA— GACCTGAGGTATAAGCTCTCGCTAGAGTTCCCCAGTGGCTAC- 

CC 309 

I I I I I I I I I I I I I I I II I III I I I I I I I I 

I I 

Db 981 

TTCTGTCTCTGAACGTGCCAGGGGTTTTGTGTCCTGTGTGTGTCTCTCTGTGGGTGCTCC 922 

Qy 310 T TACA ATGCGCCCA CAGTGAAG 

TTCCT 336 

I III I I I I I I I I I I I I I I I I I 

Db 921 

TGTGTGCAGGAACTTGGGGCCATGGCCAGAGAGGGATGGGCCTATTCCTACAATCTTGGG 862 

Qy 33 7 CAC GCCCTG 

CT 347 

III I I I I I 



AGGCCTCCACAGGCTGAGCTGGTGGGGAGGGTCCTGAAGGGTAGAGTGAAGATGGGGACT 8 02 

Qy 3 48 ATCACCCCAACGTGGA CACCCAGGGTAACATATGCCT- 

GGACATCCTGAAGGAAA 401 

I I I I I I I II I I I I I I I I I I I I I I I I I I II 

I I 

Db 801 TTACTCCCAGCGGAGAGGAAGAAACCAG AAGATAGGCTTGGGGCAT — 

GGATGGGAA 747 

Qy 402 AGTGGTCT GCCCTGTATGATGTCAGG 

ACC 43 0 



Db 746 A— 

GGTCTTGGAACTAGGATGGGGCAAGGGAAACAAAGCCCCAAATGGAGTATGGGTCCC 6 89 

Qy 431 ATTCTGCTCTC-CATCCAGAGCCTTCTAGGAGAACCCAAC 

ATTGATAGTCCC 481 

II III I I I I I II III I I I I I I I I I III 

Db 688 TGGAAGTTGACACACTCTTAGCCCTCATGG- 

GGGCCCAGCCCAGTGGGTAACCCCGGGGA 63 0 

Qy 482 -TTGAACAC ACATG CTGCCGAGCT CTGGAAAAACCCCA 

CAGCTTTT 526 

II III I I I I III I I I I I I I I I I I I I I I 

Db 629 

AGTGGTCACATGACGTGACACTTCTCAGCTGATTGCTTGAGATCCCCCAGTGTGGCCTGG 57 0 

Qy 52 7 AAG AAG T AC C T GC AAG AAAC C T -AC T C AAAGC AGG T C -AC C AG CCAGGAGCCCT- 

GACC 582 

I I I I I II II I I I I I I I I I I I I I I I I I I 

I I I 

Db 569 

GCCTTGTCCTTGAGTGAGCCCAGGTTCAAGCCAGGAAGGCTGGCCAGCTGCCCAAGGACT 510 

Qy 583 CAGG-CTGCCCAGCCT-GTCCT-TGTGTCGTCTT — TTTAATTTTTCCTTAGAT- 

GG 633 

I I I I I I I III I I II I I I I I I I I I II II II III 

I I 

Db 509 GAGGCCTTCTCCTCCTCCTCCTCTGTGTCCTCCTCGATCATCTTGGCC — 

AGCTCCCGGG 452 

Qy 63 4 TCTG-TC CTTTTTGTGATTTCTGTATAGGACTC 

TTTATCTTGAGCTGTGGTA 684 

I I I I II II I I I I I I I I I I I I I I I I II 

I I I I 

Db 451 

GCAGCTCCACGTCTCCACCTGCCATCTGGATCTGGCTCTCATCTGTCTCGTTCTTGGGTA 392 
Qy 685 

TTTTTGTTTTGTTTTTGTCTTTTAAATTAAGCCTCGGTTGAGCCCTTGTATATTA 739 

I I I I I I I I I I I I II I I I I I I I I I I I 

Db 391 GCCGGTATTTGTCTCGGAAGTGGCTCCGCAGTGTGGCC-C- 

GCTCTGCCTTCCTCTGTGT 334 

Qy 740 AATAAATGCATTTTTGTCCTT TTTTAAAAAAAAAAAAAAAAAAA 783 

I I I I I I I I I I I I I I I II 

Db 333 GAACTGTGCATCCCGCTCCATCTTCTCTTCCACGAGTTGCTTCTGATA 286



RESULT 6 

US-10-478-914-17 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17

; LENGTH: 2198
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-17 

Query Match 42.4%; Score 2015; DB 1; Length 2198; 

Best Local Similarity 37.3%; Pred. No. 2.1e-108;
Matches 529; Conservative 0; Mismatches 236; Indels 652; 
Gaps 83; 

Qy 2 GCACGAGC GAGTTCCTGTCTCTCTGCCAACGCC GCCCG 

GATGGCTTCC 49 



Db 765 GAACGTGCTAAGCGTACC 

CTCTCTTCCAGCACCCAGGCCAGTATTGAGATCGATTCT 821 

Qy 5 0 C AAAACCGCGACCCAGCCGCCACTAGCGTCGCCGCCGCCCG— 

TAAAGGAG 9 8 



Db 822 CTCTATGAAGGAATCGACTTCTATACCTCCATTA C-CCG- 

TGCCCGATTTGAAGAA 8 75 

Qy 99 CTGA — GCCGAGCGGGGGCGC — CGCCCGGGGTCCGGTGGGCAAAAGGCTAC- 

AGCAG — 151 

I I I I I I I I I I II I I I I I I I I I I I III III 

Db 876 

CTGAATGCTGACCTGTTCCGTGGCACCCTGGACCCAGTAGAGAAAGCCCTTCGAGATGCC 935 

Qy 152 GAGCTG ATGACCC TCATGATGTCTGGCGATAAAGGGATTTCTGC 

CTTCC 200 



Db 936 AAAC T AG AC AAG T C AC AG AT T C AT GAT AT - 

TGTCCTGGTTGGTGGTTCTACTCGTATCCC 994 



Qy 201 C-TGAATCAGA CAAC CTTTTCAAATGGGTAGGGAC- 

CAT 23 7 



Db 995 

CAAGATTCAGAAGCTTCTCCAAGACTTCTTCAATGGAAAAGAACTGAATAAGAGCATCAA 105 4 

Qy 23 8 CC ATGGAGCAGCTG GAACAG 

TAT AT G AAG AC — 26 8 

II I I I I I I I I I I I III II 

Db 1055 

CCCTGATGAAGCTGTTGCTTATGGTGCAGCTGTCCAGGCAGCCATCTTGTCTGGAGACAA 1114 

Qy 269 — CTGAG GTATAA GCTC 

TCGCTAGAGTTCCCC 29 8 

I I I I I II II I I I I I I I I I I I I I 

Db 1115 

GTCTGAGAATGTTCAAGATTTGCTGCTCTTGGATGTCACTCCTCTTTCCCTTGGTATTGA 117 4 

Qy 299 AGTGG CTACCCTTACAATGCG 

CCCACAGTGAA 33 0 



Db 1175 

AACTGCTGGTGGAGTCATGACTGTCCTCATCAAGCGTAATACCACCATTCCTACCAAGCA 123 4 

Qy 331 GT TCC-TCACGCCCTG— CTATCACCCCAAC— 

GTGGAC 36 4 

I I I I I I I III II I I I I I III I 

Db 1235 

GACACAGACCTTCACTACCTATTCTGACAACCAGCCTGGTGTGCTTATTCAGGTTTATGA 129 4 

Qy 365 ACCCAGGGTAACAT- 

ATGCCTGG 386 

II I I I I I I I I I I I I I I 

Db 1295 

AGGCGAGCGTGCCATGACAAAGGATAACAACCTGCTTGGCAAGTTTGAACTCACAGGCAT 135 4 

Qy 387 ACATCCTGAAGGAAAAGTGGTCTGCC-CTGTATGATGTC 

AGGACCAT 432 



Db 1355 ACCTCCTGCA— 

CCCCGAGGTGTTCCTCAGATTGAAGTCACTTTTGACATTGATGCCAAT 1412 

Qy 433 TCTGC 

TC 439 



Db 1413 

GGTATACTCAATGTCTCTGCTGTGGACAAGAGTACGGGAAAAGAGAACAAGATTACTATC 14 72 

Qy 440 TCCA TCCAGAGCCTT CTAGGA GAAC 

CCA 46 7 

II I I I I I I I I I I I I I I I I III 

Db 1473 

ACTAATGACAAGGGCCGTTTGAGCAAGGAAGACATTGAACGTATGGTCCAGGAAGCTGAG 1532 

Qy 46 8 ACATTGAT AGTC — 

C C T T GAAC AC AC AT G — 49 4 

I I I I I I I I I I I I I 

Db 1533 



AAGTACAAAGCTGAAGATGAGAAGCAGAGGGACAAGGTGTCATCCAAGAATTCACTTGAG 1592 

Qy 495 CTGCC GAGC-TCTG GAAAAACCCCACAGC — 

TTTTA- 527 

I I I I III III I I I I I I II II 

I 

Db 1593 

TCCTATGCCTTCAACATGAAAGCAACTGTTGAAGATGAGAAACTTCAAGGCAAGATTAAC 1652 

Qy 52 8 AGAAGTACCTG 

CAAG 542 

I I I I I III 

I I I 

Db 1653 

GATGAGGACAAACAGAAGATTCTGGACAAGTGTAATGAAATTATCAACTGGCTTGATAAG 1712 

Qy 5 43 AAACCTACT-CAAAGCAGG TCACCAGC 

CAGGAG 574 

III I I I I I I I I I I I I I I I I I I I I 

Db 1713 

AATCAGACTGCTGAGAAGGAAGAATTTGAACATCAACAGAAAGAGCTGGAGAAAGTTTGC 17 72 

Qy 575 CC— CTGACCCAGGCTGC CCA 

GCCTG — 598 

II III I I I I III III 

Db 1773 

AACCCCATCATCACCAAGCTGTACCAGAGTGCAGGAGGCATGCCAGGAGGAATGCCTGGG 1832 

Qy 599 TCCTTGTG TCGTCTTTT 

TAATTTTTCCTTAGATGGTCTGTCCTTT 643 

I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 1833 GGATTTCCTGGTGGTGGAGCTCCTCCCTCTGGTGGTGCTTCCTCAG — 

GGCCCACCATTG 189 0 

Qy 644 TTGTGATTTCT GTATAGG ACTCTTT — 

ATC — T 672 

I I I I I I I I I I I I I I I 

I 

Db 1891 

AAGAGGTTGATTAAGCCAACCAAGTGTAGATGTAGCATTGTTCCACACATTTAAAACATT 195 0 

Qy 6 73 TGAG CTGTGGTATTTTT GTTT 

TGTTTTTGT 702 

III I I I I I I I I I I I III II 

I I 

Db 1951 

TGAAGGACCTAAATTCGTAGCAAATTCTGTGGCAGTTTTAAAAAGTTAAGCTGCTATAGT 2 010 

Qy 703 CTTTTAAATTAAGC 

CTCGGTTGA 725 

III III II III II 

Db 2011 

AAGTTACTGGGCATTCTCAATACTTGAATATGGAACATATGCACAGGGGAAGGAAATAAC 2 0 7 0 

Qy 726 GCCCT TGTATATTAAAT AAATGCATTTTTGTC 

CT 759 

I I I I I I I I I Ml I I I I I I I I I I I I 



2 0 71 ATTGCACTTTATAAACACTGTATTGTAAGTGGAAAATGCA- 



ATGTCTTAAATAAAACT 212 7 



Qy 



Db 




RESULT 7 

US-10-478-914-61/C 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61

; LENGTH: 2437

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-61 

Query Match 42.4%; Score 2013.5; DB 1; Length 2437; 

Best Local Similarity 38.1%; Pred. No. 4.7e-108; 
Matches 509; Conservative 0; Mismatches 254; Indels 573; 
Gaps 80; 

Qy 1 GGCACGAGCGAGTTCC TGTCTCTCTGCCAACGCCGCCCGGATGGC 

TTCCC- 50 



Db 132 3 GACATGAGC- 

TGTCCCGGTGTGACTCTCCCCATACTCATCTTGATTGGCAGGTTTTCTCT 12 65 

Qy 51 -AAAACCGCGACC CAGCCGCC — 

ACTAGCGTCGCCGCCGC 87 

I I I I II I I I I III I I I I I I I I 

Db 1264 

GCTTGCCGCTTCCACTAGATGTCTCCGAACTTCCATCACTGCCTCTTTGTGCTTAGTGTT 12 05 

Qy 8 8 CCGTAAAG GAGCT 

GAGCCGAGCGGG 112 



Db 1204 

CAGTAAAGCTTCCCATAGGGCTTTGGCTGTGGTGTCACTGAATTGTGAAAGACAGCCTGG 1145 

Qy 113 GGC-GCCGC CC GGGGTCCGGTGGGC 

AAAAGGCT 144 



I I 



II II I III II 



I I I 



Db 



1144 



TGCAACCACATTATAATTTTCCTCCTCAGTATGGAGTGCAGTGAGCGCTATCATGTTAAC 10 85 

Qy 145 ACAGCA GGAGCTG 

ATGACC 163 

Db 1084 

CATCACATCATTTGTGTGGCCTGGGAGCTGGGGAAGTGCTGAAATGATCTTCTCTACTAA 102 5 

Qy 16 4 CT-CATGATGTCTGGC 

GAT AAAGGGAT T T 191 

I I I I I I I I I I I I III I I I I I I 

Db 1024 

GTTGTCTCCATGATGTCCAACTGCTCCTGTGAGATCCAGGGTTCTGTCCACAAAAACCAC 965 

Qy 192 — CTGCCTTCCCTGAATCAG ACAACCTTTTCAAATGGGTAGGGACCAT— 

CCAT 2 41 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 96 4 TGATGCCCTGCCTGCAGCAGTCTTCTTCCTGTTC 

TTTGCAGGGGCATAATTGGCCA- 9 09 

Qy 2 42 GGAGCAGCTG GAACAGTATATGAAGACCTGAGGTATAA GCTCTCl 

CT 288 

I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 908 - 

GATCCGCAGCGATGACCTGACTTAAGGAACCTACAGCAAAACACTCCTCCCGTACTCCT 850 

Qy 2 89 AGA-GTTC-CCCAGTG-GCTAC — CCTTACAAT — 

GCGCC 321 

I I I I I I I I I I I II I I I I I I I II I 

Db 849 

AAATGTTCACACAGAGAACTGAGGCCTGACACTAGGCATCTGATCTGCAGCAGCAGCTCT 79 0 

Qy 322 CACAGTGAAGTTC CTCACG CCCTGCTATCAC 

CCCAAC— 358 

I I I I I I I I I III I I I I I I II 

Db 789 

GGGGTTAGCGTAGTGGAGTCCACATCACCCAGGCTTCCCAGCTTCCTCTTGTCCGGTCGG 73 0 

Qy 359 GTGGACA-CCCAGGGTAACA TATGC — 

CTGGA 387 

I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 729 

GCGCTATTAAGGAGGTGCACATCCTGGGGTAGCAGTGGGAAAAGGGATGCAAAAGCTGGA 6 7 0 

Qy 388 CATCCTGAAGGAAAAGTGG 

TC 408 

II I I I I I I I I I I I 

I I 

Db 669 -GTC 

AAGGCAAAGTGGGGAGCAACAGGGGCAAGCAATAACGGGACATGGAACACCTC 614 

Qy 409 TGCCCTGT ATGATGTCAGGACCATT CTGC- 

TCTCCATCCAG 448 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 613 GGCCGTGTAGTTCATGTTGCCC-ATCCATTCACACAGCTTCTCC- 

TCCAGCTGCTCGAAC 556 



Qy 4 49 AGCCTTCTAGG AGAACCC 

AACATTGATAG 477 

I I I I I I I I II I I I I II 

Db 555 

ACCGGCTGCTGCCCCTCCATCTCGGCCGCTGCCGCCGCTGGGACATGATTAGCTGTGAGG 496 



Qy 478 TCCC — TTG AACACACA 

TGCTGCCGA — 501 

III III I I I I I I I I I I I I I I 

Db 495 

TGGACAGCGTGGCTCACGGTTGTGACCACCACACAATACTGGAAGTGACTGCGGCAGATG 436 

Qy 502 — GCTCTGGAAAAACCCCA CAGCTTTTAAGAAG 

TAC- 535 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 435 

ATGTCCCGTAGGATCTCCACGGTCCGGCCTTTCAGCAGGCAGCTCAGCACAAACACTGCC 3 76 

Qy 536 CTGCAAGAAACCTACTCAAA 

GCAGGTCACCAGCC 569 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 3 75 TTGGGCTGCTTGGCTCCACCACCAATTGCGTCGGGCTCGAACTCTCGCAGGTGA- 

CAGTC 317 

Qy 570 AGGAGCCCTGAC CCAG GCTGCC 

CAGCCTGTC 600 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 316 

AGGGCCCCCCACCGCCTCCAGGAGACGGGTGGATCCGCAGCCCCAGTGCAGGCTCTCGGC 257 

Qy 601 -CTTGTGTCGTCTTTTTAA TTTTTCCTTAGATGG — TCTG 

TCC 640 

I I I I I I I III I I I I I I I I III 

I I I 

Db 256 

GCAGGCGGCGTCCAGGTAAACCACAGCCCGTTTCACTTTGGCCAGCACCTGCTCCCATCC 19 7 

Qy 641 TTTTTGTG ATTTCTGTAT AGG-ACTCTTTAT 

CTTGAG- 6 76 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 196 

TTGCTGGGTAAAGGACAGTACGCCCGAGGCGCTCATGGTTGGGGATTCGCAGACTTGGGA 13 7 

Qy 677 — CTGTGGT ATTTTTGTTTTGTTTTTGTCTTTT 

AAATTAAGCCTCGG 721 

II III I II II I I I II I I I I I II III 

Db 136 

AACTACGGTGCAGGAACTTCTTTCAGAACTCACCGCTTCCGGAAATTGGGCTCCGGGAGA 7 7 

Qy 72 2 — TTGA GCCCTTGTATATTAAATA AATGC 

ATTTTTGTCCTTT 761 

I I I I II I I I I III I I I I I I I I I 

I 

Db 76 CTTTGACAGTCTCCACAGTACACCCAGTAGGCCAACAGTGCTCGAGGAATTCCG- 

CAGCT 18 



Qy 762 TTTAAAAAAAAAAAAA 777 



Db 17 TTTAGAGCAGAAGTAA 2



RESULT 8 

US-10-478-914-21 

; Sequence 21, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 21

; LENGTH: 3589
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-21 

Query Match 42.0%; Score 1995.5; DB 1; Length 3589; 

Best Local Similarity 41.4%; Pred. No. 6.5e-106; 
Matches 501; Conservative 0; Mismatches 263; Indels 445; 
Gaps 74; 

Qy 2 GCACG-AGCGA GTTCC TGT-CTCTCTGCC — AAC — 

GCCGCCCGGATGGCTTCC 49 



Db 25 

GCACACTGCGAGGGGCTCCGTGTGTGCGCCCTGTCTTGTCTGGCCGTCCTCATCCCTTCC 8 4 

Qy 5 0 CAAAACCGCGACCCAGCCGCCACTAGCGTCGCCGCCGCCCGTAAAGGAGCT- 

GAG 103 



Db 85 C CCAC-CCCCTGCCGCCACTC-CGAC-CCGCT-CCC— 

AAAGTGGCTTCACAATAG 13 4 

Qy 104 CCG AGCGG GGGCGCCGCCCGGGGTCCG GTGGGCAA 

AAGG 142 

II I I I I III II I I I I I I I I III 

Db 135 

TCGGTCCTCGGCGGTGTAGGCTGCGCACCAGGTCCACACTTAAGCGAAATCAAGGAGACC 19 4 

Qy 143 CTACAGC 

AGGAGCTGATGACCCTCATG 169 

I I I I I I I I I I I I I I 

Db 195 

CCCTTCTACTTCTACCTTTGGGTTTGGTGCTCAATGCGAAGCTGCTGCAACTCAGACACG 2 5 4 



QY 



170 ATGTC TGGCGATAAAGG GATTT- 



-CTGCCTTC- 



CCT 202 



Db 255 

CCTAAGTCAACTCATGCAGAAAAAGGAGAAAAGTTTTGGTATACAAATGCTCTCAGTCCA 314 

Qy 2 03 GAATC AGAC AACC T T T TCAAATGGGT 

AGGGACC A 236 



Db 315 GC— CAGAC- 

ACCAAGCCGAAAGGTTGTGCTGGCTGCAACCGAAAGATCAAGGACCGGTA 3 71 

Qy 23 7 TC— CATGGAGCAGCTGGAACAGTA TATGAAGA CCTGAGGTAT 

AA 279 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3 72 TCTTCTAAAGGCA- 

CTGGACAAATACTGGCATGAAGACTGCCTGAAGTGTGCCTGCTGTG 43 0 

Qy 280 GCTCTCGCTAGAG TTC-CCCAGT GGCTACCCTTA — 

CAATG-C 318 



Db 431 

ACTGTCGCTTGGGGGAGGTGGGCTCCACCCTGTACACTAAAGCTAATCTTATCCTTTGTC 49 0 

Qy 319 GCC CACAGTGAAGTTCC TCAC-GCCCTGCT 

ATCA 351 



Db 491 

GCAGAGACTATCTGAGGCTCTTTGGTGTAACGGGAAACTGCGCTGCCTGTAGTAAGCTCA 550 

Qy 352 CCCCAAC GTGGACA CCCAGGGTAACATAT GCCTGGAC- 

AT 390 

III I III I I I I I I II II I I I I I I I 

Db 551 

TCCCTGCCTTTGAGATGGTGATGCGTGCCAAGGACAATGTTTACCACCTGGACTGCTTTG 610 

Qy 391 CCTGAAGGAAAAGTGGTCTGC-CCTGTATGATGTCAGGACCA — 

TTCTGCT 438 

III I II III I I I I I I I I I I I I I I I 

Db 611 CATGTCAGCTTTGTAATCAGAGATTTTGTGTTG — 

G AG AC AAAT T T T T C C T AAAG AAT AA 66 8 

Qy 439 CTCCAT — CCAGA — GCCTTCTAGGA 

GAACCCAA 46 8 



Db 669 

CATGATCCTTTGCCAGACGGACTACGAGGAAGGTTTAATGAAAGAAGGTTATGCACCCCA 72 8 

Qy 469 CATTGATAGTCCCTTGAACA CACAT 

GCTGCCGAGCTCTGGA 509 

II I I I I I I I I I I I I I I I I I I I I 

Db 729 

GGTTCGCT GAT C T AT C AAC AT C AC C CC AT T AAGAAT AC AGAGC AC T AC AT TCTTTTATCT 7 8 8 

Qy 510 — AAAACCCCACA-GCTTTTAAGAAGTACCTGCAAGAAACCTACTCAA- 
AGC 55 7 



Db 789 TTTTTGCTC C AC AT G T AC AT AAG AAT T G AC — 

AC AGG AAC C T AC T G AAT AGC G TAG AT AT 8 46 

Qy 55 8 AGGTCACCAG 

CCAGGAGCCCTGACCCAGGCTG 589 

III III I I I I I I I I I I I 

Db 847 

AGGAAGGCAGGATGGTTATATGGAATAAAAGGCGGACTGCATCTGTATGTAGTGAAATTG 9 06 

Qy 590 -CCCAG CCTGTCCTTGTGTC — 

GTC 611 

I I I I I I I I I I I I I 

Db 907 

CCCCAGTTCAGAGTTGAATGTTTATTATTAAAGAAAAAAGTAATGTACATATGGCTGGAT 966 

Qy 612 TTTTTAATTTTTCCTTAGATGGTCTGTCCTTT 

TTGTGATTTCTGTATAGGACTCTT- 66 7 



Db 967 

TTTTTTGCTTGCTATTCGTTTTTGTGTCACTTGGCATGAGATGTTTATTTTGGACTATTG 1026 

Qy 66 8 — TATCTTGAGCTGTGGTAT 

TTTTGTTTTGTT 697 

I I I I I I I I I I I I I I I I I I 

Db 1027 

TATATAATGTATTGTAATATTTGAAGCACAAATGTAATACAGTTTTATTGTGTTACCATT 10 86 

Qy 69 8 TTTG — TCTTTTAAATTAAGCC- 

TCGGTTGAGCCCTTGTATAT 737 

I I I I I I I I I II II I II I I I I I I I 

Db 1087 

TGTGTTCCATTTGCTTCTTTGTATTGTTGCATTTAGTACAATCAGTGTTTAAACTTACTG 1146 

Qy 73 8 TAAATAAATGCATT TTTGTC — CTTTTTTAAA 

AAAAAAA 774 



Db 1147 

TATATTTATGCTTTCTGTATTTACCAGCTATTTTAAATGAGCTGTAACTTTCTAGTAAAG 12 06 

Qy 775 AAAAAAAAA 783 

II I I I I 
Db 1207 AATTGAAAA 1215 



RESULT 9 

US-10-478-914-3/C 

; Sequence 3, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 



; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 3

; LENGTH: 2083

; TYPE: DNA

; ORGANISM: Homo sapiens



Query Match 41.7%; Score 1983.5; DB 1; Length 2083;

Best Local Similarity 36.0%; Pred. No. 1.4e-106;
Matches 517; Conservative 0; Mismatches 256; Indels 663;
Gaps 80;



Qy 2 GCACGAGCGAGTTCCTGTCTCTCTGCCAAC GCCGCCCGGAT — 

GGCTTCCCAAAACC 56 



Db 1791 

GCACTGGCATGATCTCGGCTCACTGCAACCTCTGCCTCACGAGTTAGGTCAAGCAATTCT 1732 

Qy 57 GCGACC-CAGCCGCC ACTAG C — GTC GCCGCC — 

GCCCG 9 0 

I I I I I I I I I I I I I I I III I I I I I III 

Db 1731 

TCTGCCTCAGCCTCCCGAGGAGCTAGGATTACAGGTCCCTGCCACCATGCCTGAGTAATT 1672 

Qy 91 TAAAGGAGCTG AGCCGAGCGGGG-GCG- 

CCG 119 



Db 1671 

TTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAAGCTGGTCTCGAATTCCTG 1612 

Qy 12 0 CCCGGGGTCCGGTGGGCAAAAGGCTACAGC AGGAGCTG 

ATGAC 162 



Db 1611 ACC 

TCAGGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACGGGCGTGAG 1556 

Qy 163 CCTCATGATGTCTGG CGATAAAGGGATTT — CTGCCTT 

CCCTGAA 2 05 

III I I I I I I I III III I I I I I I 

Db 1555 CCAC— 

GGTGCCTGGCCTCTATTTTTTATTTTAAATACCTTTTGCCCTGAAATCTATGTC 149 8 

Qy 2 06 -TCAGA CAAC CT TTTCAAATGGGTAGGGA 

CCATCC 239 



Db 1497 

CTCTGATTATCAACACTGCTTACCAGTTTTATTTTTGTTGGTATTTATTTCATTCAATCC 143 8 

Qy 2 40 ATG— GAGCAGC TGGAACAGTAT ATGAAG- 

AC 268 

I III I I I I I I I I I I I I 



Db 1437 

CAGCTCTACAACTTACCAGTATAAAAATTTGGATCAGTTACTTAATCTTGCAGTTTCTTC 13 7 8 

Qy 269 CT GAGG TATAAGCTCTCGCTAGAGTTCCCCAGT 

GGCT 305 



Db 1377 

CTTTTAAAAATGGAATAATATTATCTATCATATAAAATGCCCAGTTAACAAATATGAACT 1318 

Qy 3 06 -ACCCTTACAATGCGCC CAC AGTG AAGTTCC 

TCA 338 



Db 1317 

AACACCTACAATGTGCCAGGCACCATTCTAGGGAATAGAATGTAAATAAGTTCAAGTTCT 12 5 8 

Qy 339 CGCCCTG 

CTATCAC 352 

I I I I I I I I I 

Db 1257 

T G AC C T GAATAAG TTTACATTTTAGT G AG AAG AC AG AC AAT AAAT C AAT AAC AT GTCCAT 1198 

Qy 353 CCCAACG 

TGG— 362 

I I I 

Db 1197 

TATGCCAGATAGTTTTAAATGTCACAAAGAAAAAGGGGGAGCAATCAGATAGTGATGGAA 1138 

Qy 363 — ACACCC AGGG 

T AAC AT 3 78 

I I I I I I I I I I I I I I 

Db 1137 

GTTACACTCTAGAGAGGGTGGACAGAAAAAGTTTCTCTTAGGAGATGACATCTGAACAGA 10 7 8 

Qy 3 79 ATGCCTGG ACATCCTGAAGGAAAAGTGGT 

CTGC- 411 



Db 1077 

GATCTAAATGACTGAGTACAAACCCCAATGGAAAAGTGATTGGTGTGTTGTGAAACAGCA 1018 

Qy 412 CCTGTAT GATGTCAGGACCATTCTG-CT CT 

CCATC- 445 



Db 1017 

ACACAGCCAGTTTGACTAGATGAGTAGATCAGAATGACTGAGGTGGGACTGTGGTCATCA 95 8 

Qy 4 46 CAGAGCCTT C T AGGAGAAC C - 

CAACATTGATAGTCCCTTGAACACAC 491 

I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 957 AAGAGATCAGAGAGGTAGCCTGGGAAGGCCACATCACAGAATGTCCTTTGGGC- 

CTCTGT 899 

Qy 492 ATGCTG— CCGAGC 

TCTGGAAAAACCC— 516 

I I I I I I I I I I I I 

Db 898 

AAGAGCGTTTTGAAAAAACAGTTCTGAGCAAAGGTTAGTCATTTAACTTGACTTACTCTT 839 



Qy 517 CAC AG C T T T T AAG AAG - T AC C T GC AAG AAAC C T - 

ACT 551 



Db 838 

TAGAAGGATCACTTTGGCTATTGAGTGGCAAACAACTGGAGAAGTGGTTAGGGGAGCACT 7 79 

Qy 552 CAAAGCAGG 

TCACCAGCCAGGAGCCCTGACCCAGGCTGC 590 

I I I I I I I I I I I I I I I I II II 

Db 778 

GTGGGCAGGAATAGAAGGAGGAAAAGCAGACTGGAGCCTAATCCAGTACTTCAGATGAGA 719 

Qy 591 CCAG CCTGTC CTTGTGTC 

GTCT 612 



Db 718 

AATGACAGATTTGCCTGGTACATCATTTCCTATCCCTTTACTTATAACTTATTGGTGTAT 659 

Qy 613 TTTTAATTT TTCCTTAGA TGGTCTGTCC- 

TT 642 

I I I I I I I I I I I I I I I I I I I I 

Db 658 

TTTTATTT T AAG T G T AAAT T T T T AT AAAT G AAGC AT AAT T T G AAG AT T GAT AT G T T C G G A 599 

Qy 6 43 TTTGTGATTTCTGT AT AG - G AC T C T 

TTATC 671 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 598 

CTTGTGTTTTCTATTATCATGGTGTCTCTCCACATACCTTTTCCCTTTAGTTCTGCTTCT 539 

Qy 6 72 — TTGAGCT GTGGTATTTTTGTT 

TTGTTTTTGTCT- 704 



Db 53 8 TATTGTGCTCCTCAAGT- 

CTATTTTTGTTCTTTTCCCTCTACTTCTACTGTTTTGAACTG 48 0 

Qy 705 TTTAAATTAAGCCTCG- 

GTTGAGCCCTTGTATATTAAAT 742 

III III III I I I I I I I I I I I I I I 

Db 479 

TATATGCTATTTCTATTTTGCCAGGAGTTACCCCCTTCAATTTTCGTTGTACTCTCTCAG 42 0 

Qy 743 AAATGCAT TTTTGTCCTTTTTTAAAAAAAAAAAAAAAAAA 

782 

III III I I I I I I I I I I I II I III 

Db 419 TCTCAAACACATACCCATGCTCTCTTCTGTCCTTCTCTTCATATCATCTTCTAGAA 

364 



RESULT 10 

US-10-478-914-17/C 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 



; FILE REFERENCE: 7388-80899

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17
; LENGTH: 2198
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-17 

Query Match 41.7%; Score 1980; DB 1; Length 2198; 

Best Local Similarity 35.8%; Pred. No. 3.3e-106; 
Matches 498; Conservative 0; Mismatches 281; Indels 614; 
Gaps 69; 

Qy 1 GGCACGAGCGAGTT CCTGTCTCTCTGC 

CAAC 31 

I I I I I I I I I I I I I I I I I I I I III 

Db 1594 

GACTCAAGTGAATTCTTGGATGACACCTTGTCCCTCTGCTTCTCATCTTCAGCTTTGTAC 1535 

Qy 32 GCCGCCCGG ATGGCTTCC 

CAAA 53 

II I I I I I I I I I I I I I I I I 

Db 1534 

TTCTCAGCTTCCTGGACCATACGTTCAATGTCTTCCTTGCTCAAACGGCCCTTGTCATTA 14 75 

Qy 54 ACCG CGACC- 

CAGCCG 6 8 

III I I I I I I I I 

Db 1474 

GTGATAGTAATCTTGTTCTCTTTTCCCGTACTCTTGTCCACAGCAGAGACATTGAGTATA 1415 

Qy 69 CCACTAGCGTC GCC GCCGCC-C— 

GTAAAGGAG — 9 8 

I I I I I I I I II I I I I II 

Db 1414 

CCATTGGCATCAATGTCAAAAGTGACTTCAATCTGAGGAACACCTCGGGGTGCAGGAGGT 1355 

Qy 99 CTG AGCCGAGCGGG GGCGC- 

CGCC- 121 

III I I I I I I I I I I I I 

I I I I 

Db 1354 

ATGCCTGTGAGTTCAAACTTGCCAAGCAGGTTGTTATCCTTTGTCATGGCACGCTCGCCT 12 95 

Qy 122 -CGGGGTCCGG — TGGGCAAA — AGGCT ACAGC — 

AGGAGCTGATGACC C 16 4 

I III I I I I I Mill III III I I I I 

I 

Db 1294 

TCATAAACCTGAATAAGCACACCAGGCTGGTTGTCAGAATAGGTAGTGAAGGTCTGTGTC 1235 



Qy 165 T - C AT GAT — G T C TGGCGAT AAAGGGAT T T C T 

GCCTTCCCTGAATCAGACAACCTTT 218 



Db 1234 

TGCTTGGTAGGAATGGTGGTATTACGCTTGATGAGGACAGTCATGACTCCACCAGCAGTT 1175 

Qy 219 TCAA ATGGGTAGGG ACCATCCATGGAGCAG 

CTGGAACAGTATATGA 26 4 



Db 1174 TCAATACCAAGGGAAAGAGGAGTGACATCCA- 

AGAGCAGCAAATCTTGAACATTCTCAGA 1116 

Qy 265 AGACCTGAGGTAT AAGCT-CTC GCTAGAG- 

TTCCCCAGT 301 



Db 1115 

CTTGTCTCCAGACAAGATGGCTGCCTGGACAGCTGCACCATAAGCAACAGCTTCATCAGG 1056 

Qy 302 G — GCTACCCTTA — CAATGC GCC 

CACAGTGAAGTTCCTCACGC — 3 41 

I I I I I I I I I I I I II I I I I I I I 

Db 1055 

GTTGATGCTCTTATTCAGTTCTTTTCCATTGAAGAAGTCTTGGAGAAGCTTCTGAATCTT 996 

Qy 3 42 CCTGCTATCACC- 

CCAACGTGGACAC 366 

I II I I I I I I I I I I I I I 

Db 995 

GGGGATACGAGTAGAACCACCAACCAGGACAATATCATGAATCTGTGACTTGTCTAGTTT 936 

Qy 367 

CCAGGGTAACATATGCCTGGACATC 391 

I I I I I I I II I I I I 

Db 935 

GGCATCTCGAAGGGCTTTCTCTACTGGGTCCAGGGTGCCACGGAACAGGTCAGCATTCAG 8 76 

Qy 392 

CTGAAGGAAAAGTGGT 40 7 

I I I I I I I I II 

Db 875 

TTCTTCAAATCGGGCACGGGTAATGGAGGTATAGAAGTCGATTCCTTCATAGAGAGAATC 816 

Qy 40 8 CTGCCCTGTATGATGTCA GGACCATTCTGC 

TCTCCATC 445 

I I I I I I I I I I I I II I I II II 

Db 815 

GATCTCAATACTGGCCTGGGTGCTGGAAGAGAGGGTACGCTTAGCACGTTCACAAGCAGT 756 

Qy 4 46 -CAGAGCCTTCTAGGAGAACCC AACATTGATAGTCC 

CTTGA 485 



Db 755 ACGGAGGCGTCTTACAGCTCTCTTGTTCTCACTGAT- 

GTCCTTCTTATGCTTGCGCTTAA 697 

Qy 486 ACACA CATGC 

TGCCGAGCTCTGGAAAAACC 515 



I I 1 1 1 1 1 I 1 1 I I 1 1 1 III 

Db 696 

ACTCAGCAATAAAATGGTTGACCATTCGGTTGTCAAAATCTTCTCCACCCAAGTGGGTGT 63 7 

Qy 516 CCACAGCTTTTAAGAAGTACCTGCAAGAAACC-TACTCA 

AAGCAGGTCACCAGC 56 8 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 636 CTCCAGCTGTAGACTTG- 

AC C T C AAAG AT TCCATCCT C AAT AGTGAGGAT TGAC AC ATC A 578 

Qy 569 CAGGAGCCCTGACCCAGG CTGC-CCAGCC 

TGTCC 601 

I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 577 

AAAGTGCCACCTCCCAGGTCAAAGATGAGCACGTTTCTTTCTGCTCCAACCTTTTTGTCT 518 

Qy 602 TTG 

TGTCGTCTTTTTAATTTTTC 624 

I I I I I I I I I I I I I I 

Db 517 

AAGCCGTAAGCAATAGCAGCAGCAGTTGGCTCATTAATAATTCTAAGTACATTGAGACCA 45 8 

Qy 625 CTTAGATGGTCTGTC-CTTTTTGTGATTTCTGTATAG — 

GACTC 665 

I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 457 GCAATAGTTCCAGCATCTTTGGTAGCCTGACGCTGAGAGTCATTAAAGTA- 

AGCTGGCAC 399 

Qy 666 TTTATCTTGAGCTGTGGTA 

TTTTTGTTTTGTTTT 699 

III I I I I I I I I I I I I I I 

I I 

Db 398 

TGTGACCACAGCATTGGTAACAGTCTTCCCAAGGTAGGCTTCTGCAATTTCCTTCATCTT 339 

Qy 700 TGTCT TTTAAATTAAGCCTC 

GGTTGAGCCCTTGTATAT 73 7 

I I I I I I I I I I I I III I I I I I I I I I 

Db 338 

TGTCAGAACCATAGAAGACACCTCCTCTGGATAGAAGCTTTTGGTCTCTCCCTTGTATTC 2 79 

Qy 73 8 TAAATAAA TGCATTTTTGTCC 

TTTTTTAAAAAA 770 

I I I I I I I I I I I I I I I I I I 

Db 278 

TACTTGGACCTTGGGCCTGCCAGCATCATTCACCACCATAAAGGGCCAATGTTTCATATC 219 

Qy 771 AAAAAAAAAAAAA 783 

II I I I I 

Db 218 AGACTGGACAACA 206



RESULT 11 
US-10-478-914-62 

; Sequence 62, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 



; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 

; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 62

; LENGTH: 1946
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-62 

Query Match 41.7%; Score 1979; DB 1; Length 1946; 

Best Local Similarity 38.5%; Pred. No. 1.9e-106; 
Matches 490; Conservative 0; Mismatches 270; Indels 514; 
Gaps 76; 



Qy 3 CAC GAGCGAGTT — 

CCTGTCTCTCTGCCAACGCCGCCCGGATGGCTTCCCAAA 53 

III III III I I I I I I I I III I I I I I I 

I 

Db 29 4 CACATCTCTTGGGCCTGTTTACCTG-CGCTC — CCAGAGCCTCC — 

GCCAGCATCCCAGA 3 48 



Qy 54 ACCGCGA — CCCAGC CG 

CCACTAGCGTCGC 81 

I I I I I I I I I II I I I I I I I 

I 

Db 349 

ATCTCCATCCCCATCTCTCACTTATACACACATCAGTCATCGGTTATCCATTAGCTAAAC 4 0 8 

Qy 82 CGCCGCCCGTAAAGG AGCT 

GAGC 104 

I I I I I I I I I I III III 

Db 409 

CGCCTTCCTTAATAGCTTTACACTGTTTGCTTTCTCTGGAACATTTTTAGTTAAAATTTC 46 8 

Qy 105 — CGAGCGGGGGCGCC GCCCGG— GG 

TCCG G 131 

I I I I I I I I I I I I I I I 

I 

Db 469 

ATAATGCAGTTGCACACAAATGAAGACACAGATGGCTGCATCCTCCGTCTCTTCCCCTCG 52 8 

Qy 132 TGGGCAAAAGGCTACAG — CAGGAGCTGATG ACCCTCATGAT 

GTCTG-G 177 

I II I I I I I I I I I I III I I I I I I I I I III 

I 

Db 529 

TTTACAGGAAGCTGCGGATCAGGGAGGGGTGTTAGGGTTACCCACATGGTAAGGGCAGAG 5 8 8 



Qy 178 CGATAAAGGGATTTCTGCCTTCC- 
CTGA 204 

I I I I I I II I I I I 



Db 589 

ACAAGAGGGGACCCCAGTTTTCCATGCTGCACATGGTCATTGCTGGGGACTGAGGTTTGC 6 4 8 

Qy 2 05 — AT C AG AC AAC C T T TTCAAATGGGTAGGGACCATCC ATGGAGC- 

AGCTG 251 

I I I I I III II I I I I III II I I I I I I I 

I I I I I 

Db 6 49 ACATCACCCTGCCCTGTTCTCCCTCCGCTGGG — 

GAGAAAGTCAGGGATGGAGCAAGCTG 706 

Qy 2 52 GAACAGTATATGAA GACCTGAGGTATAAG- 

CTCTCGCTAGAGTTCC 296 

III I I I I I I I I I I I I I I I I I I I I I I I 

Db 70 7 CAGCATCTTCTGAAAAAGAAAAGGTGGCCTTGTCTCCAGGTCTCCC- 

CTCAAGTCCCACC 765 

Qy 29 7 — CCAGTGG CTACC-CTTACAATG CGC— CCAC 

AGTGAAGTTCC 335 

I I I I I I I I I I I II I I I I I I III 

I I I I 

Db 766 

TTCCCATAGTTTTCTGCCACTTCTCTTGATTTTCCTCTCTGCCACTTCTCTTGATTTTCC 825 

Qy 336 TCAC-GCC-CTGCTAT CACCC— CAAC GTGG 

AC AC CC AG 3 70 

I I I I I I I I I I I I I I I I I I I I I III 

Db 826 

TCTCTGCCACTTCTCTTGATTTTCCTCTCTGCAGCTGCTTTGAGGTGGGTTTTCTCCAGA 8 85 

Qy 3 71 GGTA-ACATATGCCTG GAC ATCCTGAAGGAAAAGTGGTCT 

GCCCTGTA 417 

I I I I I I I I I II I I I I I III III III 

I I 

Db 886 

TGCACACTTTCCCCTGCTTTGCGTCCTTATTCTGGTAGAAGCACAATCTAAAGCTCATTA 9 45 

Qy 418 T GAT G T C AGG AC C AT T C T GCTCTCCATCCAG — AGC- 

CT 453 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 946 

AGGGAACTAATCAATTCTGTGCATGGCGCTAGCTCAGCAGATCACCACACAGGCAGCACT 10 05 

Qy 45 4 TCTA GGAG AACCCA ACATTGATAGT CCC 

TTGA 485 

II III I I I I I I I I I I I III I 

I I 

Db 1006 

ATTAGCAAGTCGGTGCTTAACACATGGCACTTCCATGAATCGATATGGAGCCCGTGTAGA 1065 



Qy 486 ACAC-ACATG CTGCC GAGCT 

CTGGAAAAACCCCAC 519 

III I I I I I I I I III I I I I I I I 

I 

Db 1066 

ACAAGGCATGGGTTTTTTTCTCTTCCCATTAAGAAAAACTGATGCCAAAAATAACTTCTC 112 5 

Qy 52 0 AG CTTTTAAGAAG 

TACCTGCAAGAAACCTA 5 49 

II I I I I I I I I I I I I I I I I I I I 



I 

Db 1126 AG AT AT T T T C AAGT ATGAC T T T T ATGAAGGGAAAAAGC AT T T T T G T T T GC AA- 

AATCATG 1184 

Qy 55 0 C-TCAAAGCAGGTCACCAG- 

CC 577 



Db 1185 CTTCAGTGCAGG 

CCAGTTGTGAATTGTGATGGCTTTTATTTCTCCTGGGGCTGTAAC 12 41 

Qy 578 T 

GACCCAGGCTGCCCAGCCTGTC 600 

I I I I I I I I I I I I I 

Db 12 42 TTTAAGGGTTTAGAATTTGGAACCA — 

C AGC C T AGC T AAT CAT G AC AC AC AC AC AC AC AC 1299 

Qy 601 CTTG TGTCGTC- 

TTTT 615 

III III 

Db 1300 

ACACACATGCACACACATACACATACACAAAGCATCACGAAGAACCATACAAATTGTACA 1359 

Qy 616 TAATTTTTC-CTTAGATGGTCTGTCCTTTTTGTGATTTCTGTATAGGACT 

CTTTATC 671 



Db 136 0 T T AT T T T AC AC ATGGAGGC TC AC TC T AAAAT - 

AG AT AC C AT T T T AAAT AT T AAC T AAAAC 1418 

Qy 6 72 TTGAGCT-GTGGTATTTT 

TGTTTTGTTTTTGTCTTTTAAATTAAGC 716 
I I I I I I I I I I I I I 
Db 1419 TTGTGCTCATTGTATGTTCATTCTATATGTACTGATTTGTTATTCACATTT- 

CTTTCAAA 1477 

Qy 717 CTCGGTTG 

AGCCCTTGTATATTAAATAAATGCATTTTTGTCCTTTTTTAAAAA 769 



Db 1478 ATCGTTCAAATTTCTAGCCCACATCAATTAAATA- 

TTAATAGTACTTTCTACAAAC 1532 



-CCAGGAGC 



Qy 770 AAAAAAAAAAAAAA 783
I I I I I I I

Db 1533 ATGAGCGCACAAAA 1546



RESULT 12 

US-10-478-914-60/C 

; Sequence 60, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 
; PRIOR APPLICATION NUMBER: PCT/JP02/05294 
; PRIOR FILING DATE: 2001-05-30 



; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 60
; LENGTH: 2022
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-60 

Query Match 40.9%; Score 1943; DB 1; Length 2022; 

Best Local Similarity 39.8%; Pred. No. 4.3e-104; 
Matches 487; Conservative 0; Mismatches 271; Indels 466; 
Gaps 68; 

Qy 2 GCACGAGCGAGTTCCTGTCTCTCTGCCAACGCCGCC- 

CGGATGGCTTCCCAAAACCGCGA 6 0 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 19 48 GCAATGGCGCGATCTCGGCTCACTGCAACTTCCGCCTC — CTGGGTTC 

AAGCTATTC 189 4 

Qy 61 CCCAGCC GCCACTAGCGTCGCCG CCGCC-CGTAAAGGA — GC- 

TGA 102 

I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1893 TCCTGCCTCAGCCTCCTGAGTAGCTGAGGTGAGAGGACTGACGCG- 

AGAGGATTGCTTGA 1835 

Qy 103 GCCGAG CGGGGGCGCCGC CCGGGGT 

CCGGTGGGCAAAAGGCT 144 

I I I I I III III I I I I II I I I I I I I I I 

Db 1834 

GCCTAGGAGGTTGAGGTTGCAGTGAGCTGTGTTCACGCCACTGGACTCCAGGCTGGGTGA 17 75 

Qy 145 -ACAGCAGGAGCTGATGACCC TCATGATGTCTGGCGAT- 

AAAGGGATTTC 192 

I I I I I I I I I I I I II I I I I I I II 

I 

Db 1774 CAAAGCAAGACC — 

CTGTCT C AAAAAC AAAAAC AAAC AAAC AAAC AAAC AAAAAAAT AAC 1717 

Qy 193 TGCC TTCCCTGAAT C 

AG AC AAC 214 

I I I I I I I I I I I I I I 

Db 1716 

CAACAAAAAAAACCCTGAGTTTAAGTTTGATTTTAAGACCTGTAGGGACAGCAGATTAGT 165 7 

Qy 215 -CTTTTCAAATGGGTAGGGACCATC 

CATGGAGCA 2 47 

II I I I I I I I I I I I I I I I I I I I II 

Db 1656 

TCTACTTTAATGGGTATGCATCATCTGGAAAGCAAGGGTCATAAAAAGAAGATACATCTT 159 7 

Qy 2 48 -GCTGGAACAGT ATATG AAGACCTGAGGTATA 

AGCTCTCGCT 288 

I I I I I I I III I I I I I I I I I I I I I I 



CACTGGCACCATCACATCAGTCATCAGTCCTTAGATACAGAACCTTTACTGACTCCCTCA 153 7 

Qy 2 89 AGAGTTCC CCAGTGGCTACCCTTACAATGCGCCCACAGT 

GAAGTT 333 

I I I I I I III I I I I I I I I I I I I I I III 

I I 

Db 1536 AAACTTCCATGCTTCCA — 

AACTAACCAAGCAGTTCTTCTACATTGGGATCATCGAATTT 1479 

Qy 33 4 CC TCACGCCCTGCTATCACCCCA 

ACGTGGACACCCAGGGTAACATA 3 79 

II II I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1478 CCCTTCATGTTCCAACTCTCCTCTAGCTTCAATATACACATAGACAACCTTAG — 

CCATC 1421 

Qy 380 TG CCT GGACA-TCCTGAAGG 

AAAAGTGGTCTGCCCTGT 416 

II III I I III II I I I I I I I I I I I 

Db 1420 

TGAAAACCTACTTTTAGAATATTCCCGAGTGGTTCCACGTAAGTCCTCTGCCAGATTCAC 1361 

Qy 417 ATGATGTC 

AGGACCATTCTGCTC 439 

III II I I I I I I I I I I I 

Db 1360 

ATACCCATAAATTCTACTTCATAAGCCCATACTGTTCTTAAAAAAGAAATAGAAAAAATA 13 01 

Qy 440 — TCCAT CCAGAGCCTTCTAGGAGA 

ACCCAA 46 8 

I I I I I I I I I I I I I I I I I I 

Db 1300 

TATCAATATAACTTTAAAATAAGACACAAACCTTCATGAAAATATAGTCATCAAATTTAT 1241 

Qy 469 CATTGATAGTCCCTT GAACAC 

ACATGC 495 

I I I I I I I I I I I I I I I I I I I I 

Db 1240 

TTTCATTGGGATGCCATTTTTTGAAGAATTCCTAAGACTAATGTTTCTTGACATGCAAGA 1181 

Qy 496 TGC CGAGCTCTGG AAAAAC C C C AC AGC T T T T AAG AAG T AC — 

CTG 538 

II I I I I I I I I I I I I I I I I I I I I I I I III 

Db 1180 

GTTAGCATTAATAGCTTACGTTACTATAAATACTGCTGCTTGGAAGCAGTACAACTGTTT 1121 

Qy 539 CAAGAAAC 

CTACTCAAAGC 557 

I I I I I I I I I I I I I I 

Db 1120 

TAGAGTTTTAAGACTACAGACTTTCATTACTCAAATCTTATTCAGTAAATGTAAAAATCA 1061 

Qy 55 8 — AGG-TC— ACCAGC CAGG-AGCCCTGACCCAGG 

CTGCCCAGC 595 

I I I I I I I I I I I I I I I I I II I I I I I I I I I 

I 

Db 1060 

GAAGGTTCTGAACAGCTGGTTAGGAAGGTAGCCAAGATGCAGGAAAGATGTCTGCGCCTC 10 01 
Qy 596 CTGTCCTTG TGTCGTCTTTTTAATTTTTCCTTAGATG GTC 



TGTCCT 641 



Db 1000 CTTTTCAAGGGCAGCCAACTCTTGAA 

CAGTAGGTGCCAAAATATCCACATGGCCT 946 

Qy 6 42 TTTTGTGATTTC TGTATAGGACT — 

CTT 667 

I I I I I I I I I I I I I I I 

Db 9 45 TTAT- 

AGCTTCCACCTGCGTCACACACAAATAACAAAGTAATGTAAATGACATGCTTGAA 8 8 7 
Qy 668 - 

TATCTTGAGCTGTGGTATTTTTGTTTTGTTTTTGTCTTTTAAATTAAGCCT 718 

I I I I I I II I I I I I I I III I I I I I I I III 

Db 8 86 ATAACTTGT- 

CAGCAATACTTTGGCTCAATTATATCACTTTAAACTTTGCTTGCAGCAAT 82 8 

Qy 719 CGGTTGAGC CCT 

TGTATATTAAATAAATGCATTTTTGTC 757 



Db 82 7 AAAATGTACTACTGGGCAAATTATCTCCAGAACTTTGACTTTAAAAAATG 

TTAAATC 771 

Qy 758 CTTTTTTAAAAAAAAAAAAAAAAA 781 

I I I I I I I I I I I I I I I 
Db 770 AATTATTTCAATATAAATAAATAA 747



RESULT 13 
US-10-478-914-20 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20

; LENGTH: 2495
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-20 

Query Match 40.6%; Score 1930.5; DB 1; Length 2495; 

Best Local Similarity 40.0%; Pred. No. 9.3e-103; 
Matches 512; Conservative 0; Mismatches 261; Indels 507; 
Gaps 75; 

Qy 1 GGCAC GAGC 



GAGTTCCTGTCTCT 23 

I I I I I I I I I I I I I I I I I I 

Db 357 

GGAACTATACTACAGAGCAATACAGTTCTACTTAGAATTCAAGCCTCTGTTGTTAAATGA 416 

Qy 2 4 CTGCCAACGC CGCC-CGGATGGCTTCC CAA 

AACCGCG- 59 

III II I I I I I I I I I I I III 

Db 417 

TTTGCTGATGGTGCTGTCTCCACGGTTGGATCACACTCGTGCAGTCAATTATTTCAGCAA 4 76 

Qy 6 0 ACCCAGCCGCCACTAGCGTCGCCGCC— GC— 

CCGTAAAGGAGC 99 

I I I I I I I I I I I I III II III I I I I 

Db 477 

GGTTAAACAGCTACCACTGGTGAAACCGTATTTGCGTTCAGTTCAGAACCATAACAACAA 536 

Qy 100 TGAGCC GAGC 

GGGGGCGCCGCCCGGGGTC 128 

I I I I I I I III II 

Db 537 

AT C T G T GAAT GAAT C AT T G AAC AAT CTTTTTAT T AC AGAAG AAG AT TATCAGGCTCTGCG 596 

Qy 129 CGGTGG — GCAAAAGGCTAC AGC AGGAGC T - 

GATGACCC 16 4 

I II II I I I I I II I I I I I II 

Db 597 

AACATCAATAGATGCTTATGACAACTTTGACAATATCTCGCTTGCTCAGCGTTTGGAAAA 656 

Qy 165 TCATGATGTC-TGGCGATAAAGGGATTT-CTGC CTTC 

CCT 202 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 657 

ACATGAACTCATTGAGTTCAGGAGAATTGCTGCTTATCTCTTCAAAGGCAACAATCGCTG 716 

Qy 2 03 GAATC 

AGACAACCTTTTCAAATGGGTAGGGACCATCCA 240 



Db 717 G AAAC AG AG T G T AG AGC TG TGC AAGAAAG AC AGC C T T T AC AA GGATG 

CAATGCA 770 

Qy 2 41 TGGAGCAGCTGGA ACAGTAT ATGAAGACCT 

GAGGTATAAGCT 2 82 



Db 771 

GTATGCTTCTGAATCTAAAGATACTGAATTGGCTGAAGAACTCCTGCAGTGGTTTTTGCA 83 0 

Qy 2 83 CT CGCTAG AGTTCC 

CC 298 



Db 831 

GGAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTACCTGTTACGATCTTTTAAGGCC 89 0 

Qy 299 AG-TGGCTACCCTTACAATGC GCCCA 

CAGTGAAGTTCCTCACGCCCTGCT 3 47 



891 AGATGTCGTCCTAGAAACTGCATGGAGGCACAATATCATGGATTTTGC-CATGCCCT- 



Qy 3 48 ATCACCCC AACGTGGAC ACCCAG — GGTAACAT 

ATGCCTGGACATC- 391 

I I I I I I I I I I I II II II III I I I I I 

I I I 

Db 949 

TTCATCCAGGTCATGAAGGAGTACTTGACAAAGGTGGATAAATTAGATGCTTCAGAATCA 10 0 8 

Qy 392 CT GAAGGAAAAG 

TGGTCTGCCCTGT 416 

II llll I III I I I I I I I I I 

Db 1009 

CTGAGAAAAGAAGAAGAACAAGCTACAGAGACACAACCCATTGTTTATGGTCAGCCCCAG 106 8 

Qy 417 ATGATGT CAGGACC — ATTCTGCTCTCCATCCAGAG 

CCTTCTAGG 459 

I I I I I I I I I I I I I III I I I I I I II I I I I I I 

Db 1069 

TTGATGCTGACAGCAGGACCCAGTGTTGCCGTCCCTCCCCAGGCACCTTTTGGTTATGGT 112 8 

Qy 46 0 AGAACC-CAACATTGATAGTCCC TTGAACACA-CAT 

GCTGCC 499 

I I I I I II II II I III I I I I I I III 

Db 1129 

TATACCGCACCACCGTATGGACAGCCACAGCCTGGCTTTGGGTACAGCATGTGAGATGAA 118 8 

Qy 50 0 GAGCT CTGGAAAA ACCC 

CACAGC 522 

llll I I I I I I llll I 



Db 1189 

GCGCTGATCCTGTAGTCACCTATTTTCGTACTGAAACATCGTCTTTACCCACTTCTCAGT 12 4 8 

Qy 52 3 TTTTAA GAAGTACCTGCAA GAAACC — 

TACTCAAAGCAGGTCACCAG 56 7 

I I I I I II II llll I I I I I III I I I I I I 

Db 1249 

TTATAATGGGGGAAAACAGGCAACGTGTTCTTGTAACCTTTATTTCATGAAGGACTTCTT 13 0 8 

Qy 56 8 CCAG GAGCCCTGACCCAGGCTGCCCAGCCTGT CCTTGTGT 

CG 609 

I I I I I I I I I I I III I I I I I I 

I 

Db 1309 TTTGTTTCTAACTATAAACTTGGATCACCT — 

ATGTTAAAACCTTATTTCACATTCCACA 1366 

Qy 610 TCTTTTTA-ATTTTTCCTTAGATGG TC 

TGTCCTTTT 644 

I I I I I I I I I I I I I I I I I II II 

I I I I I 

Db 1367 

TCATTTTAGAATTTATTTTCGAAGGGGAATAGTTTCAATGTTTTATTCACTTGGGCTTTT 142 6 

Qy 6 45 TGT GATTTCTGTATAGGACT-CTTTATCTT- 

GAGCTGTGGT A 684 

II I I I I I I I I I II I I I llll I I I I I I I 



TTTCTTCCCCCTCTTTCTTTAAAGAACTGCTCAATATTCAATCTGTTGTGAAGAACCTGA 14 86 



Qy 685 TT TTTGTTTTGTTT TTG — 

TCTTTTAAATTAA 714 



Db 1487 

TTTGCACTCTGTAGTGTTTAAAGAAACAAAGAAACTCTAATATTGAATCTCTTAAATTTA 15 46 

Qy 715 GCCTCGGTTGAGCCCTTGTA TATTAAATAAATGCATTTTTGTC — 

CTTTTTTAA 766 



Db 1547 

GTGTATGTAAACAGCTTACAAATACGTATTGTCTAAATGCATTTAAATCTGTTTTATTCA 16 06 

Qy 767 AAAAAAA AAAAAAAAAA 783 

I I I I I I III I I I I I 
Db 1607 AAGAAAAGCTAAAGCAAAAA 1626



RESULT 14 

US-10-478-914-20/C 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20

; LENGTH: 2495

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-20 

Query Match 40.5%; Score 1925.5; DB 1; Length 2495; 

Best Local Similarity 40.4%; Pred. No. 1.9e-102; 
Matches 477; Conservative 0; Mismatches 294; Indels 411; 
Gaps 64; 

Qy 1 GG CACGAGCGAGTT-CCTGTCTCTCTGCCA 

ACG 32 



Db 12 86 GGTTACAAGAACACGTTGCCTGT- 

TTTCCCCCATTATAAACTGAGAAGTGGGTAAAGACG 122 8 

Qy 33 CCGCCCG GATGGCTTCCCAAAACCGCGACCCAGCCGCCACTAGC- 

GTC 79 

II II I I I I I I I I I I I III III II 



Db 122 7 AT G T T T C AG T AC G AAAAT AGGTGAC - T AC AGGATC AGCGC TTCATC — 

TCACATGCTGTA 1171 

Qy 8 0 GCCGCCGCCCGTAAAGGAGCTGAGCCGAGCGGGGGCGCCG CC— 

CGGGG 126 

II I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1170 CCCAAAGCCAG-GCTGTGGCTGT- 

CCATACGGTGGTGCGGTATAACCATAACCAAAAGGT 1113 

Qy 12 7 T C C GG T GGGC AAAAGGC T AC AGC AGGAGC TGAT G AC — CCTCA — 

TGATGTCTGGCGATA 182 

I I I I I I I I I I I I II I I I I I I I I I I II I I I I I 

I I I 

Db 1112 GCCTGGGGAGGGACGGCAACACTGGGTCCTGCTGTCAGCATCAACTGG- 

GGCTGACCATA 1054 

Qy 183 AA GGGAT TTCTGCCTTCC 

CTGAATCAGA 210 

II I I I I I I I I I I I I I I I I I I I 

Db 1053 

AACAATGGGTTGTGTCTCTGTAGCTTGTTCTTCTTCTTTTCTCAGTGATTCTGAAGCATC 99 4 

Qy 211 CAACCTTTTCAAATGGGT-AGGGAC-CATCCATGGAGCAGCTGGAACAGTATATG— 

AAG 266 

II I I I I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 993 

TAATTTATCCACCTTTGTCAAGTACTCCTTCATGACCTGGATGAAATAGGGCATGGCAAA 93 4 

Qy 26 7 ACCTGAGGTATAAGCTCTCGCTAGAGTTCCCCAGTGG CTACCCTTA 

CAAT 316 

II I I I I I I I I I I I I I II I I I I I I I I 

Db 933 

ATCCATGATATTGTGCCTCCATGCAGTTTCTAGGACGACATCTGGCCTTAAAAGATCGTA 8 7 4 

Qy 317 GC — GCCCACAGTGAAGTTCCT CACGCCCTGCTATCACCCCAACGTGGACACC- 

CAG 370 

I I I I I I I I I I I I I I I I I I III II III 

I I I 

Db 873 

ACAGGTAAACAGACAAGCTCCAAAGCACTCTCTTTTTTCTTCCTGCAAAAACCACTGCAG 814 

Qy 3 71 GGTAACATATGCCTGGAC ATCCT— GAAGGAAAAGTGGTCTGCCCTGTAT- 

GATGTC 42 4 

I II III I I I I I II I I I I I I I I I I I I 

I I I 

Db 813 

GAGTTCTTCAGCCAATTCAGTATCTTTAGATTCAGAAGCATACTGCATTGCATCCTTGTA 75 4 

Qy 42 5 AGGAC CATTCT GCTCTCCA TCCAG 

AGCCTTCTAGGAG 461 

III I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 753 

AAGGCTGTCTTTCTTGCACAGCTCTACACTCTGTTTCCAGCGATTGTTGCCTTTGAAGAG 69 4 

Qy 462 A-ACCCAACATTGATAGTCCCTTGAACACA CATG — 

CTGCCGAGCTCTGGAAA 511 



II 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I 1 1 1 

Db 693 ATAAGCAGCAAT 

TCTCCTGAACTCAATGAGTTCATGTTTTTCCAAACGCTGAGCA 639 

Qy 512 AAC CCCA CAGCTTTTAA 

GAAGTACCTG 53 8 

II II I I I I I I I I I I I I I I 

Db 638 

AGCGAGATATTGTCAAAGTTGTCATAAGCATCTATTGATGTTCGCAGAGCCTGATAATCT 5 79 

Qy 539 CAA- 

GAAACCTACTC 552 

I I I I I I I I I 

Db 578 

TCTTCTGTAATAAAAAGATTGTTCAATGATTCATTCACAGATTTGTTGTTATGGTTCTGA 519 

Qy 553 AAAGCA GG — TCACCAG 

CCAGG A 5 73 

I I I I I II I I I I I I I II I 

Db 518 

ACTGAACGCAAATACGGTTTCACCAGTGGTAGCTGTTTAACCTTGCTGAAATAATTGACT 459 

Qy 574 GCCC TGACCC AGGCTGCC CAGCCTGTCCTT 

GTGTC 608 

III I I I I I I I I I I I I I I I I I I I 

I 

Db 458 

GCACGAGTGTGATC C AAC C G T G G AG AC AGC AC CAT C AGC AAAT C AT T T AAC AAC AG AG G C 399 

Qy 6 09 GT-CTTTTTAATTTTTCCTTAGATGGTCTGT 

CCTT 642 

I I I I I I I II I I I I I I I I I I I 

Db 398 

TTGAATTCTAAGTAGAACTGTATTGCTCTGTAGTATAGTTCCACATTGGCAACCTTGGTA 339 

Qy 6 43 TTTGTGAT TTCTGTATAGGACTC TTTATCTTG — 

AGCT 6 78 

I I I I I I I I I I III II I I I I I I I I 

I 

Db 338 

ATGATATCTTTGAATTGCCCTTCTTTCCAGGCATCAGTTGGATGATTCATCATGGTAATT 2 79 

Qy 679 GTGGTATTTTTGTTTTGTT TTTGTCTTTTAAA TTAAG- 

CCTCGGTTGA 725 

I I I I I I I I I I I I I I I I I I III II III I 

I I I 

Db 278 

ATGGCATTATCATATTCTTCATACTTGTCATACAAAAACACCAGTTCTGCCCAAAGATGA 219 
Qy 726 GC 

CCTTGTATATTAAATAAATGCATTTTTGTCC 758 

II I I I I I I I I I I I I I I I I 

Db 218 

GCTTGTTCTGCAGCTCTTAGCACCTTGGGAATATTCACTCTAGACCAGAACAGCTCCAGG 159 

Qy 759 TTTT TTAAAAAAAAAAAAAAAAAAA 783 

I I I I I I I I I II I I I I I 

Db 158 TGCTCCCTCATTTTCTGAGGCTTAAATTTAGAGTATAGAATA 117 



RESULT 15 
US-10-478-914-57 

; Sequence 57, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 57
; LENGTH: 2336
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-57 

Query Match 40.4%; Score 1921; DB 1; Length 2336; 

Best Local Similarity 33.1%; Pred. No. 2.5e-102; 
Matches 512; Conservative 0; Mismatches 264; Indels 772; 
Gaps 80; 

Qy 1 GGCACG AGC — 

GAGTTCCTGTCTCT 23 

I I I I I III I I I I I I I I I I I 

Db 100 

GGCGCGGTTCAATACCTCGTCTAGCCTGGGTTCCAGTATCTAATTTTTTTTTTGTTTTAA 159 

Qy 2 4 CTGCCAA CGCCGC CCGGA TGGC 

TTCC 49 

I I I I I I III I I I I I I I I I I I I 

Db 160 

CTGACAAACTCATTTCTCTACTGGGACAGGATGCTGTGCTGGCTGGAAGTTCCATTTCTA 219 

Qy 5 0 CA — AAACC GCGACCCAGCCG 

CCACTAGCGTCGCCGCCGCCCG 9 0 

II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 22 0 CAGCAAGAATCCTATCTGGAAACACAGAAGTTGTCCTCTAGC — 

CACAGCAGCTCGAACT 2 77 

Qy 91 TAAAGGAGCTG AGCCGAGCGGGGGCGCC 

GCCCGGGGT 12 7 



Db 2 78 TTTTTGATTGTCGTTGCTGCTTTCTCCCATC 

ACCCCCATCCCCTTTTGACAAAGAT 333 

Qy 12 8 CCGGTGGGCAAAAGGCT ACAG — 

CAGG 152 

II I I I I I I I I I I I I I I I I 

Db 334 

CCAACTGTAAAAAGTCTTACGTAACAGTTCAGGACTACTTCGGTTCTTTTACTGGGTAAG 393 



Qy 153 AGCTGATGACC 

CTCATGA 170 

I I I I II I I I I I 

Db 394 

CACTTTCAATTTTTTTTTTTTAACTAAAAGCCATTTTAAAATTGAATCTGTTGAGGGGCT 453 

Qy 171 TGTCTGG CGATAAAG GGATTTCTGCC TTC — 

CCT 202 

I I I I I I I I I I I I I I I I I III 

Db 454 

TGACTAAAATCTTTTAAGTAATTTGTGTAATGGAATACTGTCAGTGGATTTTTTCGTCTC 513 

Qy 2 03 GAATCAG-AC AACCTTT TCAAATGGGTAG 

GGACCATCCATGGAGC 2 46 



Db 514 

ATTTCTGCACGTGCTCCTTTGTTCTCAGAACAGAAGCTTTTTATACACATCCCATAACGC 5 73 

Qy 2 47 AGC T GG AAC AG TAT AT G AAG ACC T G AGG T AT AAG CTC- 

TCGCTAGAGTTCC 296 

I I I I I I I I I I I I I I I I I I I I I II III I I I I 

Db 574 

AGCTGGAGAGAGTTATGAAGTCAGTTATTATAAGGAACACAAAGGTTGCTTTCCATTCTT 633 

Qy 29 7 — CCAGTGGCTAC 

CCTTACAATG 317 

I I I I I I I I I I I I I I 

Db 634 

TGCCTTTAGATAATTAATTTTTTTGTTTTCTTAAAATGGAGTATTTAAAGAAGGAAGAAA 693 

Qy 318 — CGC CCACAG TGAAGTTCCTCACGCC- 

CTGCT 347 



Db 694 

TTCACAAGAAATAAACTGTTGGAGAATTTAGAAAAGTTTGAAGTTTTTACCACCTTTTCT 753 

Qy 3 48 ATCAC CCCAACG — TG- 

GACACCCAGGG 372 

I I I I I I I I I I I I I I I I I I 

Db 754 

ATCTCTAGTTTTGTGTGGCCAAACACTTGTGCCGCCTGGGGCGGTGGGGGTAGAGGCAAG 813 

Qy 3 73 TAACA TATGCC TGGACA 

TC 391 



Db 814 

CATAGACAGAGAGGAACTAAGCCAGACATGGACAAAGGCACGAGCCAAAACCAGACAGTC 8 73 

Qy 392 CTG AAGG — AAAAGT 

GGTCTGCC 412 

III I I I I I I I I I I I I I I 

Db 874 

CTGGCCGTTCGATAGGCCAGCAGTGGGTGAGACAGGTCAGCCAGCTGCAGGGGCGGGAGC 933 

Qy 413 CTGTATGA 

TGTCAGGA — 42 8 



Db 934 

GGGGGGGCGGGTAGGGACATGGATGTGTGAGGTGCTCATGCGTGCGATCGTGACTAGATT 993 

Qy 429 CCATT CTGCTC 

TCCAT 444 



Db 994 

CTGAAACTGCCAGCCATTTTCCCAGCTCCGCTTTGTGCAATCTAAAGGAATGCATCCCCT 1053 

Qy 4 45 CCAGAGCCT — TCTAGGAGA ACC- 

CAACAT— 471 

I I I I I I I I I I I I I I I III 

Db 1054 

CTGAAGCAGTCTTGCCAGAGCCTAGTGAGGGAGAGAAGTATGGTAAATACCACAACATAT 1113 

Qy 4 72 TGATAGTCCCTTGAAC ACA 

CATGCTGC 49 8 

I I I I I I I I I I II I I I I I 

Db 1114 

GGAATCAGAAAATACCGGGAACTGGAGTGGGCAAGGGGGAATGCAGAGGGTGTGGAAATT 1173 

Qy 499 CGAGCTCTGGAA 

AAAC C C C AC AG — 521 

I I I I I I I I I II 

Db 1174 

TTTTAGTGATCTGGAATGTGTTGAGTGACAGGAAGTGCCCCAAGCTTCTCCCCCACCAAC 1233 

Qy 522 -CTTTTAAGAAGTACCTG 

CAAGAAACCTACTCAAAGCAGGTCAC 564 

I I I I I I I I I I I I I I I I I I I I I II 

Db 1234 

TCTTCTCAGTCGCGCCTGCTTTTGTCTAACTCTTGTAATCTACACACTACTGCTTACAAA 1293 

Qy 565 -CAGCCAG GAGCCCT GAC-CCAGGCTGC 

CC 592 

I I I I I I I I I I I I I III 

Db 1294 

GCTGTCTGAGTTTAAGACAAAAAGAAACCTAAAAGTCTCCTTACTTCATAGTACCTGTGA 1353 

Qy 593 AGCCTGTCC — TTGTGTC 

GTCTTTTT 616 

I I I I I I I II II I I I I I I 

Db 1354 

TATGGAAGGAATGTAAAAGCATGACCCTTTAAGCCTATGGACATTTTTTCAGGTATACAG 1413 

Qy 617 AATTTTTCCTTAGA 

TGGTCTG — T 638 



Db 1414 

GAGAAAGATAAAATAATTTTCCACAGAAAAATGAGAATTATGAATTATATAGTTCAGGTT 1473

Qy 639 CC TTTTTGTGATTTCTGTATAGG — 

ACTCTTTATCTTGAGCTGTGGT — A 684 

II III I I I I I I I I I I I I I I I I I II 

Db 1474 CCAAATCTAATTTTTAAAAGAATTCTGATTCTGCTACACTTTA- 

CAAATGCTTAGGTTGG 1532 



Qy 685 TTTTTGTTTTGTTTTTGTCT 

TTTAAATTAAGCCTCGGTTGAGCCCTTGTATAT 737 

II I I I I I III III I I I I I I I I I I I I I I 

I 

Db 1533 

TTCCTAATTTGAAGGAGACTTGTTTTATTTGGTTAATGCATTGCATTTGAACTTGTTTCT 1592 

Qy 738 TAAATAAATGCAT— TTTTGTCCTTTTTTAAAAAAAAAAAAAAAAAAA 783 

I I I I I I I I I I I I I I I I I I I I III III 
Db 1593 ATTTTCTTTGCATAAATTTGGACTTTGGGAGAAAAATGCAAAGTAATA 1640 
RESULT 1
US-10-478-914-4 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4
; LENGTH: 3210
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-4 



Query Match 45.3%; Score 3132.5; DB 1; Length 3210; 

Best Local Similarity 38.5%; Pred. No. 1.5e-186; 



Matches 747; Conservative 0; Mismatches 360; Indels 831; 
Gaps 111; 

Qy 1 GCCGCTGC-CAC CGCACCCCGCCATGGAGCGG 

CCGTC 36 

I I I I I I I I I I I I I I I I I I I II I I I I 

Db 35 

GCAGGTGCTCTCTTAGTGCTCCCTGGCAAGGATGGGAAGGCATGTTCAGTCCTAGGAGTA 9 4 

Qy 3 7 GCTGC GCGCCCTGCTCCTCGGC 

GCCGCTGGGCTGCTGCT 75 

III I I I I I I I I I I I II I I I I I I I I 

I 

Db 95 GGAAGGGGCAGAGGTGTTGATGGC- 

CCCTACAGAGCGGCCAAGGACAAGGAGCTGCTGTT 153

Qy 76 C CTGC — TCCTGC-CCCTCTCCT— CTTCCTC 

CTCTT 107 

I III I I I I I I I II III I I I I I 

I I I I 

Db 154 

CGAAACAGCCTTCCTGCTCCCCAACCTGCCTCCCACCCAACAGGTTTTGCATATACTCTA 213 

Qy 108 C GGACAC CTGCGGC 

CCCTGCGAGCCGGCCTCCTGCCCGCCC 148 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 214 CTGGGAAGAGGGACACACCCGACTGCATCACTGCCCTCCAAGTC- 

TCTCCCTGCCCTGTC 272 

Qy 149 CTGCCCCCGCTGGGCTGCC — TGCTGGGCGAGAC — 

CCGCGACGCGTGCGGCTGCTGCCC 204 

III II III II I I I I I I I I I I I I I I I I I 

I I I 

Db 2 73 CAGCATCC-AGGAGCACCCCTAGTTGGG- 

GAAGCTTCTGTGACTCCCCCTACAACAGCCT 33 0 

Qy 205 

TATGTGCGCCCGCGGCGAGGGCGAGCCGTGCGGGGGTGGCGGC 2 4 7 

I I I I I I I I I I I I I I I I I I I I I III 

Db 331 AGGATGGAGTGGGGTTTGTGAACAAATGCAGAAGGC-AGTCTTAGGGAGGT — 

CAGCTGA 387 

Qy 2 48 GCC GGCAGGG GGTACTG 

CGCGCCGG— 272 

III I I I I I I I I I I I I I I I I I 

Db 388 

CATGCCCCTGGCCTGTGGCTGGGAAGTAGCAGAGGCTAAGGTTCTTCCCCGCTCTGGGGT 4 4 7 

Qy 2 73 -GCATGGAGT-GC GTGA AGAGCCG — 

CAA 297 

I I I I I I I I I I I I I I I I I I I 

Db 448 

TGCCAGGAGTAGCACTGGATCAGTCAGGTGACAGGGCTCTCCTCTCTCTGAGCAGGTCCG 5 0 7 

Qy 29 8 GAGGCGG AAGG — GTAAAGCC 

GGGGCAG 323 

I I I I I I I I I I I I I I I I I I I I I 

Db 508 

GTGGCAGCCTTCAAGGTCGCCACGCCGTATTCCCTGTATGTCTGTCCCGAGGGGCAGAAC 56 7 



Qy 32 4 CAGCCGGC GGTCCGGGTG- 

CGGCGTGTGCGTGTGC 361 



Db 568 

GTCACCCTCACCTGCAGGCTCTTGGGCCCTGTGGACAAAGGGCACGATGTGACCTTCTAC 62 7 

Qy 362 AAGA GCCGCTACCCGGTGTGCG GC 

AGCGACGGCAC— 396 

I I I I I I I I III I I I I II I I I I I I I I I 

Db 628 

AAGACGTGGTACCGCAGCTCGAGGGGCGAGGTGCAGACCTGCTCAGAGCGCCGGCCCATC 6 8 7 

Qy 39 7 CACCT— ACCCGAGCGGCTGCCA- 

GCTGCGCGCCGC 429 

I I I I I III I I I I I I I I I I I I I I 

I 

Db 688 CGCAACCTCACGTTCCAGGACCTTCACCTGCACCATGGAGGCCACCAGGCTGC- 

CAACAC 7 46 

Qy 43 0 CAGCCA-GA — GGGCCGAGAGCCGCGG GGAGAAGGCC 

ATCACCCAGG 4 73 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I II 

Db 747 

CAGCCACGACCTGGCTCAGCGCCACGGGCTGGAGTCGGCCTCCGACCACCATGGCAACTT 8 06 

Qy 474 TCAGCAAGGGCA CCTGCGAGCAAG-GTCCT 

TCCATAGT 510 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 807 

CTCCATCACCATGCGCAACCTGACCCTGCTGGATAGCGGCCTCTACTGCTGCCTGGTGGT 866 

Qy 511 GACGCC-CCCCAAGGACATCTGGAATGTCACTGGTGCCCAGGTG- 

TACTTGA 56 0 

I I I I I I I III III III I I I I I I I I I I I I 

I 

Db 86 7 GGAGATCAGGCACCACCACTCGGAGCACAGG 

GTCCATGGTGCCATGGAGCTGCAGGT 923 

Qy 561 GCTGTGAGGTCATCG — GAATC — CCGAC ACCTGTCCTCAT — 

CTGGAAC 604 

III I I I I I I III I I I I III I I I I I I I I I I 

Db 92 4 GCAGACAGG- 

CAAAGATGCACCATCCAACTGTGTGGTGTACCCATCCTCCTCCCAGGAGA 9 82 

Qy 6 05 AAGGTAAAAAGGG 

G T C AC T AT GG AG T T C AAAG 636 

I I I I I I I I I I I I I I I I I I 

Db 983 

GTGAAAGTAAGGGACCAACCTCTTGCCCCTTTTGGGTTCTCTGTTTTCTTCTGTCCTCAT 10 42 

Qy 63 7 GAC AGAACTCCTGCCTGGTGAC 

CGGGACAACCT 669 

III I I I I I I I I I I III I I I I 

I I 

Db 1043 

CCTGCACCCAGACCCTGTTTGGAACTCTGGCCTCATCACCCCAAGCCCTCAGAACCCCCC 1102 

Qy 6 70 GG CCATTCAG ACCCGGGGTG 



GCCCA 69 4 

II I I I I I I I I I I I I 

I I I I 

Db 1103 

GGTCCTCCTCCTTTTCTGCTGCTGCACATCCCTTCTGCTTCCTCCTTGGTGCAATCCCCA 1162 

Qy 695 GAAAAGCA TGAAGTAACTG 

GCTGG 718 

III II I I I I I I I I I I I I 

Db 1163 

GAAGCCCACTCTCCTTCCATCTGCTCTGGAGTCTCTGCTCCTCTTGACTCTCTGGAGTGG 12 2 2 

Qy 719 — GTGC — TGGTA-TCTCCTCT AAGT 

AAGGAAGATGCTGG 753 

I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 1223 

CTGTGCCTTGGCAGTGACCTTTGGCCAGGGCAAGTGCCTCATGACAGGTACTGGGTGCCC 12 82 

Qy 75 4 — AG AAT AT GAG T GCCATG-CATCCAATTCCCAAGGAC 

AGGCT 793 

II I I I I I I I I I I III I I I I I I I I I I I 

Db 1283 

CAGGCAGCTAAGTGCCGCCCTGCCCACCAGCCCCCTATGGCTTGGGAAGGCTGGGGGTCC 13 42 

Qy 79 4 TCAG CATC AGCAAAAATTA — CAG — 

TGG 818 

III I I I I I I I I I I I I I III 

I I I 

Db 1343 

TCTTGGCCAACAGGGTGAAACCCCATCTCTACTAAAAACACAAAAATTAGCCAGGCATGG 14 02 



Qy 819 TTGATGC-CTTACAT G AAAT AC CAG 

TGAA 8 46 

I I I I I I I II I I I I I I 

I I I I 

Db 1403 -TGGTGCGC — 

GCCTGTAGTCCCAGCTACTCAAGAGGCTGAGGCAAGAGAATCGCTTGAA 1459 

Qy 8 47 AAAAGGTGAAG-GTGCCGAGCTATAAAC 

CTCCAGAATATTATTA 8 89 



Db 1460 

CCCGGGAGTCAGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGCTGATA 1519 

Qy 890 G TCT 

GCATGGTTAAAAGT 907 

I III I I I I I I I 

Db 1520 

GAGCAAGACTCCATCTCAAAAAAAACAACCCAAAATTTGCCTGGCATGGTGGCAGGCATC 15 79 

Qy 908 AGTCATGGATAACTACAT- 

TACCTG 931 

I I I I I I I I I I I I I I I I I 
Db 1580 TGTAATCCCAGCTACTCGGGAGGCTGAGACAT-

GAGAGCTGCTTGAACCTGGGAGGCAGA 1638

Qy 932 TTCTTGCCT 

AATAA 9 45 

II I I I I 



Db 1639 

AGTTGCAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCT 169 8 

Qy 9 46 GTTTCTTTTAATCCAATC CACT AAC AC T T TAG T TAT AT T 

CA— 9 86 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 1699 

GTCCCAAAAAATCAAAAAAATCACTTTTGGTAGAGATGCACTCTCGCTATGTTGCCCAGG 175 8 

Qy 987 CTGGT T T T AC AC AG AG AAAT AC AAAAT - 

AAAGAT 1019 

I I I I I I I I I I I I I I I I I I 

I I I 

Db 1759 

CTGGTCTTGAACTCCTGGGCTCAATTGATCTTCCCACCTTGACCTCCAAAGTGCTGGGAT 1818 

Qy 102 0 CACA CATCAAGACTATCTACAAAAATTTATTATA 

T AT T T AC AG AAG 1065 

III I II I III II I I I I I I I I III 

I I 

Db 1819 

TACAGGTGTGAGCCACCATGCCTAGCCTCAGGGAATTCTTATAAGAACTCTATGAAGTAG 18 7 8 

Qy 1066 — AAAAGCATGCATATCATTAAACAAATAAA 

ATACTTTTTATC 1106 

I I I I I I I I I II I I III I I I I 

I 

Db 18 79 GCATCACCAT- 

CTTCTCTGTATCCATGGAAAGAGAGGCCTAGAGATGTATGCTAACTTGC 193 7 

Qy 1107 ACAAAAAAAAAAAAAAAA 1124 

III III I 

Db 1938 CCAAGCTCACATAGCCCA 1955 



RESULT 2 

US-10-478-914-61 

; Sequence 61, Application US/10478914
; Patent No. 7335755
; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61
; LENGTH: 2437
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-61 



Query Match 43.6%; Score 3018; DB 1; Length 2437;

Best Local Similarity 36.6%; Pred. No. 1.2e-179;
Matches 758; Conservative 0; Mismatches 357; Indels 956;
Gaps 120;

Qy 1 GCCGCTG CCAC-CGCAC CCCGC 

CATGGAGCGGCCGTCG C 3 8 



Db 19 

GCTGCGGAATTCCTCGAGCACTGTTGGCCTACTGGGTGTACTGTGGAGACTGTCAAAGTC 7 8 

Qy 39 TGC — GCGCCCTGCTCCTCGGCGCCGCTGGG — CT GCTGCTCCTGCTCC 

TG 85 



Db 79 TCCCGGAGCCCAATTTC- 

CGGAAGCGGTGAGTTCTGAAAGAAGTTCCTGCACCGTAGTTT 13 7 

Qy 86 CCCC — TCTCCTCTTCCTCCTCTTCGGACACCTGCGGC 

CCCTGC— 127 

III I I I I I I I I I I I I I I III 

Db 138 

CCCAAGTCTGCGAATCCCCAACCATGAGCGCCTCGGGCGTACTGTCCTTTACCCAGCAAG 19 7 

Qy 128 GAGCCGGCC TCCTGCCCGCC- 

CCTGCC 153 



Db 198 

GATGGGAGCAGGTGCTGGCCAAAGTGAAACGGGCTGTGGTTTACCTGGACGCCGCCTGCG 257 

Qy 154 CC CGCT-GGGCTGC CTGCTG — GGC 

GAGACCCG- 183 

II I I I I I I I I II I I I I I III 

Db 258 

CCGAGAGCCTGCACTGGGGCTGCGGATCCACCCGTCTCCTGGAGGCGGTGGGGGGCCCTG 317 

Qy 184 CGACGC GTGCGGC — 

TGCTGCCC 204 

I I I I I I III II II 

Db 318 

ACTGTCACCTGCGAGAGTTCGAGCCCGACGCAATTGGTGGTGGAGCCAAGCAGCCCAAGG 3 7 7 

Qy 2 05 TATGTGC GCCCGCGGCGAGGGCGAGCCGTG 

CGGGGGTGGC 2 44 



Db 3 78 CAGTGTTTGTGCTGAGCTGCCTGCTGAAAGGCCGGACCGTGGAGATCCTACGG- 

GAC 433 

Qy 2 45 GGCGCCGGCAGGGGGTACTGCGCGCCGGGCATGGAGTGCGTGA 

AGAGCCG 29 4 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 434 

ATCATCTGCCGCAGTCACTTCCAGTATTGTGTGGTGGTCACAACCGTGAGCCACGCTGTC 493 

Qy 295 CAAGAGGCGGAAGGGTAAAGCC 

GGGGCAGCA 325 

I I I I I I I I I I I III 



Db 49 4 CACCTCACAGCTAATCATGTCCCAGCGGCGGCAGCG 

GCCGAGATGGAGGGGCAGCA 5 49 

Qy 326 GCCGGCGGTC CGGGTG 

TAAGC 3 46 



Db 550 

GCCGGTGTTCGAGCAGCTGGAGGAGAAGCTGTGTGAATGGATGGGCAACATGAACTACAC 6 09 

Qy 3 47 GGC GTGTG-CGTG TGCAAGAGCC 

GCTACCCGGTGTGC 382 

III I I I I III III I II I I I I I I I 

Db 610 

GGCCGAGGTGTTCCATGTCCCGTTATTGCTTGCCCCTGTTGCTCCCCACTTTGCCTTGAC 669 

Qy 3 83 GGCA GCGAC GGCACCACCT- 

AC 403 



Db 670 

TCCAGCTTTTGCATCCCTTTTCCCACTGCTACCCCAGGATGTGCACCTCCTTAATAGCGC 72 9 

Qy 40 4 CCGAGCG GCT GCCAG 

CTGCGC 424 

I I I I I I III I I I I II 

Db 730 

CCGACCGGACAAGAGGAAGCTGGGAAGCCTGGGTGATGTGGACTCCACTACGCTAACCCC 7 89 

Qy 425 — GCCGC— CAGC CAGA GGGCC 

GAGAGC 448 

I I I I III I I I I I I I I 

Db 790 

AGAGCTGCTGCTGCAGATCAGATGCCTAGTGTCAGGCCTCAGTTCTCTGTGTGAACATTT 8 49 

Qy 4 49 CGCGGGGA GAAGG — 

CCATCACCCAGGTCAGCAAGG 482 

I I I I I I I I I I I I I I I I I I I I I I 

Db 850 

AGGAGTACGGGAGGAGTGTTTTGCTGTAGGTTCCTTAAGTCAGGTCATCGCTGCGGATCT 9 09 

Qy 4 83 GCACCTGC GAGCAAG GTC 

CTTCCATAGT 510 



Db 910 

GGCCAATTATGCCCCTGCAAAGAACAGGAAGAAGACTGCTGCAGGCAGGGCATCAGTGGT 969 

Qy 511 GAC GCCC CCCA AGGACATC- 

TGGA 533 

III III III I I I I I I I I I I I 

Db 970 

TTTTGTGGACAGAACCCTGGATCTCACAGGAGCAGTTGGACATCATGGAGACAACTTAGT 102 9 

Qy 53 4 ATGTCACTGGTGCCCAGG 

TGT A 555 



Db 1030 

AGAGAAGATCATTTCAGCACTTCCCCAGCTCCCAGGCCACACAAATGATGTGATGGTTAA 10 89 



Qy 556 CTTGA GCT GTGAGGT — 

CATCGG- 576 



Db 1090 

CATGATAGCGCTCACTGCACTCCATACTGAGGAGGAAAATTATAATGTGGTTGCACCAGG 1149 

Qy 577 AATCCC GACACC TGTCCT CAT— CTGGA- 

AC 604 



Db 1150 

CTGTCTTTCACAATTCAGTGACACCACAGCCAAAGCCCTATGGGAAGCTTTACTGAACAC 12 09 

Qy 6 05 AAGGTAAAAAGGGGTCACTATGG-AGTTCAAAG GAC 

AGA 6 42 



Db 1210 

TAAGCACAAAGAGGCAGTGATGGAAGTTCGGAGACATCTAGTGGAAGCGGCAAGCAGAGA 12 69 

Qy 6 43 ACTCCTGCC TGGTG ACCGGGACA — ACCTGGCC 

AT 675 



Db 1270 

AAACCTGCCAATCAAGATGAGTATGGGGAGAGTCACACCGGGACAGCTCATGTCCTATAT 1329 

Qy 6 76 TCAG ACC CGGGGTGGC 

CCAG 695 

I I I I III I I I I I I II 

Db 1330 

TCAGCTCTTCAAGAACAACCTCAAAGCTCTAATGAATCATTGTGGCCTCCTCCAGCTTGG 13 89 

Qy 696 AAAAGC ATGAAGTA ACTGGC TGGG 

TGCTGGT 726 



Db 1390 

ACTGGCCACAGCTCAAACGTTGAAACACCCACAGACTGCCAAGTGGGACAACTTTCTGGC 14 49 

Qy 72 7 AT CTCC TCTAAGTAAGGAAGATGCTGGAGAATATG- 

AGTG 765 

I I I I I I I I I I I I I I I I I I I I 

Db 1450 

TTTTGAAAGGCTCCTTCTTCAGAGCATTGGGGAGTCAGCAATGTCCGTTGTGTTAAATCA 15 09 

Qy 766 CCATGCAT CCAAT-TCCCA-AGGAC 

AGGCTTCAGC A 799 



Db 1510 

GCTGCTGCCCATGATTAAGCCTGTAACCCAGAGAACCAACGAGGACTACAGCCCTGAGGA 1569 

Qy 800 TCAGC 

AAAAATTACAGTGGTTGATGC 825 



Db 1570 



ACTGCTGATCCTTCTCATATATATTTATTCTGTCACTGGAGAGCTCACGGTAGACAAAGA 162 9 

Qy 826 CTTACAT GAAATACCAGTGAAAAAAG 

GTGAAGGTGC 861 



Db 1630 

CCTGTGTGAAGCAGAAGAAAAAGTCAAGAAAGCATTGGCTCAGGTCTTCTGTGAGGAATC 16 89 

Qy 862 CGAGCTATAAAC CTCCAGAATATT ATTAGTCTG- 

CAT- 89 7 



Db 1690 

TGGATTGTCACCTTTGCTGCAAAAAATTACGGACTGGGACTCTTCAATTAATCTGACATT 17 49 

Qy 89 8 GGTTAAAAGTAGTCATGGA 

TAACTACATTACCTGTTCTTGC 93 8 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1750 

TCACAAATCCAAAATTGCCGTGGATGAACTCTTTACTTCACTTCGGGATATTGCTGGAGC 18 09 

Qy 939 CT-AATAAGTTTCTTTTAATCCAATCCACTA- 

AC 970 



Db 1810 

TCGGAGTCTCCTGAAACAGTTTAAGTCTGTATATGTTCCTGGAAATCATACCCACCAGGC 1869 

Qy 9 71 ACTTTA GTTATATTCACTGGTT TTACA — 

CAGAGA A 1004 



Db 1870 

ATCTTATAAGCCATTGTTGAAGCAAGTTGTGGAGGAAATATTTCATCCCGAGAGGCCAGA 192 9 

Qy 10 05 ATAC-AAAATAAAGATCACA CATCAAGACT ATCTAC — 

AAAAAT TTATTATA 1053 

II III I I I I I I I I I I I I I I I I I I I I I I 

Db 1930 

TTCCGTTGATATTGAACACATGTCTTCAGGCCTCACTGATCTCCTTAAAACTGGATTTAG 19 89 

Qy 1054 T AT T T AC A- G AAGAAAAGC ATGC AT -AT CAT 

TAAACAAATAAAATACTTTTT— 1103 

I I I I I I I I I III II I I I I I II I II 

Db 1990 

CATGTTCATGAAGGTGAGCCGGCCTCATCCTAGTGACTACCCCCTCCTGATCCTCTTTGT 2 0 49 

Qy 1104 AT C AC AAAAAAAAAAAAAAAA 112 4 

I I I I I II III 

Db 2 050 GGTAGGTGGGGTCACAGTCTCTGAAGTGAAA 2080 



RESULT 3 

US-10-478-914-64 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 



; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64
; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-64 

Query Match 43.2%; Score 2987.5; DB 1; Length 2120; 

Best Local Similarity 41.5%; Pred. No. 6e-178; 

Matches 669; Conservative 0; Mismatches 410; Indels 533; 
Gaps 93; 

Qy 1 GCCGCTG CC ACC— GCAC CCC— 

GCCA- 2 3 

I I I I I I II III III III 

I I I I 

Db 519 

GCAGCTGGCCAGCCTTCCTGGCTTGAACCTGGGCTCACTCAAGGACAAGGCCCAGGCCAC 57 8 

Qy 2 4 — TGGAG CGGCCGTC-GCTG CGCGCCCTG 

CTCCTCGGCG 59 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 579 

ACTGGGGGATCTCAAGCAATCAGCTGAGAAGTGTCACGTCATGTGACCACTTCCCCGGGG 63 8 

Qy 6 0 CCGCTGGGCTG 

CTG 73 

II I I I I I I I I 

I 

Db 639 

TTACCCACTGGGCTGGGCCCCCATGAGGGCTAAGAGTGTGTCAACTTCCAGGGACCCATA 69 8 

Qy 74 CTCC TGCTCC 

TG 85 

I I I I I I I I I 

I I 

Db 699 

CTCCATTTGGGGCTTTGTTTCCCTTGCCCCATCCTAGTTCCAAGACCTTTCCCATCCATG 75 8 

Qy 86 CC CCTCTCCTCT 

TCCTCCTCTTCGGAC 112 

II I I I I I I I I I I I I I I I I I I 

I 

Db 759 

CCCCAAGCCTATCTTCTGGTTTCTTCCTCTCCGCTGGGAGTAAAGTCCCCATCTTCACTC 818 

Qy 113 — ACCTGCGG CCCCTGCG AGCC 

GGCCTCCT 140 

I I I I I I I I I I I I I I I I I I I I I 



Db 819 

TACCCTTCAGGACCCTCCCCACCAGCTCAGCCTGTGGAGGCCTCCCAAGATTGTAGGAAT 8 7 8 

Qy 141 — GCCCGCCCCT GCCCC 

CGCTGGGCTGCCTGCTGGG — 174 

I I I I I I I I I I I I I I I I I I I I I I 

Db 879 

AGGCCCATCCCTCTCTGGCCATGGCCCCAAGTTCCTGCACACAGGAGCACCCACAGAGAG 93 8 

Qy 175 CGAGACCCGCGACGCGTGC 

GGCTGCTGCCCTATGTGCGCCCGC 217 

I I I I I I I I I I I I III III I I II 

I 

Db 939 

ACACACACAGGACACAAAACCCCTGGCACGTTCAGAGACAGAAGCCACAGACACATCC-C 99 7 

Qy 218 GGCGAGGGCGAGC-CGTGCGGGGGTGGCGGCGCCGGCAGG-GGGTACTG- 

CGCGCCGGGC 2 74 

III II II III I I I I I I I I I I I I I I I I I I 

I 

Db 99 8 GGCACAGACAGACACACACGAG GCCAGCTCC— 

CTTGCGTGTCCAGCCCCTCCAGAC 1052 

Qy 2 75 A TG 

GAGTGCGTGAAGAGCCGCAAGAGGCGGAAGGGTAAAG 314 

I II I I I I I I I I I I I I I I I I I I 

I 

Db 1053 ACCACCACTCAGAAACTCTGAGAGAGAGCATGGGCAGACACCCTCAGCAGACAGG — 

AGG 1110 

Qy 315 CCGGGG CAGC 

AGCC 32 8 

I I I I III I I I I 

Db 1111 

CCTGAGTTCCAGTCTCCACCTTTATTGTTCTTGAAAGCCCCTGCTCTCTCTGAGCCTTAT 117 0 

Qy 329 GGCGGTCC-GGGTG TAAGC 

GGCGTGTGC 355 

I I I I I I I I III I I I I I 

I 

Db 1171 

TTCATCATCTGTAAAATGGGAATGTCCTGAATGACTTCTAAGGCTCTTTCTGGCTTGAAC 123 0 

Qy 356 -GT — GTGCAAGAGCCGCTACCCG — GTGTGC — GGCAGCGACGGC- 

ACCACCTACCCGA 40 7 

II I I I I III III I I I I I I I I I I III I I I I II 

I 

Db 1231 

TGTCAGAGCCAAGCCCACATCCCTCCTTGGGCAGGGCAGCAGCTGCTGCCACAGCCTCCA 12 9 0 

Qy 40 8 GCGGCTGCCAGCTGC — GCGCCGCCAGCCAGAG 

GGCCGAGAGCCG 45 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1291 GCGGCTGCCA- 

CTGTGGGCTCTGGGAGCCGGAGCGATGCTGTGTGAGAGGCAGAGTGCCA 13 49 

Qy 451 CGG— GGAGAAGGCCATCACCCAGGTCAG CAAGGGCACCTGCGAG — 

CAAGGTCC 501 

II III III III I I I I I I I I I I I I II I I I I 



Db 135 0 AGGATGAAGCTGGCACTGAAC — 

AGTAAGCGGCTCCAGGCCTCCTCTGGGCCCAGGGCCC 1407 

Qy 502 TTCCAT AGTGACGCCCCCCAAGGAC ATCTGGAAT — GTCACTGG — 

TGCCCAG 55 0 

III II I I I I I I I I I I I I I I I I I I I 

Db 1408 

AGCCAATTTCTGTTCTGTTCCTGTAGAACGCTCTCTGGATTCCATAGCTGGAATCTCC— 1465 

Qy 551 GTGTACTTGAGCTGTGAGGTCATCGGAATCCC- 

ACCTGTCCTCATCTG 600 



Db 1466 -TCTCTTAGCTCAGTGA- 

AAAATAAAAATCCCAAATGGTGTGCCTACCTTCCCACTTCT- 152 2 

Qy 601 GAACAAGGTAAAAAGGGGTCACTATGGAGTTCAAAGGACAGAACTCCTGCCT- 

GG 654 



Db 1523 -TAC-TGGCTTCCAGGAGTC 

TTGGAGTTCATAGCCCCCCGAGCCTGCCTTAAAGGGG 157 7 

Qy 655 T G AC C G G G AC AAC CTGGCCATT C AG — AC C C GGGGT GGC — 

CCAGAAAAGCAT 7 03 

I I I I I I I I I I I I I II I I I I III I I I I 

Db 1578 TGTCC — TCCACCCCCCACCTACAGCTTCACAGGAGGGGAGAGGGCATCCAG 

TGCTA 1632 

Qy 70 4 GAAGT — AACTGGCT-GGGTGCTGGTATCTC CTCTAAGT- 

AAGGAAGATGCTGGAG 755 



Db 1633 

GGAGTAGAAGTGTCTCCAGCTCTGTTCTCTTGGGGCCCTGGGTGAAGGTGGGGTCTGGGG 1692 

Qy 756 AATATGA GTGCCAT GCATCC- 

AATTCCCAAGGACA 789 

I I I I I I I I I I I I I I I I I 

Db 1693 

CTTATGAAATAGGTCTGGGCTTTGAGGAGGATGGAGCAGCCTCATTATGTGGGGAAGATG 1752 
Qy 790 -GGCTTC 

AGCATCAGCAAAAATTACAGTGGTTGATGCCTTACATGAAATACCAGTGA 8 45 



Db 1753 GGGCCTCTGGGGCGTCA-CTGAGACCACA 

GGTGGGGCCGGGGCTGGACCGCAGCTG- 18 0 7 

Qy 8 46 AAAAAGGTGAAGGTGCCGAGCTATAAACCT 

CCAGAATATTATTAGTCTGCATGGTTA 9 02 

I I I I I I I I I II III 
Db 1808 TCTTGGGTGCCTGTGCC-TGCACCCCTCCTCACCCTAGAGACGGAAGATGTGCA- 

AAAAG 1865 

Qy 903 

AAAGTAGTCATGGATAACTACATTACCTGTTCTTGCCTAATAAGTTTCTTTTAATCCAAT 962 



■-GAC- 

I I I I I I III 



Db 1866 AAAGAAGG-AAGGGCAACTGCATT-CCAGCCC 

CACACTGTGATGACTTTGAGCCTGT 192 0 

Qy 963 CCACTAACACTTTAGTTATATTCACTGGTTT- 

TACACAGAGAAATACAAAATAAAGATCA 1021 



Db 1921 CCTTTCCCTCCTTGAGCCTGT 

CTTGCTTGTCCCCTGTAAAATGAACAGTCCCCCTCT 1977 

Qy 1022 CAC AT C AAGAC T ATC T AC AAAAAT T T AT T AT AT AT T T A- 

CAGAAGAAAAGC 10 71 



Db 1978 

CCCCCAAATAGTAATAATACATGTTTCAAAGGGTGACCATTTATAAAGCATATGACAAAC 2 03 7 

Qy 10 72 ATGCATATCAT T AAAC AAAT AAAAT AC T T T T T AT C AC AAAAAAAAAAAAAAA 1123 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 03 8 CATATCAATAAATGTAACTCATTCTTTTAAAAAAAAAAAAAAAAGGCCA 2 0 86 



RESULT 4 

US-10-478-914-4/C 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4
; LENGTH: 3210
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-4 

Query Match 42.8%; Score 2960.5; DB 1; Length 3210; 

Best Local Similarity 35.9%; Pred. No. 2e-175; 

Matches 746; Conservative 0; Mismatches 357; Indels 973; 
Gaps 126; 

Qy 1 GCCGCTG CCACC GCA CCCCGC— 

CATG 2 5 

I I I I I I I I I I I III I I I I I I 

Db 2134 

GCCTCTGAAAGTCAGGATACTGACCACCTCGGCATCTCTCCCCTTCCCCACTGCTTGCGA 2 0 75 



Qy 26 
TGGGC 6 8 



Db 2074 

ATGCCAGTGCTGGCCTGTTCCTTAAACGCACCAGCTCATCCCCCAGAGCAGCCTGTGTGC 2 015 

Qy 69 T GCTGCTCCTGCTCCTGCC — CCTCTCCTCTTCCTCCT 

CTTC— 108 

I I I I I I I I I I I I I I I I I I I III I I I I III 

Db 2014 TCTGCCAGCGACTGCTACTTCTCCT- 

CCGGCCTCAGCTCAACATCCCAGCTATGCTACCC 1956 

Qy 109 GGACA CCTGC 

GGCCCCT 125 

I I I I III I I I I I I 

Db 1955 

TGGGCTATGTGAGCTTGGGCAAGTTAGCATACATCTCTAGGCCTCTCTTTCCATGGATAC 1896 
Qy 126 -GCGA 

GCCGGCCTCCT 140 

III I I I I I I I 

Db 1895 

AGAGAAGATGGTGATGCCTACTTCATAGAGTTCTTATAAGAATTCCCTGAGGCTAGGCAT 1836 

Qy 141 GCCCGCCCCTG— CCCCCGC 

TGGGCTG C 166 

I I I I I I I I I I I I I I I I I I 

I 

Db 1835 

GGTGGCTCACACCTGTAATCCCAGCACTTTGGAGGTCAAGGTGGGAAGATCAATTGAGCC 17 76 

Qy 16 7 CTGCTGGGCGAGACCCGC GAC 

GCGTGCGGCTGCTGCCCTATGTG 210 

II I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1775 CAGGAGTTCAAGACCAGCCTGGGCAACATAGCGAGAGTGCATCT-

CTACCAAAAGTGATT 1717

Qy 211 CGC 

CCGCGGCGAGGGCGAGCCGTG 23 4 

III II III I I I I I I I I 

Db 1716 TTTTTGATTTTTTGGGACAGAGTCTCGCTCTGTCACCCAGGC — 

TGGAGTGCAGTGGTGT 1659 



Qy 235 CGG GGGT GGCGGC 

GCC— 250 

III III I I I I 

Db 1658 

GATCTCGGCTCACTGCAACTTCTGCCTCCCAGGTTCAAGCAGCTCTCATGTCTCAGCCTC 1599 

Qy 251 GGCAGGGGGTAC TGC — GC 

GCCGGGC 274 

I I I I I III III II I I I I I I 

Db 1598 

CCGAGTAGCTGGGATTACAGATGCCTGCCACCATGCCAGGCAAATTTTGGGTTGTTTTTT 1539 

Qy 2 75 ATGGAG TGCGTGAAGAGCC-GCAAGAG- 

GCGGAAGGGTAAAGCCGGGGCA — 322 

I I I I I I III I III II I I I I I I II I I 

Db 1538 

TTGAGATGGAGTCTTGCTCTATCAGCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACT 14 79 



Qy 



323 GCAGCC— 



GGCGGTCCGGGTGTAAGC 3 46 

I I I I I II I I I I I I I I I I 

Db 1478 GCAACCTCTGAC- 

TCCCGGGTTCAAGCGATTCTCTTGCCTCAGCCTCTTGAGTAGCTGGG 142 0 

Qy 3 47 GGCGTG TGC 

GTGTG 360 

I I I I I III I I I I 

Db 1419 

ACTACAGGCGCGCACCACCATGCCTGGCTAATTTTTGTGTTTTTAGTAGAGATGGGGTTT 136 0 

Qy 361 CAAGAG CCG— CTACCC GGTGTGC- 

GG 384 

I I I I I I II I I I I I I I I I I I 

I I 

Db 1359 

CACCCTGTTGGCCAAGAGGACCCCCAGCCTTCCCAAGCCATAGGGGGCTGGTGGGCAGGG 13 0 0 

Qy 3 85 CAGC GAC GGCACCACCTACC CGAGCGGC 

TGCC 416 

III II I I I I I I I I I I I I I I I I I I I 

Db 1299 

CGGCACTTAGCTGCCTGGGGCACCCAGTACCTGTCATGAGGCACTTGCCCTGGCCAAAGG 12 4 0 

Qy 417 -AGCTGC GCGCCGCCA-GCCAGAGGGCCGAGAG CC 

GC 451 

I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 1239 

TCACTGCCAAGGCACAGCCACTCCAGAGAGTCAAGAGGAGCAGAGACTCCAGAGCAGATG 118 0 

Qy 452 GGGGAGAAGGC — C AT — CACCCAGGT 

CAGC 478 

I I I I I III I II I I I I I I I 

I I I I 

Db 1179 

GAAGGAGAGTGGGCTTCTGGGGATTGCACCAAGGAGGAAGCAGAAGGGATGTGCAGCAGC 112 0 

Qy 4 79 A AGGGC ACCTG 

CGAG 493 

I I I I I I I I I I 

I 

Db 1119 

AGAAAAGGAGGAGGACCGGGGGGTTCTGAGGGCTTGGGGTGATGAGGCCAGAGTTCCAAA 106 0 

Qy 49 4 CAAGGTC 

C 501 

I I I I I I I 

Db 1059 

CAGGGTCTGGGTGCAGGATGAGGACAGAAGAAAACAGAGAACCCAAAAGGGGCAAGAGGT 10 0 0 
Qy 502 — 

TTCCATAGTGACGCCCCCCAAGGA C 52 6 

I I I I I I I I I I I III 

I 

Db 999 

TGGTCCCTTACTTTCACTCTCCTGGGAGGAGGATGGGTACACCACACAGTTGGATGGTGC 9 4 0 

Qy 52 7 ATC-TGGAATGTC ACTGGTGC CCAGGTGTACTTGAGCT — 

GTGAGGT 570 

I I I I I I I I I III II I I I I I I I I I I I II 



Db 939 ATCTTTGCCTGTCTGCACCTGCAGCTCCATGGCACCATG- 

GACCCTGTGCTCCGAGTGGT 881 

Qy 571 CAT — CGGAATCCCGACACCTG TCC- 

TCATCTGGA 6 02 



Db 880 

GGTGCCTGATCTCCACCACCAGGCAGCAGTAGAGGCCGCTATCCAGCAGGGTCAGGTTGC 821 
Qy 6 03 ACAAGGT- 

AAAAAGGGGTCACTATGGAGTTCAAAGGACAGAACTCCTGCCTG 653 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 
Db 82 0 GCATGGTGATGGAGAAGTTGCCATGGTGGTCGGAGGCC — 

GACTCCAGCCCGTGGCGCTG 763 

Qy 654 GTGACCGG GACAACCTGG CCATTCAGACCCGG- 

GGTGGCCC 693 



Db 762 AGCCAGGTCGTGGCTGGTGTTGGCAGCCTGGTGGCCTCCAT — 

GGTGCAGGTGAAGGTCC 705 

Qy 69 4 AGAAAAGCATGAAGTAAC TGGCTGG GTGCTGGTAT- 

CTCCTCTAAGTAAG 742 



Db 70 4 TGGAA — 

CGTGAGGTTGCGGATGGGCCGGCGCTCTGAGCAGGTCTGCACCTCGCCCCTCG 6 4 7 

Qy 7 43 GAAGATGC TGGAGAATATGA GTGCCATGCATCCAAT — 

TCCCAA 78 4 



Db 646 — 

AGCTGCGGTACCACGTCTTGTAGAAGGTCACATCGTGCCCTTTGTCCACAGGGCCCAA 5 89 

Qy 785 GGACAGGC TTCAGCATC AGCAAAAATTACAGTG- 

GT 819 



Db 588 

GAGCCTGCAGGTGAGGGTGACGTTCTGCCCCTCGGGACAGACATACAGGGAATACGGCGT 52 9 

Qy 82 0 TGATGCCTT ACATGAAATAC-CAGTGAAAAAAGG 

TGAAG 857 

I I I I I I I I I I I I I I I I I I I 

Db 528 

GGCGACCTTGAAGGCTGCCACCGGACCTGCTCAGAGAGAGGAGAGCCCTGTCACCTGACT 469 

Qy 858 GTGCC-GAGCTAT AAACCTCCA GAATATTATTAG — 

TCTGCATGGTT 901 



Db 46 8 GATCCAGTGCTACTCCTGGCAACC-CCAGAGCGGGGAAGAACCTTAGCCTCTGC- 

TACTT 411 

Qy 9 02 AAAAG TAGTCATG GATAACTACATTACCTGTTCTTGCCTAATAAGT 

TT 949 



Db 410 

CCCAGCCACAGGCCAGGGGCATGTCAGCTGACCTCCCTAAGACTGCCTTCTGCATTTGTT 351 

Qy 950 CTTTTAATCCAATCCA — CTA ACA CTT- 

TAG 9 77 



Db 350 

CACAAACCCCACTCCATCCTAGGCTGTTGTAGGGGGAGTCACAGAAGCTTCCCCAACTAG 2 91 

Qy 9 78 TTATATTCACTGGTTTT ACA CAGAGAAATAC 

AAAATAAAG 1017 

I I I I I I I I Ml I I I I I I I I 

Db 290 GGGTGCTC- 

CTGGATGCTGGACAGGGCAGGGAGAGACTTGGAGGGCAGTGATGCAGTCGG 232 

Qy 1018 ATCACACATC AAG AC T AT C T AC AAAAAT T T AT 

TAT AT ATT 1057 

I I I I I I I I I I I I I I I I I I I I 

Db 231 

GTGTGTCCCTCTTCCCAGTAGAGTATATGCAAAACCTGTTGGGTGGGAGGCAGGTTGGGG 172 

Qy 1058 TACAGAA GAAAAGCA — TGCATATCAT 

TAA 1085 



Db 171 

AGCAGGAAGGCTGTTTCGAACAGCAGCTCCTTGTCCTTGGCCGCTCTGTAGGGGCCATCA 112 

Qy 10 86 AC AAAT AAAAT AC T T T T T ATC AC AAAAAAAAAAAAA 1121 

I I I I I I I I I I I I I III 

Db 111 ACACCTCTGCCCCTTCCTA-CTCCTAGGACTGAACA 77 



RESULT 5 

US-10-478-914-64/C 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64
; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-64 



Query Match 42.0%; Score 2907; DB 1; Length 2120; 

Best Local Similarity 38.2%; Pred. No. 9.7e-173; 
Matches 678; Conservative 0; Mismatches 402; Indels 696; 
Gaps 98; 

Qy 1 GCCGCTGCC— ACC GCAC CC— 

CGCCATGGAGCGGCCGTCGCT 39 



Db 18 85 GCAGTTGCCCTTCCTTCTTTCTTTTTGCACATCTTCCGTCTCTAGGGTGAGGAGG — 

GGT 1828 

Qy 40 GCGCGCCCTGCTCCTCGGCGCCGCTGGGCTGCTGC — 

TCCTGCTCCTGCCCCTCT 92 

II III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 182 7 GCAGGCAC AGGCACC- 

CAAGACAGCTGCGGTCCAGCCCCGGCCCCACCTGTGG 1776 

Qy 93 CCTC — TTCCTCCTCTTCGGACACCTGCGGCCCCT GC- 

GAGCCGGCCTCCT 140 



Db 17 75 TCTCAGTGACGCCCCAGAGG- 

CCCCATCTTCCCCACATAATGAGGCTGCTCCATCCTCCT 1717 

Qy 141 GCCC GCCCCTGCCCCCG 

CTGGGCTGCCTG 169 

I I I I I I I I I I I I I I I I I I I 

Db 1716 

CAAAGCCCAGACCTATTTCATAAGCCCCAGACCCCACCTTCACCCAGGGCCCCAAGAGAA 1657 

Qy 170 CTGGGC — GAGACCC-GCGACGCGT- 

GCGGCTGCTGCCCTATGTGCGCCCGCGGCGAGGG 225 



Db 1656 

CAGAGCTGGAGACACTTCTACTCCTAGCACTGGATGCCCTCTCCCCTCCTGTGAAGCTGT 159 7 

Qy 226 CGAGCCGTGCGGGGGTGGCGG-CGCC GGCA 

GGGGGTACTGCGCGCCG 2 71 



Db 1596 AG GTG- 

GGGGGTGGAGGACACCCCTTTAAGGCAGGCTCGGGGGGCTATGAACTCCA 15 42 

Qy 2 72 GG CATGGAGTGCGTGAAGAGCC-GCAAGAGGCGGAAGGGTA 

AAGCC 316 

I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1541 AGACTCCTGGA 

AGCCAGTAAGAAGTGGGAAGGTAGGCACACCATTTGGGA 1492 

Qy 317 GGGG CAGC AGCC 

GGCGGTC- 335 



Db 1491 

TTTTTATTTTTCACTGAGCTAAGAGAGGAGATTCCAGCTATGGAATCCAGAGAGCGTTCT 1432 



Qy 336 CGGGTGTAAGCGG CGTGTGC- 

GTGTGCAAGAGCCGCTACC 3 74 



Db 1431 

ACAGGAACAGAACAGAAATTGGCTGGGCCCTGGGCCCAGAGGAGGCCTGGAGCCGCTTAC 13 72 

Qy 3 75 CGGTGTGCGGCAGCGAC GGCAC CAC 

CTACC 40 4 



Db 1371 

TGTTCAGTGCCAGCTTCATCCTTGGCACTCTGCCTCTCACACAGCATCGCTCCGGCTCCC 1312 

Qy 405 CGAGC GGCTGCC AGCTGC-GC-GCCG-CCAGCC— 

AGAGG 439 



Db 1311 

AGAGCCCACAGTGGCAGCCGCTGGAGGCTGTGGCAGCAGCTGCTGCCCTGCCCAAGGAGG 12 52 

Qy 440 GCCGAGAGC CGCGGGGAG — AAGGCCA 

TCACCCAGGTCA- 4 76 



Db 1251 

GATGTGGGCTTGGCTCTGACAGTTCAAGCCAGAAAGAGCCTTAGAAGTCATTCAGGACAT 1192 

Qy 477 GCA AGGGCACC- 

TGCGAGCAAGGTCCTTCCA 506 

II I I I I I I I I I I I I I I I I I I 

Db 1191 

TCCCATTTTACAGATGATGAAATAAGGCTCAGAGAGAGCAGGGGCTTTCAAGAACAATAA 1132

Qy 50 7 TAGTGAC GCCCCCCAAGGACATCTG- 

GAATGTCACTGGTGCCCA 549 

I I I I I I I I I I II I I I I I I I I 

Db 1131 AGGTGGAGACTGGAACTCAGGCCTCCT GTCTGCTGAGGGTGTC 

TGCCCATGC 1080 

Qy 55 0 GGTGTACTTGAGCTGTGAGGTCATCGGAATCCCGACAC 

CTGTC 592 



Db 1079 

TCTCTCTCAGAGTTTCTGAGTGGTGGTGTCTGGAGGGGCTGGACACGCAAGGGAGCTGGC 102 0 

Qy 593 CTCATCTGGAACAAGGTAAAAAGGG — GTCACTATGGAGT 

TCAAAGGACAGAAC 6 44 

I I I I I I I I II I II I I I I I I II III 

Db 1019 

CTCGTGTGTGTCTGTCTGTGCCGGGATGTGTCTGTGGCTTCTGTCTCTGAACGTGCCAGG 96 0 

Qy 6 45 TCCTG CCTG-GTGACCGGGAC- 

AAC 667 



Db 959 GGTTTTGTGTCCTGTGTGTGTCTCTCTGTGGGTGCTCCTGTGTG- 

CAGGAACTTGGGGCC 901 

Qy 66 8 CTGGCC AT TC AG AC— 

CCGGGGTGGCCCAGAAAAGCATGAAGT 70 8 



I I I I I I I I I II I I I I I I I I I I I I I I I

Db 900 ATGGCCAGAGAGGGATGGGCCTATTCCTACAATCTTGGGAGGCCTCCACAGGC- 

TGAGCT 8 42 

Qy 709 AACTGGCTGGGTGCTGG 

TATCTCCTCTAAGTAAGGAA 745 

I I I I I I I I I I I I I I I 

I I I I I 

Db 841 

GGTGGGGAGGGTCCTGAAGGGTAGAGTGAAGATGGGGACTTTACTCCCAGCGGAGAGGAA 7 82 
Qy 746 

GATGCTGGAGAATATGAGTG 765 

II I II I I I I II 

Db 781 

GAAACCAGAAGATAGGCTTGGGGCATGGATGGGAAAGGTCTTGGAACTAGGATGGGGCAA 72 2 

Qy 766 CCATGCATCCAAT TCCCAAG GACAGGCT 

TCAGCA 799 

I I I I I I I I I I I I I I I I I I I I 



Db 721 

GGGAAACAAAGCCCCAAATGGAGTATGGGTCCCTGGAAGTTGACACACTCTTAGCCCTCA 662 

Qy 800 T CAGCAAA AATTAC AGTGGT TGA — 

TGCC 826 

I I I I I I III I I I I I I I I I I I I 

Db 661 

TGGGGGCCCAGCCCAGTGGGTAACCCCGGGGAAGTGGTCACATGACGTGACACTTCTCAG 6 02 

Qy 82 7 TTACATGAAATACC 

AGTGAAAAAAGGT 853 

I I I I I I I I I I I I I I I 

Db 601 

CTGATTGCTTGAGATCCCCCAGTGTGGCCTGGGCCTTGTCCTTGAGTGAGCCCAGGTTCA 5 42 

Qy 854 GAAGG TGCC 

GAGCTATAAACCTCCAGAATATTATTAG 89 0 

I I I I I I I I I III I I I I I I I 

I 

Db 541 AGCCAGGAAGGCTGGCCAGCTGCCCAAGGACTGAGGCCTTCTCCTCCT- 

CCTCCTCTGTG 483 

Qy 891 TCTGCATGGTTAAAAGTAGTCA TGGATAACTACA 

TTACCT G 931 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 482 

TCCTCCTCGATCATCTTGGCCAGCTCCCGGGGCAGCTCCACGTCTCCACCTGCCATCTGG 423 

Qy 932 TTCTTGC-CTAATAAGT TTCTT 

TTAATC C 959 

I I I I I I I I I II I I I I I II II 

I 

Db 422 

ATCTGGCTCTCATCTGTCTCGTTCTTGGGTAGCCGGTATTTGTCTCGGAAGTGGCTCCGC 363 

Qy 96 0 AAT CCACTAACACTTTAGTTATATTCACTGG 

TTTTAC 996 



Db 362 

AGTGTGGCCCGCTCTGCCTTCCTCTGTGTGAACTGTGCATCCCGCTCCATCTTCTCTTCC 3 03 

Qy 99 7 ACAGAGAAATACAAAATAAAGATC — ACACATCAAGACT 

ATCTACAAA 10 42 

I I I I I III II I I I I I III 

Db 302 AC- 

GAGTTGCTTCTGATACTCCTCGTACTCCTCCCGGCTCATGCCCTGAGCTTCGGCTGC 2 4 4 
Qy 1043 

AATTTATTATATATTTACAGAAGAAAAGCATGCATATCATTAAACAAATAAAATACTTT- 1101
II I I I I I I I I I I I I I I I I 

Db 2 43 CGACTTGTCCCCATCTCCCTTATCCTCGCC- 

GCCTCCCAGGCTCCCAGTGAGGTTCTTCA 185 

Qy 1102 TTATCACAAAAAAAAAAA 1119

I I I I I I II I 
Db 184 GCTGGCCGCCCACCATGGTCTTCACCATGAACGCCA 149 



RESULT 6 

US-10-478-914-17 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17
; LENGTH: 2198
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-17 

Query Match 41.0%; Score 2840; DB 1; Length 2198;

Best Local Similarity 36.4%; Pred. No. 2.4e-168;
Matches 752; Conservative 0; Mismatches 343; Indels 970;
Gaps 119; 

Qy 1 GCCGC TGCCACCGCACCCCGCCA TGGAG- 

CGGCCGTC— 36 

I I I I I I II I I I I I I I I I II II 

Db 123 

GCCGCAAAGAATCAAGTTGCAATGAACCCCACCAACACAGTTTTTGATGCCAAACGTCTG 182 



Qy 37 

CCTCGGCGCC 61 



-GCTGCGCGCC CTGCT- 



Db 183 ATTGGACGCAGATTTGATGATGCTGTTGTCCAGTCTGATATGAAACATTGGC- 

CCTTTAT 241 

Qy 62 GCTGG GCTGCT GCTCCTGCTCC 

TGCCC 8 8 

I I I I I I I I I I I I I I I I III 

Db 242 

GGTGGTGAATGATGCTGGCAGGCCCAAGGTCCAAGTAGAATACAAGGGAGAGACCAAAAG 3 01 

Qy 89 CTCTCC TCTTC 

CT c 10 2 

I I I I I I I I I I II 

I 

Db 302 

CTTCTATCCAGAGGAGGTGTCTTCTATGGTTCTGACAAAGATGAAGGAAATTGCAGAAGC 361 

Qy 103 CT — CTTCGGACACCTGCGGCCCCTGC GAGCCGGCCTCC- 

TGCCCGCCCC 149 

II I I I I I I III II III I I I I I I I I I II 

I 

Db 362 

CTACCTTGGGAAGACTGTTACCAATGCTGTGGTCACAGTGCCAGCTTACTTTAATGACTC 421 

Qy 150 TGCCCCCGCTGGGCTGCC TGCTGG 

GCGAGACCCGCGACGCGTG 192 

I I I I I I I I I I I I I I I I I I I I I II 

Db 422 T— 

CAGCGTCAGGCTACCAAAGATGCTGGAACTATTGCTGGTCTCAATGTACTTAGAATT 4 79 

Qy 193 CGGCTGCTGCC-CTATGTGCGCCCGCGGCGAGGGC 

GAGC- 230 

I I I I I I I I I I I I II I I I I II 

I I I I 

Db 48 0 ATTAATGAGCCAACTGCTGCTGCTAT 

TGCTTACGGCTTAGACAAAAAGGTTGGAGCA 536 

Qy 231 CGTGC GGGGGTGGC GGCG — 

CC 250 

I I I I I I I I I I I I I II II 

Db 537 

GAAAGAAACGTGCTCATCTTTGACCTGGGAGGTGGCACTTTTGATGTGTCAATCCTCACT 596 

Qy 251 GGCAG GGGG TACTGC 

GCGCCGGGCATGGAGTGCGT 2 85 

III III I I I I I III I I I I I I I 

I 

Db 59 7 ATTGAGGATGGAATCTTTGAGGTCAAGTCTACAGCTGGAGACACC-CACTTGG-GTG- 

GA 653 

Qy 2 86 GAAG AGCCGCAAG 

AGGCGGAAGGGTAAAGCCGGGGCAGC 32 4 

I I I I I I I I I I I I I I I I I I I I III 

Db 654 GAAGATTTTGACAACCGAATGGTCAACCATTTTATTGCTGA — 

GTTTAAGCGCAAGCATA 711 

Qy 325 AGCCGGCGGTCCGGG TGTAAG 

CGGCGTGTGCGTGTG 360 

I I I I I I I I I I I I I I I I I I I I I 



AGAAGGACATCAGTGAGAACAAGAGAGCTGTAAGACGCCTCCGTACTGCTTGTGAACGTG 7 71 

Qy 361 C-AAGAG CCGCT ACCCGG TGTG 

CG 383 

I I I I I I I I I I I I I I I I I 

I 

Db 772 

CTAAGCGTACCCTCTCTTCCAGCACCCAGGCCAGTATTGAGATCGATTCTCTCTATGAAG 831 

Qy 384 GCAGCGACGGC ACCACC — TACCCGAGC GGCTG- 

CCAGC 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 832 

GAATCGACTTCTATACCTCCATTACCCGTGCCCGATTTGAAGAACTGAATGCTGACCTGT 891 

Qy 42 0 TGCGCGCCGCC AGCCAGAGG GC CGAGA- 

GCCG 450 

I I I I I I I I I I I I I II I I I I I I I I 

Db 892 

TCCGTGGCACCCTGGACCCAGTAGAGAAAGCCCTTCGAGATGCCAAACTAGACAAGTCAC 951 

Qy 451 CGGGGAGAAGGCC ATCACCCAGG- 

TCAG 477 

II III I I I I I I I I I I I 

Db 952 

AGATTCATGATATTGTCCTGGTTGGTGGTTCTACTCGTATCCCCAAGATTCAGAAGCTTC 1011 

Qy 478 -CAAGGGCACCTGC GAGCAAGGTCC 

TTC 504 

I I I I I I I I I I I I II 

I I 

Db 1012 

TCCAAGACTTCTTCAATGGAAAAGAACTGAATAAGAGCATCAACCCTGATGAAGCTGTTG 10 71 

Qy 505 C — ATAGTGACGCCCCCCAAG — GACATCTGGAATGTCACTGGTGCCCAG 

GTG 553 

I I I I I I II I I I I I I I I I I III I I I I I I I I 

I I 

Db 10 72 CTTATGGTGCAGCTGTCCAGGCAGCCATCT TGT — 

C T GG AG AC AAG T C T G AG AAT G 1125 

Qy 554 TACTTGA GCTGT GAGGTCA — TCGGAATCCC 

GACAC 58 7 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1126 

TTCAAGATTTGCTGCTCTTGGATGTCACTCCTCTTTCCCTTGGTATTGAAACTGCTGGTG 1185 

Qy 58 8 CTGTCCTCATCTGGAACAAGG 

TAAAAAGGGGTCACTATGGA 62 8 

I I I I I I I I I I I I II II III I I I I I 

Db 1186 GAGTCATGACTGTCCTCATCAAGCGTAATACCACCATTCCTACCAAGCAGACAC — 

AGAC 12 43 

Qy 629 GTTCA 

AAGGACAGAACTCCTGCCTGGTG 656 

I I I I III I I I I I I I I I I 

Db 1244 

CTTCACTACCTATTCTGACAACCAGCCTGGTGTGCTTATTCAGGTTTATGAAGGCGAGCG 13 03 



Qy 657 -ACCGGG ACAACCT GGC 

CATTCAG- 6 79 

III I I I I I I I I I I I I I 

I 

Db 1304 

TGCCATGACAAAGGATAACAACCTGCTTGGCAAGTTTGAACTCACAGGCATACCTCCTGC 1363 

Qy 680 -ACCCGGGGTGGCCCAGAAAAGCATGAAGTAAC TGGC-TGGGTGC TGGTAT- 

CT 730 

I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1364 ACCCCGAGGTGTTCC 

T C AG AT T GAAG T C AC T T T T G AC AT T G AT GCC AATGGT AT AC T 142 0 
Qy 731 C 

CTC 734 

I III 

Db 1421 

CAATGTCTCTGCTGTGGACAAGAGTACGGGAAAAGAGAACAAGATTACTATCACTAATGA 1480

Qy 735 

TAAGTAAGGAAG A 747 

I II I I I I I I I 

I 

Db 1481 

CAAGGGCCGTTTGAGCAAGGAAGACATTGAACGTATGGTCCAGGAAGCTGAGAAGTACAA 15 4 0 

Qy 748 TGCTG GAG AAT AT GAG T GC C AT G CATCC AATTCCCAAG-GAC 

AGG 791 

I I I I I I I I I I II I I I I I I I I I I I I I I I III 

I 

Db 1541 

AGCTGAAGATGAGAAGCAGAGGGACAAGGTGTCATCCAAGAATTCACTTGAGTCCTATGC 16 0 0 

Qy 792 CTTCAGCAT — CAGCAA AAATTACAGTG 

GTTGATGC 825 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1601 

CTTCAACATGAAAGCAACTGTTGAAGATGAGAAACTTCAAGGCAAGATTAACGATGAGGA 1660

Qy 826 CTTACATGAAAT ACCAGTGAAAAAAGGTGA 

AGG 858 

I III III II MM II I II II 

Db 1661 

CAAACAGAAGATTCTGGACAAGTGTAATGAAATTATCAACTGGCTTGATAAGAATCAGAC 172 0 

Qy 859 TGCCGAG C TAT AAAC CTC — CAGAA 

TATTATTAGTCTGCA 896 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1721 

TGCTGAGAAGGAAGAATTTGAACATCAACAGAAAGAGCTGGAGAAAGTTTGCAACCCCAT 1780

Qy 89 7 TGGTTAAAAGT AGTCAT GGATAACTAC 

ATTACC 929 

II I I I I I I I I I I I I I I I III 

I I 

Db 1781 

CATCACCAAGCTGTACCAGAGTGCAGGAGGCATGCCAGGAGGAATGCCTGGGGGATTTCC 18 4 0 



Qy 930 T 

GTTCTTGCCTAATAAGTTTCTT 952 

I I I I I I I I I I I I I 

Db 1841 

TGGTGGTGGAGCTCCTCCCTCTGGTGGTGCTTCCTCAGGGCCCACCATTGAAGAGGTTGA 19 0 0 

Qy 953 TTAATCCAA TCCAC 

T AAC AC T T TAG T T 9 79 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1901 

TTAAGCCAACCAAGTGTAGATGTAGCATTGTTCCACACATTTAAAACATTTGAAGGACCT 196 0 

Qy 980 ATATTC ACTG GTTTTACACAGAGAA- 

ATACAAAATAAAGATCACAC 102 4 

I I I I I III I I I I I I I I I II I I I I I I I I 

Db 1961 

AAATTCGTAGCAAATTCTGTGGCAGTTTTAAAAAGTTAAGCTGCTATAGTAAGTT 2 015 

Qy 1025 ATCAAGACTATCTACAAAAATTTATTAT AT AT T T AC AG AAG 

AAAAGCA-T 10 73 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 016 -ACTGGGCATTCT- 

C AAT AC T T G AAT AT GG AAC AT ATGC AC AGGGGAAGGAAAT AAC AT T 2 0 73 

Qy 10 74 GCATATCATTAA 

ACAAATAAAATACTTTTTAT 1105 

I I I I I I I I I I I I I I I I 

I I I I 

Db 2074 

GCACTTTATAAACACTGTATTGTAAGTGGAAAATGCAATGTCTTAAATAAAACTATTTAA 2133 

Qy 1106 CACAAAAAAAAAAAAAAAA 1124 

III I I I I I I I I I I I I I I 

Db 213 4 AATTGGCACCATAAAAAAAAAAAAA 2158 



RESULT 7 

US-10-478-914-61/C 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61
; LENGTH: 2437
; TYPE: DNA
; ORGANISM: Homo sapiens



US-10-478-914-61 



Query Match 40.9%; Score 2833.5; DB 1; Length 2437;

Best Local Similarity 38.4%; Pred. No. 1e-167;

Matches 704; Conservative 0; Mismatches 390; Indels 739; 
Gaps 116; 

Qy 2 CCG CTGCCA CCGCA 

CCCCGCCA 23 

III I I I I I I I I I I I I I I 

Db 1813 



CCGAGCTCCAGCAATATCCCGAAGTGAAGTAAAGAGTTCATCCACGGCAATTTTGGATTT 175 4 

Qy 2 4 TGGAGCGGCC— GTC GCTGC 

GCGCCCT 48 

I I I I I I I III I I I I 

Db 1753 

GTGAAATGTCAGATTAATTGAAGAGTCCCAGTCCGTAATTTTTTGCAGCAAAGGTGACAA 169 4 

Qy 49 GC TCCTC GGCGCCGCTGGGCTGCTGC — TCCTG 

CTCCTGC 86 

I I I I I I I I I I I I I I III I I I I I I I I 

Db 1693 TCCAGAT TCCTC AC AGAAGAC- 

CTGAGCCAATGCTTTCTTGACTTTTTCTTCTGCTTCAC 1635 

Qy 8 7 CC CTCTCC 

TC- 96 



Db 1634 

ACAGGTCTTTGTCTACCGTGAGCTCTCCAGTGACAGAATAAATATATATGAGAAGGATCA 1575

Qy 9 7 TTCC TCCTC TTC GGACAC 

CT 116 



Db 1574 

GCAGTTCCTCAGGGCTGTAGTCCTCGTTGGTTCTCTGGGTTACAGGCTTAATCATGGGCA 1515 

Qy 117 GCGGC CCCTGCGAGC CGGCCTCC TGCCC 

GCCCCT 150 



Db 1514 

GCAGCTGATTTAACACAACGGACATTGCTGACTCCCCAATGCTCTGAAGAAGGAGCCTTT 1455 

Qy 151 GCC CCCGCTGGGCTGCCTG CTGGGC- 

GAGA 179 



Db 1454 

CAAAAGCCAGAAAGTTGTCCCACTTGGCAGTCTGTGGGTGTTTCAACGTTTGAGCTGTGG 1395 

Qy 180 CCCG-CGACGCGTGCGGCTGCTGCC CTATG 

TGCGCCCGCGGC 220 



Db 1394 

CCAGTCCAAGCTGGAGGAGGCCACAATGATTCATTAGAGCTTTGAGGTTGTTCTTGAAGA 1335 



Qy 221 G AGGGC— GAGCCGTGCGGGGGTGGCGGCGCC 

GGC 253 

I I I I I I I I I I I I I I I I I I III 

I I I 

Db 133 4 GCTGAATATAGGACATGAGCTGTCCCGGTGTGAC- 

TCTCCCCATACTCATCTTGATTGGC 1276 

Qy 254 AGGGGGT- 

ACTGCGCGCC G 271 

III I I I I I III 

I 

Db 1275 

AGGTTTTCTCTGCTTGCCGCTTCCACTAGATGTCTCCGAACTTCCATCACTGCCTCTTTG 1216 

Qy 2 72 GGCATGGAGTGC-GTGAAGAGCCGCAAGAGGC GGAAG- 

GGT 310 

I I I I I I I I I I I I III III II I I I I 

Db 1215 

TGCTTAGTGTTCAGTAAAGCTTCCCATAGGGCTTTGGCTGTGGTGTCACTGAATTGTGAA 1156 

Qy 311 — AAAGCC-GGGGCAGCAGC CGGCGGTCCGG 

GTGTAAGC- 3 46 

I I I I I I I I I I I I I I I I I I III 

I I 

Db 1155 

AGACAGCCTGGTGCAACCACATTATAATTTTCCTCCTCAGTATGGAGTGCAGTGAGCGCT 1096 

Qy 347 GGCGTGTGCG- 

TGTGCAAGAGC 36 7 

III II I I II I III II 

Db 1095 

ATCATGTTAACCATCACATCATTTGTGTGGCCTGGGAGCTGGGGAAGTGCTGAAATGATC 1036 

Qy 36 8 — CGCTAC CC — GGTGTGCGGCAGCGACGG CA 

I I I I I I I I I I I I I I I I I II 

I I 

Db 1035 

TTCTCTACTAAGTTGTCTCCATGATGTCCAACTGCTCCTGTGAGATCCAGGGTTCTGTCC 9 76 

Qy 39 8 ACCTACCCGAGCGGCTGCCAGCTGCGCGCCGC CAGCCAG 

AGGGC- 441 

II I I I I I I I I I MM I I I I I III 

I I I I 

Db 9 75 ACAAAAACCA-CTGATGCC — 

CTGCCTGCAGCAGTCTTCTTCCTGTTCTTTGCAGGGGCA 919 

Qy 4 42 CGAGAGCCGCGGGGAGAAGGCCATCACCCAGG TCAGCAAGG- 

GCACCT 48 8 

I I I I I I I I I II I I I III I I I I I I 

I I I 

Db 918 TAATTGGCCAGATCCGC — 

AGCGATGACCTGACT T AAG G AAC C T AC AGC AAAAC AC T C C T 861 

Qy 489 -GCGAGC AAGGTCCTTCCATAG TGACGCC — 

CCCCAAGGACATCTGGAATGT 53 7 

II I I I I I I I I II I I I I I I I I I I I I I I I I I 

I I 

Db 86 0 CCCGTACTCCTAAATGTTCACACAGAGAACTGAGGCCTGACACTAGG- 

CATCTGATCTGC 8 02 



Qy 53 8 CACTGGTGCCCAGGTGTACTTGAGCTGTGAGGTCATC 

GGAATCCCGAC 585 

II I I I I I I I III III III I II I I I I 

Db 801 AGCAGCAGCTCTGGGGT — 

TAGCGTAGTGGAGTCCACATCACCCAGGCTTCCCAGCTTCC 7 4 4 

Qy 586 ACCTGTC CTCAT-CTGG 

AACAAGGTAAAAAGGG 617 

I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 743 

TCTTGTCCGGTCGGGCGCTATTAAGGAGGTGCACATCCTGGGGTAGCAGTGGGAAAAGGG 6 8 4 

Qy 618 GT CACTATGGAGTTCAA 

AGGACA- 6 40 

I I I I I I I I I I 

I I I I I 

Db 683 

ATGCAAAAGCTGGAGTCAAGGCAAAGTGGGGAGCAACAGGGGCAAGCAATAACGGGACAT 62 4 

Qy 641 -GAACTCCT — GCC 

TGGTGACCG 66 0 

I I I I I I I III I I I I I I 

Db 623 

GGAACACCTCGGCCGTGTAGTTCATGTTGCCCATCCATTCACACAGCTTCTCCTCCAGCT 56 4 

Qy 661 GGACAAC — CTG — GCCATTCAGACCCGG — GGTG— GCC CA- 

GAAAAG 700 

I I I I I III III II I I I I III III II 

I I 

Db 563 

GCTCGAACACCGGCTGCTGCCCCTCCATCTCGGCCGCTGCCGCCGCTGGGACATGATTAG 50 4 

Qy 701 C-ATGAAGT AAC-TGGCTGGGTGCTGGTATCTCCTCTAAGTA- 

AGGAAGATGCTGG 753 

I I I I I I I I I I I I I III I I I I I III I I I I I 

I I I 

Db 503 

CTGTGAGGTGGACAGCGTGGCTCACGGTTGTGACCACCACACAATACTGGAAGTGACTGC 4 4 4 

Qy 75 4 AGAATATGAGTGCCATGCATCCAATTCCCAAGGACAGGC— TTCAGCA 

TCAG — 8 03 

I I I I I I II II II I I I I I I I I I I I I I I I I 
Db 4 43 GGCAGATGATGTCC 

CGTAGGATCTCCACGGTCCGGCCTTTCAGCAGGCAGCTCAGCA 3 8 7 



Qy 804 CAAAAATTACAGTGGTTGATGCCTTACATGAAATACCAGT 

GAA A 8 47 

I I I I I I I III I I I I I II I I I I I I II 
Db 3 86 CAAACACTGCCTTGG — GCTGCTTGGC- 

TCCACCACCAATTGCGTCGGGCTCGAACTCTC 330 

Qy 8 48 AAAGGTGAAGGTGCCGAGCTATAAA CCTCCAGAATATTATTAG- 

TCTGCA 896 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 329 GCAGGTGACAGT- 

CAGGGCCCCCCACCGCCTCCAGGAGACGGGTGGATCCGCAGCCCCAG 2 71 

Qy 89 7 TGGTT AAAAGTAGTCATGGATAACTACATTACCTGTT 



CTT 936 

III I I III II I I I I I I I I I I I III 

Db 2 70 TGCAGGCTCTCGGCGCAGGCGGCGTCCAGGTAAACCACA — 

GCCCGTTTCACTTTGGCCA 213 

Qy 93 7 — GCCTAATAAGTTTCT TTTAATCCAATCCAC TAACACT-TTAGTT — 

AT 981 

I I I I III II I I I I I I I I I I I I 

Db 212 

GCACCTGCTCCCATCCTTGCTGGGTAAAGGACAGTACGCCCGAGGCGCTCATGGTTGGGG 153 

Qy 9 82 ATTCACTGGTTT TAC-ACAGAG-AAATACAAAATAAAGATCACA 

CATCAAGAC 1032 



Db 152 

ATTCGCAGACTTGGGAAACTACGGTGCAGGAACTTCTTTCAGAACTCACCGCTTCCGGAA 93 

Qy 1033 TAT C T AC AAAAAT T T AT TATA- T AT T T AC AG AAG A- 

AAAGCATGCATATCATTAAAC 1087 



Db 92 ATTGGGCTCCGGGAGACTTTGACAGTCTCCACAGTACACCCAGTAGGCC- 

AACAGTGCTC 3 4 

Qy 1088 AAATAAAATAC T T T T T AT C AC AAAAAAAA 1116 

I I I I I I I I I I I I I I I I 

Db 33 GAG - G AAT T CC GC AGC T T T T AGAGC AGAAG T AA 2 
Qy



1 GCCGCTG- 



-CCACC- 



-GCACC-CCGCCATGGA- 



Db 1881 GCC- 

CTGAGGAAGCACCACCAGAGGGAGGAGCTCCACCACCAGGAAATCCCCCAGGCATT 1823 

Qy 33 CGTCGCTGCGCGC — CCTGCTCCTCGGCGCCGCT GGGCTGC- 

TGC 7 4 

III II II I I I I I I II I I I I I I I I I I 

I 

Db 1822 

CCTCCTGGCATGCCTCCTGCACTCTGGTACAGCTTGGTGATGATGGGGTTGCAAACTTTC 1763 

Qy 75 TCCTGCTCCTGC 

CCCTCTC 93 

I I I I I I I I I I I I I I I 

Db 1762 

TCCAGCTCTTTCTGTTGATGTTCAAATTCTTCCTTCTCAGCAGTCTGATTCTTATCAAGC 17 03 

Qy 9 4 CTCT— TCC 

TCCTCTTCG 109 

III III I I I I I I I I 

Db 1702 

CAGTTGATAATTTCATTACACTTGTCCAGAATCTTCTGTTTGTCCTCATCGTTAATCTTG 16 43 

Qy 110 GA CACCTGCGGCCCCTGC GAGCC 

GGCCTCCTG 141 

II I I I I I I III III I I I I I 

Db 1642 

CCTTGAAGTTTCTCATCTTCAACAGTTGCTTTCATGTTGAAGGCATAGGACTCAAGTGAA 1583 

Qy 142 CCCGCCCCTGCCC CCGCTGGG 

CTGCCTGC 170 

I I I I I I I I I I I I I I 

I 

Db 1582 

TTCTTGGATGACACCTTGTCCCTCTGCTTCTCATCTTCAGCTTTGTACTTCTCAGCTTCC 1523 

Qy 171 TGGGCGAGACCCGCGACG 

CGTGCGGCTGCTGCCCT 205 

I I I I I I I III I I I I I I I I I I 

Db 1522 

TGGACCATACGTTCAATGTCTTCCTTGCTCAAACGGCCCTTGTCATTAGTGATAGTAATC 1463 

Qy 2 06 ATGTGCGC CCGCGGCGAGGGC GAG 

CCGT 233 

I I I I I I I I I I II III III 

Db 1462 

TTGTTCTCTTTTCCCGTACTCTTGTCCACAGCAGAGACATTGAGTATACCATTGGCATCA 14 03 

Qy 23 4 — GCGGGGGTGGC GG — CGCC GGCAGGGGGTA 

CTG 264 

I I I I I I I I I I I I I I I I I I I I 

Db 1402 

ATGTCAAAAGTGACTTCAATCTGAGGAACACCTCGGGGTGCAGGAGGTATGCCTGTGAGT 13 43 

Qy 265 CGCGCCGGGCATGGAGT 

GCGTGA 287 

I III I I I I II 



TCAAACTTGCCAAGCAGGTTGTTATCCTTTGTCATGGCACGCTCGCCTTCATAAACCTGA 12 83 

Qy 288 AGAGCCGCAAGAGGC 

GGA 305 



Db 1282 

ATAAGCACACCAGGCTGGTTGTCAGAATAGGTAGTGAAGGTCTGTGTCTGCTTGGTAGGA 12 23 

Qy 3 06 AGGG TAAAGCC GGGGCAG CAGCCGGCGGT 

CCGGGT 340 



Db 1222 

ATGGTGGTATTACGCTTGATGAGGACAGTCATGACTCCACCAGCAGTTTCAATACCAAGG 1163 

Qy 3 41 GTAAGCGGCGTGTGCGTGTGCAAGAGCCGCTACCCGGTGTGCG 

GCAGCGACGGCACC 39 7 



Db 1162 G AAAG AG G AG T G ACATCCAAGAGCAGCAAATC- 

TTGAACATTCTCAGACTTGTCTCC 1107 

Qy 39 8 —AC CTACCCGAGCGGCTGC CAGC- 

TGCG 423 



Db 1106 

AGACAAGATGGCTGCCTGGACAGCTGCACCATAAGCAACAGCTTCATCAGGGTTGATGCT 10 4 7 

Qy 42 4 CGCCGCCAG CC AGAGGGCCGAGAGCCGC 

GGGGAGAAG 46 0 



Db 1046 

CTTATTCAGTTCTTTTCCATTGAAGAAGTCTTGGAGAAGCTTCTGAATCTTGGGGATACG 9 8 7 

Qy 461 GCCATCACCCAGGTCAG CAAG 

GGCACCTGC 49 0 



Db 986 

AGTAGAACCACCAACCAGGACAATATCATGAATCTGTGACTTGTCTAGTTTGGCATCT-C 92 8 

Qy 491 GAGCAAGGTCCTTCCATAGTGACGCC CCCC AAGGACA 

TCTGGAATG 536 



Db 927 G 

AAGGGCTTTCTCTACTGGGTCCAGGGTGCCACGGAACAGGTCAGCATTCAGTTCTT 8 71 

Qy 53 7 TCACTGGTGCCCAGGTGTACTTGAGCTGT- 

GAGGTCATCGGAATCC 581 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 70 CAAATCGGGCACGG — GTAATGGAGGTATAGAAGTC 

G AT T C C T T CAT AG AG AG AAT 817 

Qy 582 CGACACCTGTCCTCATCTGGAACAAGGTAAAAAGGGGTCACT 

ATGGAG 629 

III I I I I I I I I I I I I I I I I I I I II 



Db 816 

CGATCTCAATACTGGCCTGGGTGCTGGAAGAGAGGGTACGCTTAGCACGTTCACAAGCAG 75 7 

Qy 63 0 TTCAAAGG ACAGAACTC 

CTGCCTGGTGACCGGGACAACCTGGCCATTCA 678 



Db 756 

TACGGAGGCGTCTTACAGCTCTCTTGTTCTCACTGATGTCCTTCTTATGCTTGCGCTTAA 69 7 

Qy 6 79 GACCCG GGGTGGCC CAG AAAAGCAT 

GAAGT 70 8 



Db 696 

ACTCAGCAATAAAATGGTTGACCATTCGGTTGTCAAAATCTTCTCCACCCAAGTGGGTGT 63 7 

Qy 709 AACTGGCTGGGTGCT GGTATC — TCCTCTA-AGTAAGGA- 

AGATGCTGGAG 755 

I I I I I II I II I I I I I I I I I I I I I 

Db 636 

CTCCAGCTGTAGACTTGACCTCAAAGATTCCATCCTCAATAGTGAGGATTGACAC 5 82 

Qy 756 AATATGAGTGCCATGCATCCAATTCCCAAGGACAGGC 

TTCAGCATCAGC 804 

I I I I I I I I I I III I I I I I I II I I I I 

Db 581 ATCAAAAGTGCCAC-CTCCCAGGT — 

CAAAGATGAGCACGTTTCTTTCTGCTCCAACCTT 525 

Qy 8 05 AAAAATTACAGTGGTTGATGCCTTACATGAAATAC- 

CAGT 8 43 



Db 524 

TTTGTCTAAGCCGTAAGCAATAGCAGCAGCAGTTGGCTCATTAATAATTCTAAGTACATT 465 

Qy 8 44 GAAAAAAGGTGAAGGTGCCG-AGCTAT — AAACCT — C- 

CAGAATATTATTA 889 

III II II I I I I I I I I I I I I I I I I I I I I 

Db 464 

GAGACCAGCAATAGTTCCAGCATCTTTGGTAGCCTGACGCTGAGAGTCATTAAAGTAAGC 4 05 

Qy 89 0 GT CTGCATGGTTAAAAGTAGT — CATGGATAACTAC 

ATTACCTG 931 

II I I I I I I I I I I I I I I I I I III 

Db 404 

TGGCACTGTGACCACAGCATTGGTAACAGTCTTCCCAAGGTAGGCTTCTGCAATTTCC— 3 4 7 

Qy 932 TTCTTGCCTAATAAGTTTCTTTTAATCCA-ATCCACT 

AACACTTTAGTTATATTC 9 85 



Db 3 46 TTCAT-CTTTGTCAGAACCATAGAAGACACCTCCTCTGGATAGAAGCTTTTGGTCTC- 

TC 289 

Qy 9 86 AC T GG T T T T AC AC AG AGAAAT 

ACAAAATAAAGATC 102 0 

I I I I I I I I III I I I I I I I I I 

Db 288 



CCTTGTATTCTACTTGGACCTTGGGCCTGCCAGCATCATTCACCACCATAAAGGGCCAAT 2 2 9 



Qy 1021 AC AC AT C AAG AC T - AT C T AC AAAAAT TTATTATATATTT 

ACAGAAGAAAAGCA 10 72 



Db 22 8 GTTTCATATC- 

AGACTGGACAACAGCATCATCAAATCTGCGTCCAATCAGACGTTTGGCA 17 0 

Qy 10 73 TGCATA TC AT T AAAC AAAT AAAAT AC T T T 

TTATCACAAAA 1112 



Db 169 T-CAAAAACTGTGTTGGTGGGGTTCATT — GCAACT- 

TGATTCTTTGCGGCATCACCGAT 114 

Qy 1113 AAAAAAAAAA 1122 

I I I 
Db 113 CAACCGTTCA 104 
Qy 1 GCC-GCT GCCACC

GCACCCCGCCATGGAGCGGCCGTCGCTGCGCGCCCT 4 8 



Db 1 GCCTACTGGATTCGCCATCATTTTGCACACTGCGA-GGGGC-TCCGT — 

GTGTGCGCCCT 56 

Qy 49 G-CTCCTCGGCGCCG — CTGGGCTGCTGCTCCTGCTCCTGCC-CCTCT 

CCTCTTC 99 

III I I I I I I I II I I I I I I I I I I I I I I I I I I I I 



Db 57 GTCTTGTCTG- 

GCCGTCCTCATCCCTTCCCCCACCCCCTGCCGCCACTCCGACCCGCTCC 115 

Qy 100 CTCCTCT TCGGACACCTGCGGCCC CTGCGAGCCGG — CCTC- 

CT- 140 

I I I I I I I I I I I I I I I I I I I III III 

I I 

Db 116 

CAAAGTGGCTTCACAATAGTCGGTCCTCGGCGGTGTAGGCTGCGCACCAGGTCCACACTT 175 

Qy 141 GCCCGCCC CTGCCCCCGCTGGG 

CTGC— 166 

I I I I I I III II I I I I III 

Db 176 AAGCGAAATCAAGGAGACC- 

CCCTTCTACTTCTACCTTTGGGTTTGGTGCTCAATGCGAA 23 4 

Qy 16 7 -CTGCTG-GGCG-AGACCCG 

CGACGCGTGCGG 195 

I I I I I I I I I I I I I I I I I I I I I 

Db 235 

GCTGCTGCAACTCAGACACGCCTAAGTCAACTCATGCAGAAAAAGGAGAAAAGTTTTGGT 2 9 4 

Qy 196 CTGCTGCCCTATGTGC-GCCCGCGGCGAGGGCGAGCCGTGCGGGGG 

TGGCGGC- 247 

I I I I I I I I I I I I I I I I I I I I II I I I I I 

I I 

Db 295 AT AC AAAT GC T C T C AG T C C AGC C AG AC AC 

CAAGCCGAAAGGTTGTGCTGGCTGCA 349 

Qy 2 48 GCCGG CAGGG GGTA-CTGC GCGC 

CGGGCATGGAG — T 281 

III I I I I I I I I I I I III I I I I I I I I I 

I 

Db 350 

ACCGAAAGATCAAGGACCGGTATCTTCTAAAGGCACTGGACAAATACTGGCATGAAGACT 4 09 

Qy 2 82 GCGTGAAG AGCCGCAAGAGGCGGAAGGG 

TA 311 

I I I I I I I I I I I III I III 

I I 

Db 410 

GCCTGAAGTGTGCCTGCTGTGACTGTCGCTTGGGGGAGGTGGGCTCCACCCTGTACACTA 469 

Qy 312 AAGC CGGGGCAGCAGCCGGCGGTCC — GGGTGT 

AAGCG 3 47 

I I I I III I I I I I I I I I II II 

I 

Db 470 

AAGCTAATCTTATCCTTTGTCGCAGAGACTATCTGAGGCTCTTTGGTGTAACGGGAAACT 52 9 

Qy 3 48 GCGTGTGCGTGT-GCAAGAGC — CGCTACC CGGTG-TGCG- 

GCAGCGACGGC- 39 4 

III I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 53 0 GCGC-TGCCTGTAGTAAGCTCATCCCTGCCTTTGAGATGGTGATGCGTGC — 

CAAGGACA 586 

Qy 395 ACCACCT-ACCCGAGCGGC- 

TGCCAGCTGCG 423 



I I I I I I I II I I I I I I I I I I

Db 587 

ATGTTTACCACCTGGACTGCTTTGCATGTCAGCTTTGTAATCAGAGATTTTGTGTTGGAG 6 46 

Qy 42 4 -CG CC GCC — AGCCAGAGGGCCGAGAGCCGCGG — 

GG- 455 

I II II I I I I I I I I I I I I I I 

I I 

Db 6 47 ACAAATTTTTCCTAAAGAATAACATGATCCTTTGCCAGACGGACTA 

CGAGGAAGGT 702 

Qy 456 AGAAGGCCAT-CA-CCCAGGT CAG 

CAAGGGCACC 48 7 

I I I I I I I I I I I I I I I I I II III I I I I 

Db 703 

TTAATGAAAGAAGGTTATGCACCCCAGGTTCGCTGATCTATCAACATCACCCCATTAAGA 762 



Qy 48 8 -TGC-GAGCAAGGTCCTTCCATAGTGAC 

GCCCCCCAAGGACATCTGGAATGTCA 539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 763 ATACAGAGCAC — TACATTCTTT- 

TATCTTTTTTGCTCCACATGTACATAAGAATTGACA 819 

Qy 540 CTGGTGCC CAGGTGTACTTGAGCTGTGAGGTCATC- 

GGAATC 58 0 

I I I I I I I I I I III I I I I I I I I I I 

Db 820 

CAGGAACCTACTGAATAGCGTAGATATAGGAAGGCAGGATGGTTATATGGAATAAAAGGC 8 79 

Qy 581 — CCGACACCTGT CCTCA — 

TCTG G 601 

I I I I I I I I I I I I I I 

I 

Db 880 

GGACTGCATCTGTATGTAGTGAAATTGCCCCAGTTCAGAGTTGAATGTTTATTATTAAAG 939 

Qy 6 02 AACAAGGTAA AAAGGG 

GTCAC— 622 

I I I I I I I I I I I I I I I I I 

Db 940 

AAAAAAGTAATGTACATATGGCTGGATTTTTTTGCTTGCTATTCGTTTTTGTGTCACTTG 999 

Qy 623 

TATGGAGTTCAAAG 636 

I I I I I 

I I I I 

Db 1000 

GCATGAGATGTTTATTTTGGACTATTGTATATAATGTATTGTAATATTTGAAGCACAAAT 1059 

Qy 63 7 G AC AG AAC T C C T G CC — TGGTG- 

ACC 659 

I I I I I III I I I I I I I I 

Db 1060 

GTAATACAGTTTTATTGTGTTACCATTTGTGTTCCATTTGCTTCTTTGTATTGTTGCATT 1119 

Qy 66 0 -GGGACAACCTG GC CATTCAG 

ACCCG 684 

I I I I I I I I I I I I I I I I I 

Db 1120 



TAGTACAATCAGTGTTTAAACTTACTGTATATTTATGCTTTCTGTATTTACCAGCTATTT 1179 



Qy 6 85 GGGTGG- 

CCCAG 695 



Db 1180 

TAAATGAGCTGTAACTTTCTAGTAAAGAATTGAAAAGCAAATCTCACTAATGATACACAG 1239 

Qy 696 A AAAGCATG 

AAGTAACTGGCTGGG — 719 

Db 1240 

ATAGATAAAGCAAGTCTATCAACATTAAAAATACTAAAAAATAAAGACACACACAGAGCA 1299

Qy 72 0 TGCTGGTATC-TCCTCTAAGTAAGGAAGAT GCTGGAGAATATGAGTG- 

CCATG 770 

I III I I I I I I I II I II I I I I I I I I I I I I 

Db 1300 

TTTTAGTGACATCCACTACTTATTGCCGCTATGAGTTAGAGTCTATCAGTGTTCTTGTTA 1359 

Qy 771 CATCCAATTCCCAAGGACAGGCTTCAGCATCAGC 

AAAAATTACAGTGG 818 

I I I I I I III I I I I I I I I I I I I I I I I I I I I 

Db 136 0 TAACCCCCTATTTTCAGG 

GGGTTAAAAATCAGCTTTAAAAAAATACATAAAAATTT 1415 

Qy 819 TTGATGC- 

CTTACATGAAATACCAGTGAAAAAAGGTGAAGGTGCCG 863 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1416 CATCTTAAAGCACTTTCATTTTATACCAACGTGAAAA 

GTGCCATTTTTAGAAT 146 8 

Qy 86 4 AGCTATAAACCT — CCAG AATA TTATTAGTCTGC 

ATGGTTAAA 904 



Db 1469 

AACTTTAAAGCTTAACAGGTTTCCTTTTAATATCCTTTTTTTGTGTGCTCTTTACTTACA 152 8 

Qy 9 05 AGTAGTCATGGAT AACTACA TTACCTGTTCT — 

TGCCTAATAAGTT 9 48 



Db 1529 CAATGGCTTTGTTTTGCTTTTTCAGCCACACCCCTTATGTGAACTAGTGCCT- 

TTGGGTA 158 7 

Qy 9 49 TC TTTTAATCCAA TCCACTAACACTTTAGTTA 

TATTCACTG 9 89 



Db 158 8 TCACGTAAAATTTTTTCCAAAGGGTTACTTTAAAAATCT- 

GTTACCACAATTATGAGATG 16 46 

Qy 99 0 G T T T T AC AC AG AGAAAT AC AA AATAA 

AGATCACACATCAAG 103 0 



Db 1647 

ATTTTTAAGTGATAAATTAAACTTCTTCTTGTATAAATTCTGCCCAGATCTCTCCACAAG 17 06 



Qy 1031 A-CTAT C T AC AAAAAT T TAT TATATATTT ACAGAAGAAA 

AGCA 10 72 

III I I I I I I I I I I I I I I I I I I I I I I I 

Db 1707 

AGCTGAGGGTTTCATAACTTTATGGCTTAATAAATGTATGACACTGAAAAGATTTGAGTG 1766 

Qy 10 73 TGCAT AT CAT T AAA 

CAAATAAAA 1095 

I I I I I I I I I I I I I I I II 

Db 1767 

TGAATCTACTGAAATCACTATAATGCACATTGAAGCTATGATGGTATTTGAGTAGTGAGG 182 6 

Qy 1096 -TACTTTTTATC ACAAAAAAAAAAAAAAA 1123 

I I I I I I I I I I I I I I I I I I I 

Db 182 7 T T AC T T T T GAT C GG AGC AAC AT AATGC TC AT AGAA 1861 



RESULT 10 
US-10-478-914-62 

; Sequence 62, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 62
; LENGTH: 1946
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-62 

Query Match 39.7%; Score 2747; DB 1; Length 1946; 

Best Local Similarity 40.7%; Pred. No. 1.5e-162; 
Matches 667; Conservative 0; Mismatches 424; Indels 548; 
Gaps 88; 

Qy 1 GCCGCTG CCACCGCACCCCGC 

CATGGAGCGGCCGTCGCT 39 



Db 2 59 GCAGATGGCATGTGTCTGTGACAACCTCTGCTCTCCACATCTCTTGGGCCTGTTTAC- 

CT 317 

Qy 40 GCGCGCCCTGCTCCTCGGCGCCGCTGGGCTGCTGCTCCTG 

TCTC-C 94 



Db 318 GCGCTCCCAGAGCCT CCGC CAGCATC- 

CCAGAATCTCCATCCCCATCTCTC 36 7 



CTCCTGCCCC- 

I I I I I I I I I I I I 



Qy 95 TCTTCCTC-CTCTTCGGACACCTGCGGCCCCT — GC-GAGCCGGCCTCCT 

GCCCG 145 

III I I I I I I I I I I III II I I I I I I I I I II 

Db 368 

ACTTATACACACATCAGTCATCGGTTATCCATTAGCTAAACCGCCTTCCTTAATAGCTTT 42 7 

Qy 146 CCCC TGCCCCCGCTGG 

GCTGC 166 

II I I I I I I I I I I I I 

Db 428 

ACACTGTTTGCTTTCTCTGGAACATTTTTAGTTAAAATTTCATAATGCAGTTGCACACAA 4 8 7 

Qy 16 7 CTGCTGGGCGAGACCCGCGACGCGT 

GCGGCTGCTGCC 2 03 

I I I I I I I I I I I I I I I I I I I 

Db 488 

ATGAAGACACAGATGGCTGCATCCTCCGTCTCTTCCCCTCGTTTACAGGAAGCTGCGGAT 5 4 7 

Qy 204 CTATGTGCG CCCGC — GGCGAGGGC-GAGCCGTGCGGGG 

GT- 241 

I III I I I I II I I I I I I I I I I I I I I 

I I 

Db 548 

CAGGGAGGGGTGTTAGGGTTACCCACATGGTAAGGGCAGAGACAAGAGGGGACCCCAGTT 6 0 7 

Qy 2 42 GGCGGCGC CG — GCAGGGG 

GTACTGCGCGCCGGGCATGGA 2 79 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 608 TTCCATGCTGCACATGGTCATTGCTGGGGACTGAGGTTTGCACATCAC- 

CCTGCCCTGTT 666 

Qy 280 GTGCGTGAAGAGCCGCAAGAGGCGGAA GGGTAAAGCCGGGGCAGCAGC — 

CGGCG 332 

III I I I I I I I I I I III III I I I I I I I I 

I 

Db 66 7 CTCCCT CCGC-TGGGGAGAAAGTCAGGGATGGAGC — 

AAGCTGCAGCATCTTCT 717 

Qy 333 GTCCGGGTGTAAGCGGC GTGTGCGTGT GCAAGAGCCG-CTACCC — 

GGTGT 380 

I I I I I I I II I I I I I I I I I I I I I I I II 

I 

Db 718 

GAAAAAGAAAAGGTGGCCTTGTCTCCAGGTCTCCCCTCAAGTCCCACCTTCCCATAGTTT 7 7 7 

Qy 381 GCGGCAGCGAC GGCAC 

CACCTA 402 

I I I I I I I I I I 

I I 

Db 778 

TCTGCCACTTCTCTTGATTTTCCTCTCTGCCACTTCTCTTGATTTTCCTCTCTGCCACTT 83 7 

Qy 403 CCC — GA GCGGCTG CCAGCTGCGCGC— 

CG 428 

I I I I I I I I I I I I I I I I I I I 

I 

Db 838 

CTCTTGATTTTCCTCTCTGCAGCTGCTTTGAGGTGGGTTTTCTCCAGATGCACACTTTCC 89 7 



Qy 429 

CCAGCCAGAGGGCCGAGAGCCGCGGGGAGAAGGCCATCACCCAGGTCAGCAAGGGCAC 
I I I I III I I I I I I I I I I II 

Db 89 8 CCTGCTTTGCGTCCTTATTCT 

GGTAGAAGCACAATCTAAAGCTCATTAAGGGAACTA 954 

Qy 48 7 CTGC-GAGCAAGGTCCTTCCATAGTGACGCCCCCCAAGGACATC- 

TGGAATG 536 



Db 955 

ATCAATTCTGTGCATGGCGCTAGCTCAGCAGATCACCACACAGGCAGCACTATTAGCAAG 1014 

Qy 53 7 TCACTGGT — GCCCAGGTGTACTTGAGCTGTGAGGTCATCGGAATCCCG 

ACA — 586 

II III I I I I I I I I I I III II I I I I I I 

Db 1015 TCGGTGCTTAACACATG-GCACTT 

CCATGAATCGATATGGAGCCCGTGTAGAACAAG 1070 

Qy 587 -CCTG 

TCCTCATCTGGAACAAGGTAAAAAGGGGTCACTATGGAGTTCAAAG— 636 

III I I I I I I I I I I I I I I I I I I I 

Db 1071 GCATGGGTTTTTTTCTCTTCC- 

C AT T AAG AAAAAC T G AT GC C AAAAAT AAC T TC TC AGAT 112 9 

Qy 63 7 GAC 

AGAACTCCTGCCTGGTGACCGGGACAACCTGGC 672 



Db 113 0 ATTTTCAAGTATGACTTTTATGAAGGGAAAAAGCATTTTTGTTTGCAAAATCATGCT- 

TC 1188 

Qy 6 73 CATTCAGACCCGGGGTGGCCCAGAAAAGCAT 

GAAGTAACTGGCTGG 718 



Db 1189 

AGTGCAGGCCAGTTGTGAATTGTGATGGCTTTTATTTCTCCTGGGGCTGTAACTTTAAGG 12 4 8 

Qy 719 GT GCTGGTATCTCCTCTAAG-TAAGGAAGATGCTGGAGAATATGAGTGC- 

CATG 770 



Db 1249 

GTTTAGAATTTGGAACCACAGCCTAGCTAATCATGACACACACACACACACACACACATG 13 0 8 

Qy 771 CATCCA-ATTCCCAAGGAC-AGGCTTC AGCATCA- 

GCAAAAATTACAGTGGTT 82 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1309 

CACACACATACACATACACAAAGCATCACGAAGAACCATACAAATTGTACATTATTTTAC 136 8 

Qy 821 GATGC C T T AC ATGAAAT AC C AG T 

GAAAAAAGGTG 854 

I I I I I I I II I I I I I I I I I I I I 

Db 1369 ACATGGAGGCTCACTCTAAAAT- 

AGATACCATTTTAAATATTAACTAAAACTTGTGCTCA 142 7 



— 486 
III I I I I I I I 



Qy 855 -AAGGTGCCGAGCTATAAACCTC- CAGAAT A 883



Db 1428 

TTGTATGTTCATTCTATATGTACTGATTTGTTATTCACATTTCTTTCAAAATCGTTCAAA 14 8 7 

Qy 884 TTATTAGTCTGCATGGTTAAAAGTAGTCATGG — 

ATAACTACATTACCTGTTCTTGCCTA 9 41 



Db 148 8 TTTCTAGCC C AC AT C AAT T AAA- TAT T AAT AG T AC TTTCTACA- 

AACATGAGCGCACAAA 15 45 

Qy 9 42 ATAAGTT TCTTTTAATCC AATCCACTAACACTTT 

AGT— 978 

I I I I I I I I I I I III I I I I I I I I I I 

Db 1546 

ATAAATTCAAATCTATTTTTCCCACTGGTGTTATCAATACTGCTCATACTTTGTCAGTAA 16 05 

Qy 9 79 -TATAT-TCAC-TGGTTTTA CACAGAGAAATA 

CAAAAT- 1013 



Db 1606 

CTAAGTATCACATGATCTTAAAACTAATGTCACATACTAAAAAGCTTCTGAGGCAAATTG 1665 

Qy 1014 AAAGATCACACATCA 

AG AC T AT C T AC AAAAAT T T AT TAT AT AT T 1057 



Db 1666 

TAGAAAGAACTCTCAACATCATTGTTCTACTGGACAACATACATAAAATTATTTTACAGT 1725 

Qy 1058 TACAGAAGAAAAGCATGCATATCAT — TAAA 

CAAATAAAATA 109 7 

I I I I I I I III I I I I I I I I I I I I I I I 

Db 1726 

GATGGGAGAGAAATAGGCTCCTCATCCTAAAAGCTGCGAAGACAGTAGCGGTGCCGTGTT 17 85 



Qy 1098 CTTTTT- 

ATC- 1106 



Db 1786 

TTGGCGTTACTCCCCTGTGGATCCCAGCGACGGGTGGATTTCTCCTGTGCTTTATCATCA 18 45 



Qy 1107 -ACAAAAAAAAAAAAAAAA 1124 

I I I I I I I I I I I I 
Db 18 46 GACACAAAATGGACCAAAA 1864 



RESULT 11 
US-10-478-914-57 

; Sequence 57, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 



; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 57
; LENGTH: 2336
; TYPE: DNA
; ORGANISM: Homo sapiens



Query Match 39.1%; Score 2704; DB 1; Length 2336;
Best Local Similarity 36.3%; Pred. No. 2e-159;
Matches 687; Conservative 0; Mismatches 408; Indels 796;
Gaps 97;



Qy 1 GCCGCTG CCAC-CGCAC CCCGC CATG — 

GAGCGGCCGTCGCTG 4 0 

I I I I I I I I I I I I III I I I I I I 

I I I I 

Db 29 

GCTGCGGAATTCCTCGAGCACTGTTGGCCTACTGGAATGCGAGCTGAGCAGACAGGGCTG 8 8 

Qy 41 C GCGC CCTGCTCCTCGGC— 

GCCGCTGGGCTGCTGCTCCTGCT 81 

I I I I I I I I I I I I I I I I I I I I I I 

Db 89 CAAGGAAATCTGGCGCGGTTCAATACCTCGTCTAGC — 

CTGGGTTCCAGTATCTAATTTT 146 

Qy 82 CCTGCCCCTCTC 

CTC 96 

I I I I III III 

Db 147 

TTTTTTGTTTTAACTGACAAACTCATTTCTCTACTGGGACAGGATGCTGTGCTGGCTGGA 2 06 

Qy 9 7 — TTCC TCCTCTTCGGACA 

CCTGCGGCCCCTG 126 



Db 207 

AG TTCCATTTC T AC AGC AAG AAT C C T AT C T GGAAAC AC AG AAG TTGTCCTC T AGC C AC AG 2 66 

Qy 127 CGAGC CGGCCTCCTGCC— 

CGCCCCTGCCCCC 156 

I I I I I I I I I I I I I I I I I 

Db 267 

CAGCTCGAACTTTTTTGATTGTCGTTGCTGCTTTCTCCCATCACCCCCATCCCCTTTTGA 326 

Qy 157 

GCT 159 



Db 327 

CAAAGATCCAACTGTAAAAAGTCTTACGTAACAGTTCAGGACTACTTCGGTTCTTTTACT 3 86 



Qy 16 0 GGG- 

TG 169 



-GCC- 



Db 387 

GGGTAAGCACTTTCAATTTTTTTTTTTTAACTAAAAGCCATTTTAAAATTGAATCTGTTG 4 46 

Qy 170 CTGGGCGAGAC 

CCG 183 

I I I I I I I I 

Db 447 

AGGGGCTTGACTAAAATCTTTTAAGTAATTTGTGTAATGGAATACTGTCAGTGGATTTTT 5 06 

Qy 184 -CGACGCG-TGCGGCTGCTGC-CCTATGTGCGC 

CCGCGGCG 221 

I I I I I I I I I I I I I I I I I I I II II 

Db 507 

TCGTCTCATTTCTGCACGTGCTCCTTTGTTCTCAGAACAGAAGCTTTTTATACACATCCC 566 

Qy 222 AGGGCG-AGCCGTGCGGGGGTGGCGGCGCCGG 

CAGGGGGTACTGC 265 

I I I I I I I I I I I I I I I I II 
Db 56 7 ATAACGCAGCTG- 

G AG AG AG T T AT G AAG T C AG T T AT T AT AAGG AAC AC AAAGGT TGC T T T 62 5 

Qy 266 GC GCC 

GGGCATGGAGTGCGTGAAGAG 291 



Db 626 

CCATTCTTTGCCTTTAGATAATTAATTTTTTTGTTTTCTTAAAATGGAGTATTTAAAGAA 685 

Qy 292 CCGCAAG AGGCGGAAGG 

GTAAAG 314 

I I I I I I II II I I I I I 

Db 686 

GGAAGAAATTCACAAGAAATAAACTGTTGGAGAATTTAGAAAAGTTTGAAGTTTTTACCA 7 45 

Qy 315 CC GGGGC AGCAGCC 

GGCGGTCCGGGTG 341 



Db 746 CCTTTTCTATCTCTAGTTTTGTGTGGCCAAACACTTGTGCCGCCTGGGGCGGT — 

GGGGG 8 03 

Qy 3 42 TAAGCGGCGTGTGCGTGTGCA-AGAGCCGCTA — CCCGGTGTG CGGCA- 

GCGACG 392 



Db 804 T-AGAGGC — 

AAGCATAGACAGAGAGGAACTAAGCCAGACATGGACAAAGGCACGAGCCA 86 0 

Qy 393 GCACCA CCTACCCGAGCGGCT-GCCAGCTGCGCG 

CCGCCAGCCA 435 

I I I I I I I I I I I I I I I I I I I I I I 

Db 861 

AAACCAGACAGTCCTGGCCGTTCGATAGGCCAGCAGTGGGTGAGACAGGTCAGCCAGCTG 92 0 

Qy 436 GAGGGCCGAGAGCCGCGGGGAG AAGGCCATCACCCAGGTCAGCAAGG- 

GCACCTGCG 491 

I I I I I I I I I I I I I I I I I I I I I I II I III 

I I I I 



Db 921 CAGGGGCGGGAGCGGGGGGGCGGGTAGGGACAT 

GGATGTGTGAGGTGCTCATGCG 9 75 

Qy 492 AG CAAGGT CCTTCCATAGTGAC-GCCCC-CCAAGGAC- 

ATC 529 

I I I I I I I I I I I I I I I I I I II 

I I I 

Db 976 

TGCGATCGTGACTAGATTCTGAAACTGCCAGCCATTTTCCCAGCTCCGCTTTGTGCAATC 1035 

Qy 53 0 T GGAATG TCACTGGTGC 

CCAGGTGTACTTGAGCTGTGAG 56 8 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1036 

TAAAGGAATGCATCCCCTCTGAAGCAGTCTTGCCAGAGCCTAGTGAGGGAGAGAAGTATG 1095 

Qy 569 GTCATCGGAATCCCGACACCTGTCCTCATCTGGAACAAGGTA- 

AAAAGGGGTCACTATGG 62 7 

III I I I I I I II III II III I I I I I I 

I I 

Db 1096 GTAA 

ATACCACAACATATGGAATCAGAAAATACCGGGAACTGGAGTGGGCAAGGGGG 1152 

Qy 62 8 AGTTCAAAGG ACAGAACTCCTGCCTG 

GTGACCGGGACAACC 66 8 

I I I I I I I II I I I I I I I I I I I I I 

I I 

Db 1153 

AATGCAGAGGGTGTGGAAATTTTTTAGTGATCTGGAATGTGTTGAGTGACAGGAAGTGCC 1212 

Qy 669 TGGCCA TTCAGACCCGGGGTGGCC 

CAGAAAAGCATGAAGTAACTGGCT 716 

III III III II III I I II I II 

I I 

Db 1213 CCAAGCTTCTCCCCC ACCAACTCTTCTCAGTCGCGCCTGCTTT 

TGTCT 1260 

Qy 717 GGGTGCTGGTATCTCC 

TCTAAGTAAGGAAGATGCTGGAG 755 

I I I I I I I I I I I I I I II I 

I 

Db 1261 

AACTCTTGTAATCTACACACTACTGCTTACAAAGCTGTCTGAGTTTAAGACAAAAAGAAA 132 0 

Qy 756 AATATGAGT-GCCATGCATCCAATTCCC AAGGACAG 

GC 792 

II III I I I I I I I I I I I I I I I I II 

Db 1321 

CCTAAAAGTCTCCTTACTTCATAGTACCTGTGATATGGAAGGAATGTAAAAGCATGACCC 13 8 0 

Qy 793 -TTCAGCATCAG — C AAAAAT T AC AG T 

GGTTGATGCCTTACATGAAATACCAGTGA 845 

I I I I I I I II II II I II I I I I I I I I I 

I I 

Db 1381 

T T T AAGC C T AT GG AC AT T T T T TC AGGT AT AC AGGAGAAAGAT AAAAT AAT T T T C C AC AG A 14 4 0 

Qy 8 46 AAAAAG GTGAAGGTGCC 

GAGCTATA 8 70 



Db 1441 

AAAATGAGAATTATGAATTATATAGTTCAGGTTCCAAATCTAATTTTTAAAAGAATTCTG 15 0 0 

Qy 8 71 AACCTCCAGAATATTA TTAGTCTG 

CATGGTTAAAAGTAGTCATG 914 

I III I III I I I I II II II I I I I I I I I 

Db 1501 

ATTCTGCTACACTTTACAAATGCTTAGGTTGGTTCCTAATTTGAAGGAGACTTGTTTTAT 156 0 

Qy 915 — G AT AAC T AC AT T AC C — TGTTCTTG — CCTA ATAAGTTT 

CTTT — 953 

I I I I I I I I I II I I I I III I I I I I I I 

Db 1561 

TTGGTTAATGCATTGCATTTGAACTTGTTTCTATTTTCTTTGCATAAATTTGGACTTTGG 162 0 

Qy 954 TAATCCAATCCACTAA CACTT 

TAGTTATATTCACT 988 



Db 1621 

GAGAAAAATGCAAAGTAATAAGTAGAATGCACTTGGGGGAAAAAAGGAGGATTTTTCCCT 16 8 0 

Qy 9 89 GGTT TTACACAGAG 

AAATACAAAATAAAGATCAC 1022 

I I I I II I I I I I I I I I I I I I I 

Db 1681 

TCATGGGTTGGAAAGTATTTTAAAGGGTTGTTTTCTTGAAAAAACAAGCTCTCTCTTACT 17 4 0 

Qy 1023 ACATCAAGACTAT CTACAAAAA 

TTTA 1048 



Db 1741 

TTCTGCATCTATGCTATAAAGATAACTATGCTATAATAAATGTAAGGTAGAAAACTTTAA 18 0 0 

Qy 10 49 TTATATATT T AC AG 

AAG AAAAGC AT GC AT AT CAT T AAAC AAAT AAAAT AC T T 110 0 



Db 1801 AGAGAAAATAACAGTGTTCTAAGTGAAAAGC- 

TACTTAGCATTTTCCCAAACTCACACAT 1859 

Qy 1101 TTT AT C AC AAAAAAAAAAAAAAAA 112 4 

II I I I I I I I I I I I I I I I 

Db 186 0 TAT C AAC AG AAAC AC T AAAAAT T AAGAGAAA 189 0 



RESULT 12 
US-10-478-914-20 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 



; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20
; LENGTH: 2495
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-20 

Query Match 39.0%; Score 2697.5; DB 1; Length 2495;

Best Local Similarity 34.5%; Pred. No. 7.1e-159; 
Matches 728; Conservative 0; Mismatches 390; Indels 995; 
Gaps 109; 

Qy 1 GCCGCTG CCACCGCAC 

CCCGCC 22 

I I I I I I I I I I I I I I 

Db 349 

GCCAATGTGGAACTATACTACAGAGCAATACAGTTCTACTTAGAATTCAAGCCTCTGTTG 4 0 8 

Qy 2 3 ATGGAGCGGCCGTCGCTG CGCGCCC-TGC— 

TCC 53 

I I I I I I I I III I I I I I I I II 

Db 409 

TTAAATGATTTGCTGATGGTGCTGTCTCCACGGTTGGATCACACTCGTGCAGTCAATTAT 46 8 

Qy 54 -TCGGCGCCGCTGGGCTGCTGCTCCTGCT CC TGC 

CCCT 90 

I I I I II I I I I I I I I I II III II 

I 

Db 469 

TTCAGCAAGGTTAAACAGCTACCACTGGTGAAACCGTATTTGCGTTCAGTTCAGAACCAT 52 8 

Qy 91 CTCCTC TTCC 

TC— 102 

II III II 

Db 529 

AAC AAC AAAT C T G T G AAT G AAT C AT T G AAC AATC T T T T TAT T AC AG AAG AAG AT TAT C AG 5 8 8 

Qy 103 -CTCTTCGGACACC TGC GGCCCCT GCGAGC- 

CGGC— 135 

I I I I I I I I I I III I I I I I I I I I I I 

Db 589 

GCTCTGCGAACATCAATAGATGCTTATGACAACTTTGACAATATCTCGCTTGCTCAGCGT 6 4 8 

Qy 136 CTC CTGCCCGCCCCT 

GCCCC 155 

III I I I I III II 

I 

Db 649 

TTGGAAAAACATGAACTCATTGAGTTCAGGAGAATTGCTGCTTATCTCTTCAAAGGCAAC 7 0 8 

Qy 156 CGCTGG — GCTGCCT GCTGGGCGAGACCCGCGACGCGTGC — 

GGCTGCTGCC 203 

I I I I I I II I I I I I I I I I I I I II I I I I I 

Db 709 



AATCGCTGGAAACAGAGTGTAGAGCTGTGCAAGAAAGACAGCCTTTACAAGGATGCAATG 76 8 

Qy 204 CTATGTGCGCCCG CGGCGAGGGC GAGC-CGTGCGGGGG 

TGG 243 

I I I I I I I II III I I I I I I I I I I I 

I 

Db 769 

CAGTATGCTTCTGAATCTAAAGATACTGAATTGGCTGAAGAACTCCTGCAGTGGTTTTTG 82 8 

Qy 2 44 C— GGCGCCGGCAGGGGGTACT GCGC GCC 

GGG 2 73 

III I I I I I I I III II 

I I 

Db 829 

CAGGAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTACCTGTTACGATCTTTTAAGG 8 8 8 

Qy 2 74 CATGGAGTGCGT — G AAG AG CCGCA-AGAGGCG G AA 

GGGTAAAGCCGGGGC 321 

I I I I I I I I I I I I I I I I I I II III III II 

Db 8 89 CCAGATGT- 

CGTCCTAGAAACTGCATGGAGGCACAATATCATGGATTTTGCCATGCCCTA 9 4 7 

Qy 322 -AGCAGCCGGCGGTC CGGGTGTAAGCGGC GTG 

TGCGT 357 

I I I I I I I I I I I I I II III I I I I 

Db 948 TTTCATCC— 

AGGTCATGAAGGAGTACTTGACAAAGGTGGATAAATTAGATGCTTCAGAA 10 05 

Qy 358 GTGCA AGAGCCGCTACCCGGTGTGC 

GGCAGC — 388 

I II I I I I I I I I I I III I I I I I 

Db 1006 

TCACTGAGAAAAGAAGAAGAACAAGCTACAGAGACACAACCCATTGTTTATGGTCAGCCC 1065 

Qy 3 89 GACGGCA CCA 

CCTACCCGAGCGGCTGCCAGCTG- 421 

I I I I I I III I I I I I I II II II 

Db 1066 

CAGTTGATGCTGACAGCAGGACCCAGTGTTGCCGTCCCTCCCCAGGCACCTTTTGGTTAT 112 5 

Qy 422 CGCGCCGC CAGCCAGAG GGC — CGAGAGCCGC 

GGGGA 456 

I I I I I I I I I I I I I I III II III II 

Db 1126 

GGTTATACCGCACCACCGTATGGACAGCCACAGCCTGGCTTTGGGTACAGCATGTGAGAT 1185 

Qy 457 GAAG-GC CATC 

ACCCA 471 

Db 1186 

GAAGCGCTGATCCTGTAGTCACCTATTTTCGTACTGAAACATCGTCTTTACCCACTTCTC 12 45 

Qy 4 72 GGTCAGC 

AAGGGC — 48 4 

I I I I I I I I I 

Db 1246 

AGTTTATAATGGGGGAAAACAGGCAACGTGTTCTTGTAACCTTTATTTCATGAAGGACTT 13 05 

Qy 485 ACCTGCGAGCAAGGTCCTTCCATAGTGACGCC 

CCCCAAGGAC 52 6 



Ill I I 1 1 I I 1 1 I I 1 1 II 

1 1 

Db 1306 

CTTTTTGTTTCTAACTATAAACTTGGATCACCTATGTTAAAACCTTATTTCACATTCCAC 1365 

Qy 52 7 ATC TGGAATGTC ACTGGTGCCCAGGTGT 

ACTTGAGCTGT 565 

III I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1366 

ATCATTTTAGAATTTATTTTCGAAGGGGAATAGTTTCAATGTTTTATTCACTTGGGCTTT 142 5 



Qy 566 GAGGTCATCGGAATCCCGAC 

ACCTG 590 

I I I I I III I 

I I I I I 

Db 1426 

TTTTCTTCCCCCTCTTTCTTTAAAGAACTGCTCAATATTCAATCTGTTGTGAAGAACCTG 1485 

Qy 591 TCCTCATCT GGAACAAGG 

TAAAAAGGGGTCAC 622 

I I I I I I I I I I I I I I I I I I III 

Db 1486 ATTTGCAC- 

TCTGTAGTGTTTAAAGAAACAAAGAAACTCTAATATTGAATCTCTTAAATT 15 4 4 

Qy 623 TATGGA 

GTT 631 

I I I I I 

I I 

Db 1545 

TAGTGTATGTAAACAGCTTACAAATACGTATTGTCTAAATGCATTTAAATCTGTTTTATT 16 0 4 

Qy 632 CAAAGGA CAGAACTCCTGCCTGGTGACCGGGACAACCTG 

GCC 673 

I I I I I I I I I I I I I I I I I I I I I III I 

Db 1605 

CAAAGAAAAGCTAAAGCAAAAACACTGGCATATGACCATGCAAGACTGTCAGTGCCAACA 166 4 

Qy 6 74 ATTCAG ACCCGGGGTGG CCCAGA— 

AAAGCATG 70 4 

I I I I I I I I I I I I I I I I I I I 

Db 1665 

AAGACAACACTAATCAGCACATCGTACACTGGATTGCAGTGCTTCCCAGATTATTGAAAA 172 4 

Qy 705 AAGTAACTGGC TGGG 

TGCTG 724 

I I I I I I I III 

I 

Db 1725 

ATGTTACAGACAACTTGCCTGATTTTTAAATGAGCGTAAAAGGCCCTCTAACCTATGCAG 17 8 4 

Qy 72 5 GTATCTCCTCTAAG— TAAGGAAGATGCTGGAG AATATG- 

AG 763 

I I I I I I III II I I I I I I I I I I I I I I 

I 

Db 1785 

GTTTCCCCATTATGCATATAGAAAATGCTAGTATGTTTTGCTCACTTCATATGTAACAGG 18 4 4 

Qy 76 4 TGCC ATGCATCCAATTC CCAAGG — ACAGGC — 

TTCAGCATCA- 8 02 



1 1 1 1 1 1 1 1 1 1 I I I II 1 1 I 1 1 1 1 

1 1 

Db 1845 

TGCCCTTATGTTGTGCTGTATCCTGTGCTTTTTCTGTGGGACCATTCCATTCAGGAGCAA 19 0 4 

Qy 8 03 GCAAAAATTACA GTGGTTGATG — 

CCTTACATGAAATACCAGTGAA 8 46 

II I I I I I I I I I I I I I I I I I I I I 

I 

Db 19 05 AGAGCACCATGATTCCAATCTTGTGTGTGTTTACTAACCCTTCCCTGAGGTT- 

TGTGTA 1962 

Qy 8 47 AAAAGG TGAAGGTGC 

C G AGC T AT AAAC C T C C AG AA 881 

II I I I I I I I I I I I I 

I I 

Db 1963 

T G T T GG AT AT TGTGGTGTTT TAG AT C AC T G AG TGT AC AGAAGAGAG AAAT T C AAAC AAAA 2 02 2 

Qy 8 82 TATTATTAGTCTGCA TGG TTAAAAGTAGTCATGGAT- 

AACTACATTA 92 7 

I I I I I I I I I I III I I I I I I I I I I III 

I I I I 

Db 2023 

TATTGCTGTTCTTCAGTTTTGTTTGTGGAATTTGAAATTACTCAAATTTAAAATAAATTA 2 0 82 

Qy 92 8 C CTGT TCTTGCCTAATAAGTTTCTTTTAATCCAAT- 

CCACTAACAC 9 72 

I I I I I I III I I I I I I I I I I I I I I I I I I I 

Db 2083 

CTGGACTGTG G AAAT AAC AT AG AAT T G AAG T T T T AAT T AAAT AC C AC T C AAAC G AAAAG A 2142 
Qy 973 

TTTAGTTATAT 9 83 

I I I I I I III 

Db 2143 

AC AG TAG T T T T T G TAG T T T T AT AT TGGAT AC TGAGGC AT T AGGGAGGC ATGAAAGGAAG A 2 2 02 

Qy 984 TCACTG 

GT 991 



Db 2203 

GG AAT G AGG AT T GAG AC AT G T G AAG AC AT TGT GC AT TAT AT C AAT GTGCATTCCTG TAG T 2262 

Qy 992 T — TT AC AC AG AG AAAT AC AAAA TAAAGA TCAC 

AC 1024 

I III II I I I I I I I I I I I I I I I I I 



Db 2263 

TCATTAACAAGGTACATGCAATAGTCTAAAGAACCAGAGTCACTACTATAGTGGCTTAAC 232 2 

Qy 1025 AT C AAG AC T AT C T AC AAAAAT T TAT TATA 

TAT T T AC AGAAGAAAAGC - 10 71 

II I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2323 

ATTTAATCTGTCTCCAATATTTTAACCAAGTGACACCGGGGTTTTTATCGAAGCATTTCA 23 82 
Qy 10 72 ATG— CATATCAT TAAAC 



AAAT AAAAT AC T T T T TATCACAAA 1111 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2383 

CTTAAATGAACAAATCATGGCTGTTATATTAACTTGAAATAAAATATATTTAAACATGTA 2 4 42 

Qy 1112 AAAAAAAAAAAAA 1124 

I I I I I I I I I I I I I 
Db 2443 AAAAAAAAAAAAA 2455 
US-10-478-914-20/C

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20
; LENGTH: 2495
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-20 



Query Match 38.3%; Score 2653.5; DB 1; Length 2495;
Best Local Similarity 35.0%; Pred. No. 5e-156;
Matches 686; Conservative 0; Mismatches 426; Indels 847;
Gaps 103;



Qy 2 CCGCTG CCACCGCACCC CGCCATGGAGCGG 

CCGTCGCT 39 

III I I I I I I I I I I I I I I I I II 

Db 1977 

C C AC AAT AT C C AAC AT AC AC AAAC C T C AGGGAAGGGT T AGT AAAC AC AC AC AAG AT T GG A 1918 

Qy 40 GCGCGCCCTGCTCCT CGGCGCCGCTG 

GGCTGCTGCT 75 



Db 1917 

ATCATGGTGCTCTTTGCTCCTGAATGGAATGGTCCCACAGAAAAAGCACAGGATACAGCA 185 8 

Qy 76 CCT GCTCCTGCCCCT CTCCTCTTCCT — 

CCTCTTC 108 

I I I I I I I I I I I I I I I I 

Db 1857 

CAACATAAGGGCACCTGTTACATATGAAGTGAGCAAAACATACTAGCATTTTCTATATGC 179 8 
Qy 109 GGACACCTGC GGCCCCT— GC 



GAGCCGGCCTC 13 8 

I I I I I I I I I I I I I I II II 

Db 1797 

ATAATGGGGAAACCTGCATAGGTTAGAGGGCCTTTTACGCTCATTTAAAAATCAGGCAAG 173 8 

Qy 139 CTGCCCG C CCCTGC CC 

CCGCTGGGCTGC 166 

I I I I I I I I I I I I I I I I 

I I I I 

Db 1737 

TTGTCTGTAACATTTTTCAATAATCTGGGAAGCACTGCAATCCAGTGTACGATGTGCTGA 16 7 8 

Qy 16 7 CTGCTG GGCG AGACCCGC GACGCGTGC 

GGCT- 197 

III III I I I I I II III 

I I I 

Db 1677 

TTAGTGTTGTCTTTGTTGGCACTGACAGTCTTGCATGGTCATATGCCAGTGTTTTTGCTT 1618 

Qy 19 8 — GCTGCCCTATG — TGCGCCCGCGGCGAGGGC GAGCCGTGCGGGGGTG- 

GCGG 246 

III I I I I I II III I I I I II 

I 

Db 1617 

TAGCTTTTCTTTGAATAAAACAGATTTAAATGCATTTAGACAATACGTATTTGTAAGCTG 155 8 

Qy 247 CGC 

CGG 252 

I I 

Db 1557 

TTTACATACACTAAATTTAAGAGATTCAATATTAGAGTTTCTTTGTTTCTTTAAACACTA 149 8 

Qy 253 CAGGG GGTACTGCGCGCCGG GCATGGAGTGC- 

GTGAAGAGCCGCAAGA 299 

I I I I I I I I I I I II III I I I I I 

I I I I 

Db 1497 

CAGAGTGCAAATCAGGTTCTTCACAACAGATTGAATATTGAGCAGTTCTTTAAAGAAAGA 143 8 

Qy 300 GGCGGAAG-GGTAAAGCC 

GGGGCAGCAGCC 32 8 

I I I I I I I I I I I I I I I II 

Db 1437 

GGGGGAAGAAAAAAAGCCCAAGTGAATAAAACATTGAAACTATTCCCCTTCGAAAATAAA 13 7 8 

Qy 329 GGCGGTCCGGGT 

GTA 3 43 

I I I I I I II 

I I 

Db 1377 

TTCTAAAATGATGTGGAATGTGAAATAAGGTTTTAACATAGGTGATCCAAGTTTATAGTT 1318 

Qy 3 44 AGCGGCGTG TGCGTG TGCAAGAGC- 

CGCTACC 3 74 

II I I I I I I I I I I I I I I I I I 

Db 1317 

AGAAACAAAAAGAAGTCCTTCATGAAATAAAGGTTACAAGAACACGTTGCCTGTTTTCCC 12 5 8 



Qy 375 - 

GCGGCAG 387 



Db 1257 

CCATTATAAACTGAGAAGTGGGTAAAGACGATGTTTCAGTACGAAAATAGGTGACTACAG 119 8 

Qy 388 CGACGGCAC — CACCTACCCGAGCGG CTGCCA-GCTGCGCGCCG- 

CCAGCCAGA 43 7 



Db 119 7 GATCAGCGCTTCATCT-CACATGCTGTACCCAAAGCCAGGCTGTG- 

GCTGTCCATACGGT 1140 

Qy 43 8 GG GCCGA GAGCCGCGGGGAGAAGGCCATCAC 

CCAG— 472 

II III I I I I I I I I I I I I I I I III 

Db 1139 

GGTGCGGTATAACCATAACCAAAAGGTGCCTGGGGAGGGACGGCAACACTGGGTCCTGCT 10 8 0 

Qy 4 73 GTCAGCA AGGGC — ACC TGCG AGCAAGGTC — 

CT 502 



Db 1079 

GTCAGCATCAACTGGGGCTGACCATAAACAATGGGTTGTGTCTCTGTAGCTTGTTCTTCT 102 0 

Qy 503 TC CATAGTGACGC 

CCCCCAAGGACATCTGGAATGTCACT 541 



Db 1019 TCTTTTCTCAGTGATTCTGAAGCATCTAATTTATCCACCTTTG TCAAGTAC- 

TCCTT 964 

Qy 542 GGTGCCCAGG-TGTACTTGAGC TGTGAGGTCAT- 

CGGAATC 58 0 



Db 963 

CATGACCTGGATGAAATAGGGCATGGCAAAATCCATGATATTGTGCCTCCATGCAGTTTC 9 0 4 

Qy 581 CCGACACCTGTCC TCAT — CTGG 

AACAAGGTAAAAAGGGGTC 62 0 



Db 903 

TAGGACGACATCTGGCCTTAAAAGATCGTAACAGGTAAACAGACAAGCTCCAAAGCACTC 8 4 4 

Qy 621 ACTAT 

GGAGTTCAAAGGACAGAAC 644 

Db 843 

TCTTTTTTCTTCCTGCAAAAACCACTGCAGGAGTTCTTCAGCCAATTCAGTATCTTTAGA 7 8 4 

Qy 6 45 TCCTGCCTGGTGACC GG — GACAACCTGGC 

CATT 6 76 



Db 783 

TTCAGAAGCATACTGCATTGCATCCTTGTAAAGGCTGTCTTTCTTGCACAGCTCTACACT 72 4 
Qy 6 77 CAG— ACCCGGG GTGGCCC AGAAAAGCATGAAGT 



AACTGGCTGGG 719 



Db 723 

CTGTTTCCAGCGATTGTTGCCTTTGAAGAGATAAGCAGCAATTCTCCTGAACTCAATGAG 66 4 

Qy 72 0 TGCTGGTATCTCCT CTAAGTAAGGAAGATGCTGGAGAATATGAGTGCCAT — 

GCAT 7 73 



Db 663 TTCATGTTTTTCCAAACGCTGAGCAAGCGAGATATTGTCAAAGTTGT 

CATAAGCAT 608 

Qy 774 CCA ATTCCCAAGGACAGG 

CTTCAG 797 

II I I I I I III I I I I I 

Db 607 

CTATTGATGTTCGCAGAGCCTGATAATCTTCTTCTGTAATAAAAAGATTGTTCAATGATT 5 4 8 

Qy 79 8 CAT CAG CAAAAAT 

TACAGTGGTT 82 0 



Db 547 

CATTCACAGATTTGTTGTTATGGTTCTGAACTGAACGCAAATACGGTTTCACCAGTGGTA 4 8 8 

Qy 821 GATG CCTTACAT G AAAT AC CAG — 

TGAAAAAAGGTGAAGGTGCCGAGCTATAA — 871 

III I I I I I I I I I I I I I III I I I I I III 

Db 48 7 GCTGTTTAACCTTGC- 

TGAAATAATTGACTGCACGAGTGTGATCCAACCGTGGAGACAGC 429 

Qy 8 72 AC C T C C AG AAT AT TAT TAG TCTGCATGG 

T T AAAAG TAG T C AT GG AT AAC T 921 

III I I I I I I I I I I I I I II I I I I I I II 

Db 428 

ACCATCAGCAAATCATTTAACAACAGAGGCTTGAATTCTAAGTAGAACTGTATTGCTCTG 369 

Qy 922 ACATT ACCTG 

TTCTTGCCT- 940 

I I I I I III 

Db 368 

TAGTATAGTTCCACATTGGCAACCTTGGTAATGATATCTTTGAATTGCCCTTCTTTCCAG 3 09 

Qy 9 41 -AATAAGTT 

TCTTTTAATCCAATC 963 

I I I I I I I I I I I I 

Db 308 

GCATCAGTTGGATGATTCATCATGGTAATTATGGCATTATCATATTCTTCATACTTGTCA 2 49 

Qy 96 4 C AC T AAC AC T T T AGT T AT AT TC A CTGGTTTTACA 

CAGAGAA 1004 



Db 248 

TACAAAAACACCAGTTCTGCCCAAAGATGAGCTTGTTCTGCAGCTCTTAGCACCTTGGGA 189 

Qy 10 05 ATACAAAATAAAGATC AC AC AT C AAG ACTATCT 

ACAAAAATTTA 10 48 

III II I I I I III I I I I I I I I I 



Db 188 

ATATTCACTCTAGACCAGAACAGCTCCAGGTGCTCCCTCATTTTCTGAGGCTTAAATTTA 12 9 



Qy 10 49 TTATATA- 

CATATCATTAAACA 1088 

I I I I 



- T T T AC AG AAG AAA- 



I I I I I 



-AGCATG — 



Db 



128 



GAGTATAGAATAGCTAATTCAGTAAACATTCCCATGTGAGCTCGCTCAAGTCCCAGTGCT 6 9 



RESULT 14 
US-10-478-914-3/C 

; Sequence 3, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 
; PRIOR FILING DATE: 2001-08-24
Gaps 99;

Qy 1 GCCGCTGCCACC-GCACCCCG— 

CCATGGAGCGGCCGT 35 

I I I I I I I I I I I I I I I III I I I I I I 

Db 2080 GCGGCCGCGACCTGCAGCTCGAGCACAT 

GTGGCCTTTTTTTTTTTTTTTTTATTTTG 2 02 4 

Qy 36 CGCTGCGCGCCCTG CTCCTCGGCGCCGCTGGG 

CTGCTGCTCCTGC- 80 



Db 2023 C-CTTCTTGCTCTGGTGGTTAGTCTCCTACTATAAACATGATTTTCCTTCTTC- 

CCTTCA 1966 



Qy 



10 89 AAT AAAATACTTTTTATCACAAAAAAAAAAAAAAAA 112 4 



Db 



6 8 GCTTCCAACATGGTGATCAGCTCTTCAAAATAGCCACGA 3 0 



Qy 81 TCCTGCCCCTCTCCTC TTCCTCCT- 

CTTCGGACACC 115 



Db 1965 

GAG TCCTGTCAATCTTTACTTTTTTTCTCAT T AAT AT AC AGAAG T TAT AC T AC AG AT AC A 19 06 

Qy 116 TGC-GGCCC CTGC GAGCCG 

GCCTCCT 140 

I I I I I I I I I I I I I I I 

Db 1905 

TTCAAGCTCATGACTGCTAAAAACTTCCTGAAATGAGCCTTAATTCATACGAATCATTTC 18 46 

Qy 141 GCCCGCCCCT 

GCCCCCGCTGGGCTGCCTGC 170 



Db 18 45 TATTTGTTCTTTTTTTTTTGAGACAGACTCTTACTCTGTCGCCCAGGCTGGAGTGC — 

AC 1788 

Qy 171 TGGGCGAGACCCGCGACGCGTGCGGCTGCTGCCCTATGTGCGCCCGCG- 

GCGAGGGCGAG 229 



Db 1787 T-GGCATGATCT-CGGCTC ACTGCAACC TCTGC- 

C T C AC GAG T T AGG T C AAG 1739 

Qy 23 0 CCGTGCGGGGGTGGCGGC— GCCG GCAGGG GGT- 

ACTGCGCGCCGGGCAT 2 76 



Db 1738 CAATTCTTCTGCCTCAGCCTCCCGAGGAGCTAGGATTACAGGTCCCTGC- 

CACCATGCCT 16 8 0 

Qy 2 77 GGAGTGCGTGAAGAGCCGCAAGAGGCGGAAGGGTAAAGCCGGGGCAGC — 

AGCCGG 33 0 

I I I I I I I I I I I I I I I I I I I I 

Db 1679 - 

GAGTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAAGCTGGTCTC 1621 

Qy 331 CGGTC— CGGGTG TAAGC 

GGC 349 



Db 1620 

GAATTCCTGACCTCAGGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACGGGC 1561 

Qy 350 GTGTGCGT — GTGCAAGAGCCGCTA 

CCCGGTGTGC— 382 

I I I I I I I I I I I I I I I I III 

Db 1560 GTGAGCCACGGTGCCTG- 

GCCTCTATTTTTTATTTTAAATACCTTTTGCCCTGAAATCTA 1502 

Qy 3 83 -GGCAGC GACGGCACCACCTACC 

CGAGCG 410 



Db 1501 

TGTCCTCTGATTATCAACACTGCTTACCAGTTTTATTTTTGTTGGTATTTATTTCATTC- 14 43 

Qy 411 GCTGCCAGCTGCGCGCC— GCCAGCCAGA GGGCCGAGAGC 

CGCGG — 453 



I 1 1 1 1 1 1 II 1 1 1 1 I III I III 

Db 1442 

AATCCCAGCTCTACAACTTACCAGTATAAAAATTTGGATCAGTTACTTAATCTTGCAGTT 13 83 

Qy 45 4 GGAGAAGG CCATCA 

CCCAGGTCAGCAAGGG 4 83 

I I I I I I I I I I I I I I I I 

Db 1382 

TCTTCCTTTTAAAAATGGAATAATATTATCTATCATATAAAATGCCCAGTTAA 133 0 

Qy 48 4 CACCTGCGAGCAAGGTCCTTCCATAGTGAC — GCCCCC — 

CAAGGACATCTGGAATGTCA 539 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 132 9 CAAATATGAACTAACACCTACAAT-GTGCCAGGCACCATTCTAGGGAA — 

TAGAATGTAA 12 73 

Qy 540 CTGGTGCCCAGGTGTACTTGAGCTG — TGAGGT — CATC 

GGAATCCCGAC 585 

I II I I I I I I I I I I I I I I III III I I I I 

Db 12 72 ATAAGTTCAA 

G T T C T T G AC C T GAATAAG TTTACATTTTAGT GAGAAGAC AG AC AAT A 1216 

Qy 586 ACCTGTCC 

TCATCTGGAACAAGGTAAAAAGGGG 618 

I I I I I I I II II I I I I I I I I I I I I I 

Db 1215 

AATCAATAACATGTCCATTATGCCAGATAGTTTTAAATGTCACAAAGAAAAAGGGGGAGC 1156 

Qy 619 

TCACTATGG 62 7 



Db 1155 

AATCAGATAGTGATGGAAGTTACACTCTAGAGAGGGTGGACAGAAAAAGTTTCTCTTAGG 1096 

Qy 62 8 AGTT CAAAG GACAG AACTCC 

TGCCTG 653 

III I I I I I I I I I I I I I 

I I 

Db 1095 

AG AT G AC AT C T G AAC AG AG AT C T AAATGAC TGAGT AC AAAC C C C AAT GG AAAAG TGATTG 1036 
Qy 654 GTG AC— 

CGGGACAACC 66 8 

III III I I I I I 

Db 1035 

GTGTGTTGTGAAACAGCAACACAGCCAGTTTGACTAGATGAGTAGATCAGAATGACTGAG 9 76 

Qy 669 TGGCCAT TCAGA CCCGGGGTGGCC 

CAG— 695 

I I I I I I I I I I I I I I I I I I I I 

Db 975 

GTGGGACTGTGGTCATCAAAGAGATCAGAGAGGTAGCCTGGGAAGGCCACATCACAGAAT 916 

Qy 696 AAAAGC ATGAAGTAACTG- 

GCTGGGTGCTGGTATCTC 731 

I I I I I I I I I I I I I I I I I III 

Db 915 

GTCCTTTGGGCCTCTGTAAGAGCGTTTTGAAAAAACAGTTCTGAGCAAAGGTTAGTCATT 856 



Qy 732 CTCTAAGTAAGG 

AAGATGCTGGAGAA 75 7 

I I I I I I I I II 

I I I I I I I I 

Db 855 

TAACTTGACTTACTCTTTAGAAGGATCACTTTGGCTATTGAGTGGCAAACAACTGGAGAA 796 

Qy 758 TATGAGTGC 

CATGCATCCAATTCCCAAGGACAGGCTTCAGCATCAGC 80 4 

I I I I I I I I I I I II II I I I I I I I I I 

Db 795 

GTGGTTAGGGGAGCACTGTGGGCAGGAATAGAAGGAGGAAAAGCAGACTGGAGCCTAATC 736 

Qy 8 05 AAAAAT T AC AG TGGTTGA — TGCCT — TACATGA 

AATACC 8 40 

I I I I I I II II I I I I I I I I I I I I I I I 

Db 735 

CAGTACTTCAGATGAGAAATGACAGATTTGCCTGGTACATCATTTCCTATCCCTTTACTT 6 76 

Qy 8 41 AGTGAAAA 

AAGGTGA 855 

I I I I I I I II 

I I I 

Db 675 

ATAACTTATTGGTGTATTTTTATTTTAAGTGTAAATTTTTATAAATGAAGCATAATTTGA 616 

Qy 856 AGGT GCCGAG CTATAAAC 

CTCCA 8 78 

III III I I I I I I I I I I I 

Db 615 

AGATTGATATGTTCGGACTTGTGTTTTCTATTATCATGGTGTCTCTCCACATACCTTTTC 556 

Qy 8 79 G AAT AT TAT TAG — 

TCTGCATGGTTAAAAGTAGTCATGGATAACTACAT 925 

I I I I I I I II III II I III II 

I 

Db 555 

CCTTTAGTTCTGCTTCTTATTGTGCTCCTCAAGTCTATTTTTGTTCTTTTCCCTCTACTT 496 

Qy 926 TACCTGTTCTTGCCT-AATAAG TTTCTTTTAATCCA 

ATCCACTAACACTTT 9 75 

I I I I I I II I I I I I I I I I I I III I I I I I I 

I I I 

Db 495 

CTACTGTTTTGAACTGTATATGCTATTTCTATTTTGCCAGGAGTTACCCCCTTCAATTTT 436 

Qy 9 76 AGTTATATTCACT-GGTTT — TACACAGAGAAATACAAAATAAAGATC- 

ACACATCAAGA 1031 

I I I I I I I I I III I I II I I III I I I I I I I 

Db 435 

CGTTGTACTCTCTCAGTCTCAAACACATACCCATGCTCTCTTCTGTCCTTCTCTTCATAT 3 76 

Qy 1032 C T AT C T AC AAAAAT T TAT TAT A TATTTACAG- 

AAGA 1066 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 375 C- 

ATCTTCTAGAATTAGTTCTACCCCGATTTTCTACCAATTTATTTTATTTCCAATCAGT 317 



Qy 106 7 A AAGCATGCATATCA 

T T AAAC AAAT AAAAT AC T T T T T AT C AC AAAAAAAA 1116 

I I II I I I I I II I I I I I II I I I I I I 

Db 316 

ACATTGATTCAGATTTATCAATATGTTTTACCAATTTTATCATTTTTTTTGGAGGCTTGT 2 5 7 

Qy 1117 AAAAAAA 1123 

I I I 

Db 256 TATTTAA 250 



RESULT 15 

US-10-478-914-60/C 
; Patent No. 7335755
Gaps 95;

Qy 1 GCCGCT GC-CA CC 

GCAC C 17 



Db 2020 

GCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTAGATGGAGTTTTGCTCTTTTGC 1961 

Qy 18 CC GCCATGGAGCGGCCGTCGCTGCGCGCCCTGCTCC-TCGGCGCCG- 

CTG 65 



Db 196 0 CCAGGCTGGAGTGCAATGGCGCGATC-TCG GCTCACTGCAACTTC — 

CGCCTCCTG 1908 

Qy 66 GGCTGCTGCTCCTGCTCCTGCCCCTCTCCTCTTCCTCCT CT 

TC 108 



Db 19 0 7 GGTTCAAGCTATT-CTCCTG- 



CCTCAGCCTCCTGAGTAGCTGAGGTGAGAGGAC 1856 

Qy 109 GGACACCTGCGGCCCCTGC — GAGCC GGCCTC CTG 

CCCG 145 

I I I I I I I I I I I I I I I II I III I 

I I 

Db 1855 TGACGCGAGAGG — 

ATTGCTTGAGCCTAGGAGGTTGAGGTTGCAGTGAGCTGTGTTCACG 179 8 

Qy 146 CCCCTG — CCCCC-GCTGGGCTGCCTGCTGGGCGAGACCC- 

GCGACGCGTGCGGCTGCTG 201 

I I I I I III I I I I I I I I I I I I I I I I I I I I I 

Db 179 7 CCACTGGACTCCAGGCTGGG-TG 

AC AAAGC AAG AC C C T G T C T C AAAAAC AAAAAC AA 1742 

Qy 202 CCCTATGTGC 

GCCCGCGGCGAGGGC G 227 

III I I I I I 

Db 1741 

ACAAACAAACAAACAAAAAAATAACCAACAAAAAAAACCCTGAGTTTAAGTTTGATTTTA 16 82 

Qy 22 8 AGCCGTGCGGGGGTGGCGG CGCCGGCAGGGGGTACTGCG- 

CGCCGGGCATGGA 2 79 

I I I I I III III I I I I I I I I I I I I I 

I I I I 

Db 16 81 AGACCTGTAGGGACAGCAGATTAGTTCTACTTTAATGGGTA-TGCATCATC 

TGGA 1628 

Qy 280 GTGCGTGAAGAGCCGCAAGAGGCGGAAGGGTAAAGCCGGG 

GCAGCAGCCGGCG 332 

II I I I I I I I I I I I I III I I I I I I 

I 

Db 1627 AAGC 

AAGGGTCATAAAAAGAAGATACATCTTCACTGGCACCATCACATCAGTCATCA 1571 

Qy 333 GTCCGGGTGTAAGCGGCGTGTGCGTGTGCAAGAGCC 

GCTACC 3 74 

I I I I III I I I I I I II I I I I I 

Db 1570 GTCC T TAG AT AC - 

AGAACCTTTACTGACTCCCTCAAAACTTCCATGCTTCCAAAC 1517 

Qy 3 75 CGGTGTGCGGCAGCGACGGCACCA 

CCTACCCGAGC 409 

I I I I I I I I I I I I I I I I I 

Db 1516 

TAACCAAGCAGTTCTTCTACATTGGGATCATCGAATTTCCCTTCATGTTCCAACTCTCCT 145 7 

Qy 410 — GGCTGCCAGCTGCGC GCCGCC — AGCCA GAGGGCC 

GAG AG 447 

I I I I I I I I I I I I I I I I II II III 

Db 1456 

CTAGCTTCAATATACACATAGACAACCTTAGCCATCTGAAAACCTACTTTTAGAATATTC 139 7 

Qy 448 CCGCGGGG AGAAGGCCATCACCCAGGT 

CAGCAAGG 482 

I I I I I I I I I I I I I I I I I III 

Db 1396 

CCGAGTGGTTCCACGTAAGTCCTCTGCCAGATTCACATACCCATAAATTCTACTTCATAA 133 7 

Qy 483 GCACCTGC 



GAG 493 

I I I I I 

Db 1336 

GC C CAT AC T G T T C T T AAAAAAGAAAT AGAAAAAAT AT ATC AAT AT AAC T T T AAAAT AAG A 12 7 7 

Qy 49 4 CAAGGTCCTTC 

CATAGTGACGCCC 517 

II I I I I I I I I I I I I I I 

Db 1276 

CACAAACCTTCATGAAAATATAGTCATCAAATTTATTTTCATTGGGATGCCATTTTTTGA 1217 

Qy 518 CCCAAG GACAT 

CTGGAATGTC 53 8 

I I I I I I I I I I I I I I I 

Db 1216 

AG AAT T C C T AAG AC T AAT G T T T C T T G AC AT GC AAGAGT T AGC AT T AAT AGC T T AC G T T AC 115 7 

Qy 539 ACTGGTGCCCAGGTGTACTTGAGCTGT GAG 

GTCATCGGAATCC 581 

I I I I I I I I III I I I I I III II 

Db 1156 

TATAAATACTGCTGCTTGGAAGCAGTACAACTGTTTTAGAGTTTTAAGACTACAGACTTT 109 7 

Qy 582 CG — ACACCTGTCCTCATCTG GAACA — 

AGGTAAAA 613 

I III I I I I I I I I I I I 

Db 1096 

CATTACTCAAATCTTATTCAGTAAATGTAAAAATCAGAAGGTTCTGAACAGCTGGTTAGG 103 7 

Qy 614 AGGG — GTCACTATGGAG TTCAAAGGACAG — 

AACTCCTG 649 



Db 1036 

AAGGTAGCCAAGATGCAGGAAAGATGTCTGCGCCTCCTTTTCAAGGGCAGCCAACTCTTG 9 7 7 

Qy 650 CC TGGTGACCGGGA CAACCTGGCCATTCAGAC 

CCGGGGTGGCCCA 69 4 

I I I I I I I I I I I I I I I I I I I I I I 

Db 9 76 AACAGTAGGTG- 

CCAAAATATCCACATGGCCTTTATAGCTTCCACCTGCGTCACACACAA 918 

Qy 695 GAAAAGCATGAAGTAAC-TGGCTG 

GGTGC — T 723 



Db 917 

ATAACAAAGTAATGTAAATGACATGCTTGAAATAACTTGTCAGCAATACTTTGGCTCAAT 85 8 

Qy 724 

GGTATCTCCTCTAAGTAAGGAAGATGCTGGAGAATATGAGTGCCATGCATCCAATTCCCA 7 83 



Db 857 TATATCACTTTAAACTTTGCTTGCAGCAATAAAATGT-ACTACTGGGCAAATTA- 

TCTCC 800 

Qy 78 4 AGGAC — AGGC T TC AGCATCAG CAAAAATTACAGTG 

GTTGATGCC 826 

I I I I I I I I I I I I I I I I I I I I I 

I I I 



Db 799 

AG AAC TTTGACTT T AAAAAAT G T T AAAT C AAT T AT T T C AAT AT AAAT AAAT AAT T AG G C C 7 4 0 



Qy 82 7 TTAC ATGAAATACCA 

GTGAAAAAAGGTGAAG— 857 

III I I I I I I I I I 

Db 739 

TCACTAACAGCTGTCCCAACACCTCTACCCACCCTTGGCTCGGAAACAACTAGTTAGGCC 6 8 0 

Qy 858 GTGCCGAGCTATAAA CCTCC 

AG AAT ATT A 8 86 



Db 679 

TGAGTCTTACAATTTTTCAGGAACAAGCTAACAAAAGGGCATCCCAGGCAGAGGAAATAA 62 0 

Qy 887 TTAGTCTGCATGGTTAAAAG TAGTCATGGATAACTACATTACCTGTTC 

TTGC 938 



Db 619 TATG-CTGAATTACCACCTGAGTTAAGTCAAAGGCTTCTG- 

ATTCCTTGTTCAAAATCAC 562 

Qy 939 CTAATA AGTTTCTT — TTAATCCA 

ATCCACTAACACTTTAGTTATAT 9 83 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 561 CTATTATGCTCTTGTTTCCTAATAATTCCACAAAATGTTCAACT- 

ACTCTTATGACATTA 503 

Qy 984 TCACT GGTTTTACACAG-AGAAATACA 

AAATAAAGATCACACA 1025 

I I I I I II I I I I III I I I I I I I I I 

Db 502 

ACACTTCAAGAGAAAGAGCTAAGCTGAAGTCTTTCATTCACATAAAGATGACACAATTTA 4 43 
Qy 1026 TCAAGAC 

T AT C T AC AAAAAT TTATTATATATT T AC AG AAG AAAAG 10 7 0 

I I I I I III I I I I I I I I I II II 

Db 442 

AAAATGCTACAAGGCAGGTATGGACAAAGTACTATGAGGGTAATAAGGAGGGAATGATTA 3 83 

Qy 10 71 -CATGC ATATCATTAAACAAAT 

AAAATACTTTTTATCACAAAAAAAA 1116 



Db 3 82 AC AC G C T AAAAAT AT AC T C T T AAC C T AAT G AAAT G AAAG TACTGCCT- 

TCCCAAACCGAA 32 4 



1117 AAAAAAAA 1124 
I I I I 

323 GTTCAAAA 316 
RESULT 1
US-10-478-914-4 

; Sequence 4, Application US/10478914

; Patent No. 7335755

; GENERAL INFORMATION:
Query Match 38.0%; Score 6539.5; DB 1; Length 3210;

Best Local Similarity 44.5%; Pred. No. 0; 

Matches 1573; Conservative 0; Mismatches 997; Indels 963; 
Gaps 215; 

Qy 1 TTGAAAT-CAGGAAATCAGGCCGGG CGCAGTGGCTC 

ATGCCT 41 

III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 7 TTGGCCTACTGGCC — 

CAGGCCTTGCACACGCAGGTGCTCTCTTAGTGCTCCCTGGCAAG 6 4 

Qy 42 GTAAGCC CAGCACT TTGG — GAGGCGGAGGCG — 

GGTGGATC C 8 0 



Db 65 

GATGGGAAGGCATGTTCAGTCCTAGGAGTAGGAAGGGGCAGAGGTGTTGATGGCCCCTAC 124

Qy 81 ATTGAGGTC AGGAGC TCAAGA 

CCAGCCTGGTCAACATGGTGAAA 124

I I I I I I I I I I I I I I I II I I I I I I II II 



Db 125 

AGAGCGGCCAAGGACAAGGAGCTGCTGTTCGAAACAGCCTTCCTGCTCCCCAACCTGCCT 184



Qy 125 

CCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGCGGGA- 
III I I I I II I I I I I I I I I 

Db 185 CCC — ACCCAACAGGTTTTGC ATATACTCTACTGGG 

AAGAGGGACACACCC 233 

Qy 178 GCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTG- 

CTTGAACCCGGGAG — 23 4 

I I I I I I I I I I II II I II 

Db 23 4 GACTGCA-TCACTGCCCTCCAAGTCTCTCC 

CTGCCCTGTCCAGCATCCAGGAGCA 287 

Qy 235 

GCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTGGGCGACAG 288



Db 288 CCCCTAGTTGGGGAAGC TTCTGTGACTCCCCCT ACAACAGCCTAG — 

GATGG 33 7 

Qy 289 

AGCAAGACTCTCTCAAAAAAAAAAAAAAAGAAAGAAAGAAATCAGAAAATCGACCACAGT 348



Db 33 8 AGTGGGGTT 

TGTGAACAAATGCAGAAGGCAGTCTTAGGGAGGTCAGCTGA 387 

Qy 3 49 GGTAGCCACCTGGCCTAATGCTGTGTTTTTG- 

TACC 3 83 

I I I I I I I I I I I I I I I I I II 
Db 388 CAT-GCC- 

CCTGGCCTGTGGCTGGGAAGTAGCAGAGGCTAAGGTTCTTCCCCGCTCTGGG 445

Qy 384 — TGACAGGGGT — CAC TCAT T T T AGGC AC AAC TC - 

CTTCATTCTTTGTGAAATT- 433 



Db 446 

GTTGCCAGGAGTAGCACTGGATCAGTCAGGTGACAGGGCTCTCCTCTCTCTGAGCAGGTC 505

Qy 43 4 -AGTGAGTTTCCTTCTA CCCGTCACCAGATTCAATATGT- 

TCTATTAATACACCGAT 488

I I I I I I I I I I II III I I I I I I I I I I I 
Db 506 CGGTG- 

GCAGCCTTCAAGGTCGCCACGCCGTATTCCCTGTATGTCTGT 552 

Qy 489 

AACCACAGGGGAAGGGCACTTGTCGCTCTCCCACCTGGTTACCACAGTCTCCATGGGTCT 548



Db 553 — CCCGAGGGGCAGAAC GTCACCCT— CACCTG- 

CAGGCTCTTGGGCCCT 597

Qy 549 TTTGCC GTGACC 

ACAAATAAAGGAAACACTCATCACTAGTATCT 592 



Db 59 8 GTGGACAAAGGGCACGATGTGACCTTCTACAAGACGTGGTACCG — 



177 
I I I I I 



CAGCTCGAGGGGCG 655 



Qy 593 AAGT-CGGGCTT- 

TACAGTAACTATGCACCTTCTGTGTGCTTCACCTCACTCTCTA 646

I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 656 AGGTGCAGACCTGCTCAG-AGCGCCGGCCCATC 

CGCAACCTCACGTTCCAGGAC 708

Qy 6 47 CTTCAAACAGC- 

CCATGGAGGGAGGTATTATTATACTCCTTATGTTGACAGTGAAGAATC 705

I I I I I I I I I I I I I I I I I III I I I I I I 

Db 709 CTTC-ACCTGCACCATGGAGG CCACC— 

AGGCTGCCAACACCAGCCA 752 

Qy 706 TGA 

GGCCCAGAGAGGTTGGGGACTTGAG T AAAG T C AC AC AGC C C T G AG AGGC AG G AC 762 

II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 753 CGACCTGGCTCAGCGC CACGGGCTGGAGT — CGGC-CTCCGACC 



Qy 763 CAGGGTTCCAT- 

TCCTGCTCTATCCAGTTCCAAGCCCTTGTGTTTTCCATTATGTTTAG- 82 0 

I I I I I I I I I I I I I I II I I I I II I I I 

I 

Db 796 

CATGGCAACTTCTCCATCACCATGCGCAACCTGACCCTGCTGGATAGCGGCCTCTACTGC 855 

Qy 821 TGCCTCTTTG CTAACAGCAACA-TCTG CAAGAT TTGTG- 

TTGGTT 863 

I I I I I II I I I I I II I I I I I I I I I I I 

Db 856 

TGCCTGGTGGTGGAGATCAGGCACCACCACTCGGAGCACAGGGTCCATGGTGCCATGGAG 915 

Qy 86 4 TTGATGGAGAACTCTAGCT CATCCACATGC-TAGTG — 

CCCAAGTGGTGGA 911 

II I I I I I II I I I I I I II I I I I I I I I 

Db 916 

CTGCAGGTGCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGTACCCAT 96 7 

Qy 912 GGGGCCACCTCAGCAGGTGGGT 

TCTGAATGCAGCCAAGGCTGTCCCCGCAATGGG 966 

I I I I I I I I I I I I I I I I I I I II I I I I 

I I I I 

Db 96 8 CCTCCTCC- C AGGAGAGTGAAAGT AAGGG AC C AAC C T CTTGCCCC- 

TTTTGGG 1018 

Qy 967 TGAGACTC — GCTCCAACTGCCCGC — 

CCTCAGAGCAGGTGCCTAAGTCCTCCCTGGCAC 1022 

I III II I I I I I I III III III III 

I I 

Db 1019 T TCTCTGTTTTCTTCTGTCCTCATCCT GCA 

CCCAGACCCTGTTTGGAAC 106 7 

Qy 1023 

TGGCAGGCCTTACCTCACATTGCTAAATTAAAGCAATGCAATTCCTCTTGGGTAAGAGGA 10 82 
I I I I I I I I I I I I I I I III III II 

Db 106 8 T — CTGGCCTCATCACCC CAAGCCCT- 

CAGAACCCCCCGG 1104 



Qy 1083 

ATTCCTCCTTCTTTACTAACTGATCCCCAGCAAGGAAATAAAATGTTAGGCTTTAAAAAT 1142 

I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 1105 — TCCTCCTCCTTTTCT-GCTGCTGCACATCCC 

TTCTGCTT 1142 

Qy 1143 CCCTACTTTGTCATATCAGACTATATTCTAAAACTATATTTGAGCGAAACCTGTCAT- 

TG 1201 



Db 1143 -CCTCCTTGGTGCAATC — CCCAGAAGCCCACTCT- 

CCTTCCATCTG 1185 

Qy 12 02 CGTCT — AATTTCAAATATAC- 

AGAATCTCCTTAAGAGCTGTTGCCTTATTTTTTTGTAA 12 5 8 



Db 1186 C-TCTGGAGTCTCTGCTCCTCTTGACTCTCTGGAGTGGCTG-TGCCTTG- 

GCAG 1236 

Qy 12 59 AGCCTCTCTGACATCAAATGGGGAGAAATGGTGGCACCTCCAGACA 

CC 1306 



Db 123 7 TGAC-CTTTGGC — CA GGGCAA GTG 

CCTCATGACAGGTACTGGGTGCCC 12 82 

Qy 1307 CT- 

GAAACTACACACCATTTCTTCCCTGCTCAGCTTCTGCTCAGGAGTTCTGTGAGCTAT 1365 



Db 1283 CAGGCAGCTAAGTGCCG CCCTGCCCA CCAGCCCC CTATG- 

GCT-T 1325 

Qy 1366 GGGAAGGCCATTGGTTGTATTTG- 

CTACTTTTACTTTCATCTTCCTCTGCTGTAG 1419 

I I I I I I I I III I I I I I I I I I I I I I I I I I 

Db 1326 GGGAAGGCTGGGGGTCCTCTTGGCCAACAGGGTGAAACCCCAT- 

CTCTACTAAAAACACA 13 8 4 

Qy 142 0 AGCCATTTAATGT 

TATTGTCATATGCTGCTGGTGAGGTAAAGGT 1463 



Db 13 85 AAAATTAGCCAGGCATGGTGGTGCGCGCCTGTAGTCCCA- 

GCTACTCAAGAGGCTGAGGC 14 43 

Qy 146 4 GGGTCCGGGTGCCTT CCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTT 

CAGC 1516 



Db 1444 AAG— AGAATCGCTTGAACCCGGGAGTCAGAGGTTG — 

CAGTGAGCCGAGATCGCGCCAC 1499 

Qy 1517 AGGAGTTCAGAGGGCTTAT G AT GG AT GGTGAGAGATT 

T G AC AAC C AC C AG AGC AC — 1571 

I I I I I I III I I I I I I I I I 

Db 150 0 TGCACTCCAGCCTGCT 

GATAGAGCAAGACTCCATCTCAAAAAAAACAACCCAA 1552 



Qy 1572 -ATGTGCTCTGACCCTCTCCTGGGCATTGGT— TCCTGCTGGTAC- 

CGGGCGGTTCAGAC 162 7 



Db 1553 AATTTGC-CTGGCATGGTGGCAGGCATCTGTAATCC — 

CAGCTACTCGGGAGGCTGAGAC 16 09 

Qy 162 8 CTTCAAATAGGTTGCTTTCAAAAGAGCTTTCAGGCACTTAT- 

TGAGAATTAATGTT 16 82 

I I I I I I I I I II I I I I I I 

Db 1610 AT GAGA GCTGCTTGAACCTGGG 

AGGCAGAAGTTGCAGTGAGCCGAGATCAC 166 0 

Qy 16 83 TAAACAGACATAATAGCCTAGATGA AC 

TCCCAAGAGATCTATTAAAT 172 9 



Db 1661 ACCACTG- 

CACTCCAGCCTGGGTGACAGAGCGAGACTCTGTCCCAAAAAATCAAAAAAAT 1719 

Qy 173 0 — CTTGTGG GCTGAA TAAATATCTCGTGCAGGAC- 

TGTGCAACAGTAGC 17 75 



Db 172 0 CACTTTTGGTAGAGATGCACTCTCGCTATGTTGCCCAGGCTGGTCTTGAACTCCTG- 

GGC 1778 

Qy 1776 CCA GAGCATCCTGCCTGTGGGCATCCACCTCCCAGGTGAGGG 

CAGTGGGAAG 182 7 



Db 1779 TCAATTGATCTTCCCACCT-TG 

ACCTCCAAAGTGCTGGGATTACAGGTGTGAG 1830 

Qy 182 8 CTGGCCCGAC-GGCAGCCAGAACTTGTT-TCTCACCTCCCACCAGCA- 

ACCCCCCACCCA 1884 



Db 1831 

CCACCATGCCTAGCCTCAGGGAATTCTTATAAGAACTCTATGAAGTAGGCATCACCATCT 189 0 

Qy 18 85 ACTCTGGGCCC C AGGC AC AC G AAGC AC AAG T 

CTCAGGGGACCATTCCCACAT 1936 



Db 1891 TCTCTGTATC C ATGGAAAGAGAGGC - 

C TAG AG AT G T AT GC T AAC T T GC C C AAGC T C AC AT 1949 

Qy 1937 TG GGGGATC CTGAGG GAGC CC 

ATCACCGCCTCTT 1970 



Db 1950 

AGCCCAGGGTAGCATAGCTGGGATGTTGAGCTGAGGCCGGAGGAGAAGTAGCAGTCGCTG 2009

Qy 19 71 GCATA-CA-ACTGTC — CACTAGG AGGCAC— 

GCC 1999 

I I I I I I I I I I I I I I I I I I I I 

Db 2010 

GCAGAGCACACAGGCTGCTCTGGGGGATGAGCTGGTGCGTTTAAGGAACAGGCCAGCACT 2069



Qy 2000 CAGTG TGGGAGAGATGT ATGGTCT- 

TGCCTTCCA 2032 



Db 2070 

GGCATTCGCAAGCAGTGGGGAAGGGGAGAGATGCCGAGGTGGTCAGTATCCTGACTTTCA 2129
Qy 2033 

CCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGGAACTGCATGCCGTGAA 2088



Db 213 0 GAGGCCTTTTTT TGTTTGTTTTAAT TTTTGCTAGATTGATAT- 

TAAA 2175 

Qy 2 0 89 AACTCCTAATGGTGTGGAACTTA — 

GTTTGAATTTGAAATCACGCCGCATGCACAAAGGG 2146 



Db 2176 AACTC ATGTGGAGGAACTCAAGGAATG-TTTAGAAG 

ACCAAAAGTCCCCAATG 222 7 

Qy 2147 ACAGGCCCAGGCCCGAC CTCAGGTCA-TCC- 

GC 2177 



Db 2228 

ACAGGAACAAAAGCAACCAATTTTTAACTTTCTCTTCTCATTCCTGTTTTCATTGATTTC 2287

Qy 2178 CCGCTGGCTGCAGAGCAT CCCTGGGAG-CCAAGGCGA 

GGCCCGTGGAGC- 2225 



Db 2288 CCAC— 

ATGTAGTCCTTTTGCTCAGGAAGTCTTTGGGGAAATTAAGGATCTTTGAAGCT 2344

Qy 2226 

CTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTCGTGCACTTATTATGC- 
I I I I I II I I I I I I I I I I I I 

Db 23 45 CTGAAATAGGT GATC- 

AGGTTAGTGGTGTCTGTCAGCTGTCTAAGAGGTTGGAAAAT 2 4 0 0 

Qy 2275 -ACCACCTCCCTTCAGTCAC — CACTCCT CTTCCTC-CGCCAT 

CC 2315 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 2401 

G AAC T AC T C AAG AT AG T C AC G AAAAT AC TGAAAGT T TGAT TTTTCTTTCCATATTT GAAT 2 46 0 

Qy 2316 TCATTTATACTGATTGC 

ACACCCCCCGCTCAAACA— 2350 

I I I I I I I I I I I I I I I I I I I 

Db 2461 

TAATTTTTTCTGTTTGACTGGAAGGGGTTTTTGTATAACTAAAACCTCAGCGCATAAAGG 2520

Qy 2351 ACAATGTCC T TAT TAT GAT G AC CAT CTCG 

TAGT 2383 



Db 2521 

AGATTTAAAAGGAGCACATGATTTAGTGGGTGGGCCATGAAACTAGAGATGGGATTTGGG 2580



2274 

II III 



Qy 23 8 4 GGTACAT TCCATTCCT — ATTTAAGGTAAGCC CAA 

AGCC 2 42 0 

III II I I I I II I I I I I III II I I I I 

Db 2581 

GGTGAATTTGTCAATATCTGGATTTTAATCCAGACATCTCTGCTAACGAGCCTTTGGTAA 2640

Qy 2 421 — CACTTTTGG ATTTTCTC 

GACTGTCCGAGAAAAGTTGTGT 2 459 

I I I I I I I I I I I I I I I I I II I III 

Db 26 41 GTCACTTCAGATACTTTTCCTCCTTTTTACAAAGAGAGGGCTGGCTTAGTTAT- 

TTGCCA 2699 

Qy 2 46 0 AAGC GCCTGCGTTCTTCTGG 

GTTTGGCTAGATAGGGTTGTGTCCCTCT 250 7 

I I I I I I I I I III I I III II I I I I I III 

I I I I 

Db 2 70 0 AAGCCCCTTCCAGGCCTGAATTCCACAAGTACGATTTACT — 

GTAGTGTCTTATCACTCT 2757 

Qy 250 8 ATGGAATGGAG 

AGTGATGTGGGCAAGGGTGTCATTTTCTCGCACA 2552 

III I I I I I I I I I I I I I I I I I I III 

I I 

Db 2 758 TTCATGTCACAATAGCGTGGAGCATTAGAGA-AAAGCCTAGACTTTTAGTTGATAGC- 



Qy 2553 AT— 

AC AAC TCACTGAGGATGCTTCTG T AG AAG T G AG AAAC AC GAT GAG T AC AT T C AG AA 2610 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2816 

GTTGAAATATCATTGATAGAATTTTAGTTTTAGGAAAAATTGGTTTGATTTCTAGCTTTA 2 8 75 

Qy 2611 

TTACAATAACTCACTCTCACTGGGTAACTTCTCATGATAGATTTGTATGATCAATACGGG 26 7 0 

I I I I I I I I I I I I III I I I I I I I I 

I 

Db 2 8 76 TTACTAT TAGGTATGTGAGCTTGGGCAAATCGCTTAATCTTT- 

GAG 2920 

Qy 26 71 TCTATTTTTATGTC-AACTGAACACTGTAGGGTA CC 

TTCCAGTCTTTT 2717 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I 

I 

Db 2921 TCTAGTTTTCTCTCAAAATGAGAACATTAGGCTAAATGATTTCCGAGTTTCCAG- 

CTAGT 2 9 79 

Qy 2 718 TCAAGATTGTTAAATTGAGACAAGTAATTGAA- 

TAATTTGTCCTATTTTTATTTTAAAAA 2776 

I I I I I I I I I I III I I I I I I I I I I I I I I I III I 

I I 

Db 29 8 0 C C TAG AG T T C TAT AT T TC T AC A — 

TAGTTGAATTATTTTATCATGCTGTTGCTGGGGAAT 3037 

Qy 2 777 AAG TGAATGGACTGA AATGTTAAATGTGAATGTACAT — 

TTCTTAA 2820 

II I I I I I I I I I I I I I I I I I I I I I II 



Db 3038 ATGACTAACCCTTTTGAAGCTACTAATTTTATGTCGAGCTTTAAAGTCCATAATTGTTAT 3097



Qy 2 821 TTGCA ATTTTT CTACTGAGTGT TTGCACTATA 

CTTT — 2856 

III I I I I I I I I I I I I I I I I I I I I I I 

Db 3098 

CTTCAGAAAATATTATTTGACCTACAGTATGTCCAAATCAATTTAATAAAATCGCTTTAT 3157

Qy 2 857 — C T GG AAT C T T AT T T AAC AAAAAT AAAGGGAAAAAA TTGCTTGA 2 899 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3158 AACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCCACATGTGCTCGA 3210 
Qy 1 TTGAAATCAGGAAATCAGGC

CGGGCGCAGTGGCTCATGCCTGTAAGCCCAG 51 



Db 3170 TTTTTTTCCTGTTATAAAGCGATTTTATTAAATTGATTTGG-ACAT- 

ACTGTAGGTCAAA 3113 

Qy 52 CACTTTGGGAGGCGGAGGCGGGTGGATCCATTGAGGTC — 

AGGAGCTCAAGACCAGCCTG 109 



Db 3112 TAATAT TTTCTGAAG 

AT AAC AAT T AT GG AC T T T AAAGC T C G AC AT AAAAT T A 3061 

Qy 110 G TCAACATGGT GAAACCCCGTC- 

TCT ACT AAAAAT AC AAAAAAAAAATT 157 



Db 3 06 0 GTAGCTTCAAAAGGGTTAGTCATATTCCCCAGCAACAGC — 

ATGATAAAATAATTCAACT 3 0 03 



Qy 158 AGCTGGGCGTGTTGGCGGGAGCCTGTAGTCCCAGCT ACACGGG- 

AGGCT 2 05 



Db 3 0 02 ATGTAGAAATATAG 

AAC TCTAGGACTAGCTG G AAAC T C G G AAAT C AT T T AGC C T 2 9 49 

Qy 2 06 AAGGTGGGAGAATTGCTTGAACCCGGGAGGCGGAGGTT — GCAGTGAG- 

CTGAGATTGC- 261 



Db 2948 

AAT GTTCTCATTTT G AG AG AAAAC T AG AC TC AAAGAT T AAGCGAT T T GC C C AAGC T C AC A 2 8 89 
Qy 262 -ACC— 

ATTGCACTCCAGCCTGGGCGACAGAGCAAGACTCTCTCAAAAAAAAAAAAAAAG 318 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2888 TACCTAATAGTAATAAAG- 

CTAGAAATCAAACCAATTTTTCCTAAAACTAAAATTCTATC 2830

Qy 319 AAAGA-AAGAAATCAGAAAATCGACCACAGTGGTAGCCACCTGGC-CTAATGCT — 

GTGT 374 



Db 2 829 AATGATATTTCAACTGGCTATCAACTAAAAGTCTAG — 

GCTTTTCTCTAATGCTCCACGC 2772 

Qy 3 75 TTTTGT-ACCTGACAGGG GTCACTC 

ATTTTAGGC- 40 7 



Db 2771 

TATTGTGACATGAAAGAGTGATAAGACACTACAGTAAATCGTACTTGTGGAATTCAGGCC 2712

Qy 40 8 ACAACT 

CCTTCATTCTTTGTGAAAT 432 

I I I I I I I I I I I I I I I I I I I 

Db 2 711 TGGAAGGGGCTTTGGCAAATAACTAAGCCAGCCCTC — 

TCTTTGTAAAAAGGAGGAAAAG 2654 

Qy 433 TAGTGAGTTTCCTTCTACCCGTCACCAG 

ATTCAATATGT 471 



Db 2653 

TATCTGAAGTGACTTACCAAAGGCTCGTTAGCAGAGATGTCTGGATTAAAATCCAGATAT 2594

Qy 4 72 T — CTATTAATACACCGATAACCACAGGGGAAGGGCACTTG 

GCTCTCCCACCT 52 4 

I I I I I I I I III I I I I 

Db 2 593 TGACAAATTCACCCCCAAATCCCAT 

CTCTAGTTTCATGGCCCACCCACTA 2544 

Qy 52 5 GGT TACCAC 

TTTTGCCGTGACCAC 562 



— TC 

II I I I I I I I I 



-AGTCTCCAT GGGTC 

I I I I I I I III I I I I 



Db 2543 

AATCATGTGCTCCTTTTAAATCTCCTTTATGCGCTGAGGTTTTAGTTATACAAAAACCCC 2484

Qy 563 AAATAAAGGAAACACTCATCACTAGTAT 

CTAAGTCGGGCTTTACAGTAA 611 

II I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 2 4 83 T T C C AG T C AAAC AGAAAAAAT T AAT TC AAAT ATGG AAAG AAAAAT C AAAC T T T - 

CAGTAT 2 42 5 

Qy 612 CTATGCACCT-TC-TGTGTGCTTCA CCTCACTCT CTACT- 

TCAAACAGCCCA 66 0 

I I I I I I I I I I I I I I II I I I I I II I I I I I I 

I 

Db 2424 

TTTCGTGACTATCTTGAGTAGTTCATTTTCCAACCTCTTAGACAGCTGACAGACACCAC- 2366 

Qy 661 TGGAGGGAGGTATTATTATACTCCTTATGTTGA-CAGTGAAGAATCTGAGG 

CCCAGA 716 

II I I I I I I I I III I I I I III I I I I 

I 

Db 2365 TAACCTGAT- 

CACCTATTTCAGAGCTTCAAAGATCCTTAATTTCCCCAAA 2317 

Qy 717 GAGGTTGGGGACT T GAG T AAAG T C 

ACACAGCCCTGAGAGGC 757 

III I I I I I III I I I I I I I I I I I I I I I 

Db 2316 

GACTTCCTGAGCAAAAGGACTACATGTGGGAAATCAATGAAAACAGGAATGAGAAGAGAA 2257 

Qy 758 AGGACCAGG GTTCCATTC CTGCT 

CTATCC 786 

II II I I I I I I I I I I I I 

I I I 

Db 2256 

AGTTAAAAATTGGTTGCTTTTGTTCCTGTCATTGGGGACTTTTGGTCTTCTAAACATTCC 2197

Qy 787 AGTTCCAAGCCCTTGTGTTTTCCAT-TATGTTTAGTGCCTCTT — 

TGCTAACAGCAA 8 40 

I I I I I I I I I I I I I I I III I I I I II I I I I I 

I I 

Db 2196 TTGAGTTCCTC- 

C AC AT GAG TTTTTAATATCAATC TAGC AAAAAT T AAAAC AAAC AAAAA 213 8 

Qy 8 41 CATCTGCAAGATTTGTGTTGGTTTTGA 

T GG AG AAC T C T AGC T CATC C AC A- 89 0 

I I I I I I I II II III II I I I I I I I I I I 

I I 

Db 213 7 AAGGCCTCTGAAA GTCAGGATACTGACCACCTCGGCATCTCTC- 

CCCTTCCCCAC 2084 

Qy 891 TGCTAGTGCCCAAGTGGTGG-AGGGGCC 

ACCTCAGCAGGTGGGTTCTGAATGC 9 42 

I I I I I I I I I I I I I I I I II I I I I I I I I I I 

I I 

Db 2 0 83 TGCTTGCGAATGCC — 

AGTGCTGGCCTGTTCCTTAAACGCACCAGCTCATCCCCCAGAGC 2 02 6 

Qy 9 43 AGCC— AAGGCTGTCCCCGCAATGGGTGAGACTCGCTCCAACTGC- 

CCGCCCTCAGAGCA 999 



Db 2 025 AGCCTGTGTGCTCTGCCAGC 

GCTACTTCTCCTCCGGCCTCAGCTCA 1976 

Qy 1000 GGTGCCTAAGT CCTCCCT-GGCACTG GC — AGGCC — 

TTACCTCACATTGCTAA 10 48 

III I I I I I I I I I II II III I I I I I 

Db 1975 

ACATCCCAGCTATGCTACCCTGGGCTATGTGAGCTTGGGCAAGTTAGCATACATCTCT — 1918 

Qy 10 49 ATTAAAGCAATGCAATTCCTCTTGGGTA AG AG G A 

ATTCCTCCTTCTT T 1096 



Db 1917 AGGCCT — CTCTTTC- 

C AT GG AT AC AG AG AAG AT GG T G AT GC C T AC T T C AT AGAG T 1865 

Qy 109 7 AC T AAC T — GAT C C C C AGC AAGGAAAT AAAATG T T AGGC T T T AA 

AC 1148 



Db 186 4 TCTTATAAGAATTCCCTG — AGGCTAGGCATGGT — 

GGCTCACACCTGTAATCCCAGCAC 18 09 

Qy 1149 TTT GTCAT- 

ATCAGACTATATTCTAAAACTATATTTGAGCGAAACCTGTCATTGCG 12 03 



Db 1808 TTTGGAGGTCAAGGTGGGAAGATCAATTGAGCCCAGG — 

AGTTCAAGACCAGCCTGGGCA 1751 

Qy 1204 TCTAATTTCAAATATACAGAATCT-CCTTAAGAGCTGTTGCCTTATTTTTTTG 

TAAA 1259 



Db 175 0 AC — ATAGCGAGAGTGCA-TCTCTACCAAAAGTGATTTT — 

TTTGATTTTTTGGGACAGA 1696 

Qy 126 0 GCCT — CTCTGACATCAA — 

ATGGGGAGAAATGGTGGCACCTCCAGACACCCTGAAACTA 1315 



Db 1695 GTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGTGATCTC — 

GGCTCACTGCAACTT 1638 

Qy 1316 C— ACACCAT— TTCT TC-CCTG-CTCAGCTTC 

TGCTCAGGAGTTCTG 1357 



Db 163 7 CTGCCTCCCAGGTTCAAGCAGCTCTCATGTCTCAGCCTCCCGAGTAGCT- 

GGGATTACAG 15 79 

Qy 135 8 TGAGCTATGGGAAGGCCA TTGGTTGTATTT 

GCTACTTTTACT 1399 



Db 1578 

ATGCCTGCCACCATGCCAGGCAAATTTTGGGTTGTTTTTTTTGAGATGGAGTCTTGCTCT 1519 



I I I I I I I I I I I I I I I I I I I I 
-GACT- 



AAATCCCT — 

I I I I I I 



Qy 140 0 TTCATCTTCCT CTGCTGTAGAGCCATTTAATGTTATTG — TCATATG- 

CTGCTGGTG 1453 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1518 

ATCAGCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGACTCCCG 1462 

Qy 1454 AGGTAAAGGTGGGTC-CGGGTGCCT TCCCAGGGGTTAGAGGATGTTCAAAGGGC- 

CG 1508 



Db 1461 -GGTTCAAGCGATTCTC — TTGCCTCAGCCTCTTGAGTAGCTGGGACTACA 

GGCGCG 140 8 

Qy 1509 ATTTCAGCAGGAGTTCAGAGGGCTTATGATGG 

AT GG T GAG AG AT T T G AC AAC C AC 1563 



Db 140 7 C-ACCACCATGCCT GGCTAATTTTTGTGTTTTTAGT- 

AGAGATGGGGTTTCACC 1356 

Qy 156 4 CAG— AGCACATGTG 

CTCTGACCCTCTCCTGGGCATTGGTTCCTGCTGGTACCGGGC 1618 



Db 1355 CTGTTGGC-CAAGAGGACCCCCAGCCTTC-CCAAGCCATAGGGGGCTGGTGG- 

GCAGGGC 1299 

Qy 1619 G G T T CAG AC C T T C AAAT AG 

GTTGCTTTCAAAAGAGCTTTCAGGCACTTATTGAG 1672 



Db 129 8 GGCACTTAGCTGCCTGGGGCACCCAGTACCTGTCATGAG-GC — 

ACTTGCCCTGGCCAAA 1242 

Qy 16 73 AATTAATGTTTA — AACAGACA TAATAGCCTAGATGAAC 

TCCCAAGAGATCT 172 2 

I I I I I I I I I I I I I I I I I I I I I II 

Db 1241 

GGTCACTGCCAAGGCACAGCCACTCCAGAGAGTCAAGAGGAGCAGAGACTCCAGAGCAGA 1182 

Qy 1723 ATTAAATCTTGTGGGCT GAAT AAAT AT C T C G 

TGCAGGACTGTGCAACA 1770 



Db 1181 

TGGAAGGAGAGTGGGCTTCTGGGGATTGCACCAAGGAGGAAGCAGAAGGGATGTGCAGCA 1122

Qy 1771 GTA GCCCAGAGCATCCT GCCTGTGG GCATCCA- 

CCT 1805 



Db 1121 

GCAGAAAAGGAGGAGGACCGGGGGGTTCTGAGGGCTTGGGGTGATGAGGCCAGAGTTCCA 1062 

Qy 18 06 CCCAGG TGAGGGCAGTGGGAAGCTG 

GCCCGACGGCAGCCAGAA 18 48 



Db 1061 

AACAGGGTCTGGGTGCAGGATGAGGACAGAAGAAAACAGAGAACCCAAAAGGGGCAAGAG 1002



Qy 18 49 CTT-GTTTCTCAC CTCCCACC 

AGCAACCCCCCACC 18 82 

I I I I I I I I I I I I I I I I I I I I I 

Db 1001 

GTTGGTCCCTTACTTTCACTCTCCTGGGAGGAGGATGGGTACACCACACAGTTGGATGGT 942

Qy 18 83 -CAACTCTGGGCCCCAG GCACACGAAGCACAAGTCTCAGGGGACCAT — 

TCCCACAT 1936 

I I I I I I III I I I I I I I I I I I I I I I I I I II 

Db 9 41 GCATCTTTG CCTGTCTGCACCTGCAGCTC 

CATGGCACCATGGACCCT 895 

Qy 193 7 TGGGGGATCC TGAGGGAGCCC-ATCACCGCCTCTTGCATACAACTGT 

CCACT 19 8 7 

I I I I I II I I I I I I I I I I I I I II I I I I I 

I I 

Db 89 4 GTGCTCCGAGTGGTGGTGCCTGATCTCCACCAC 

CAGGCAGCAGTAGAGGCCGCT 8 41 

Qy 19 8 8 AGGAGGCA CGCCCAGTGTGGGAG-AGATGT ATGGTC — 

TTGCC — 2027 

I III III I I I I I I I I I I I I I I I I 

Db 840 

ATCCAGCAGGGTCAGGTTGCGCATGGTGATGGAGAAGTTGCCATGGTGGTCGGAGGCCGA 781
Qy 2 02 8 TTCCA- 

CCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGGAACTGCATGCCGTG 2 0 86 

I I I I I I I I I I I I I I I I I I I I I II I 

I 

Db 78 0 CTCCAGCCCGTG GCGCTGAGCCAGGTCGTG 

GCTGGTGTTGGCAGCCTGG 732 

Qy 2087 AAAACTCC TAATGGTGTGGAACTTAGTT — 

TGAATTTGAAATCACGCCGCATGCAC 2140 

I I I I I I I I I I III III III I I I I 

I 

Db 731 TGGCCTCCATGGTGCAGGTGAAGGTCCTGGAACGTGAGGTTGCGGATGGGCCG- 

GCGCTC 673 

Qy 2141 AAAG— GGACAGGCCCAGGCCC GACCTC AGGT- 

CATC 2174 

I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 672 

TGAGCAGGTCTGCACCTCGCCCCTCGAGCTGCGGTACCACGTCTTGTAGAAGGTCACATC 613 

Qy 2175 -CGCCCGCTGGCTGCAGAGCATCCCTGGGAGCC — AAGGCGAG 

GCCCGT 2220 

I I I I III I I I I II I I I I I I I I I I I I 

I 

Db 612 GTGCCCTTTGTCCACAGGG 

CCCAAGAGCCTGCAGGTGAGGGTGACGTTCTGCCCCT 5 5 7 

Qy 2221 GGAGCCTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTCGTGCA — 

CTTATTATGCACCA 22 78 

I I I I I I II I I I I I I I I I I I I I I I 

I I I 

Db 556 CGGGACAGACATACAGGGAATACG-GC 

GTGGCGACCTTGAAGGCTGCCACCGGACCT 501 



Qy 2279 CCTC CCTTCAGTCACC — AC TCCTCTTC — CTCC- 

GCCATCCTC 2317 

III II ! I I I I I I II III II I I I I I I I 

Db 50 0 GCTCAGAGAGAGGAGAGCC — 

CTGTCACCTGACTGATCCAGTGCTACTCCTGGCAACCCC 443 

Qy 2318 A 

TTTATACTGATTGCACACCCCCCGCTCAAACAACAATGTCCTTAT 2363 

I I I I I I III I I I I I I I I I I I I 

Db 4 42 AGAGCGGGGAAGAACCTTAGCCT — CTGCTACTTCCCAGC- 

CACAGGCCAGGGGCAT — G 388 

Qy 2364 

TATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAAGGTAAGCCCAAAGCCCAC 2423

I I I I I I I I I I II I I I I I I II I II I I I I I 

I I I I I 

Db 387 TCAGCTGACC-TCCCTAAG AC-TGCCTT— CTGCATTTGTT 

CACAAACCCCAC 339 

Qy 2 42 4 TTTTGGATTTTCTCGACTGTC-CGAGAAAAGTTGTGTAAGCG CCTGC 

GTTCT 2474 

I I I I I I I I I I III I I I I III 

I I 

Db 338 T 

CCATCCTAGGCTGTTGTAGGGGGAGTCACAGAAGCTTCCCCAACTAGGGGTGCT 2 8 4 

Qy 2475 

TCTGGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGG 2534

I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I 

I 

Db 2 83 CCTGGAT GCTGGACAGGGCAGGGAGAGACT-TGGA — 

GGGCAGTGATGCAGTCGGGT 23 0 

Qy 2535 GTGTCATTTTCTC-GCACAATACAACTCACTGAGGATGCTT 

C T G T AG AAG T GAG AAA 2 59 0 

I I I I I III I I I I I I I I I I I I I I I I III 

Db 229 GTGTCC CTCTTCCCAGTAGA GTATATGCAAAACCTGTTG- 

GGTGGGAGG 182 

Qy 2591 

CACGATGAGTACATTCAGAATTACAATAACTCACTCTCACTGGGTAACTTCTCATGATAG 2650

I I I I I I I I I I I I I I I I I I I I I I 

Db 181 CAGGTTGGGGA GCAGGAAGGCTGTTTCGAAC AGCAGCTCCT 

TGT 138 

Qy 2651 

ATTTGTATGATCAATACGGGTCTATTTTTATGTCAACTGAACACTGTAGGGTACCTTCCA 2710
III I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 13 7 CCTTGGCCGCTCTGTAGGGGCC ATCAAC— ACCTCT 

GCCCCTTCC- 95 

Qy 2711 

GTCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTT 2770
II I I I I I 

Db 9 4 TAC 

TCCTA 8 7 



Qy 2771 

TAAAAAAAGTGAATGGACTGAAATGTTAAATGTGAATGTACATTTCTTAATTGCAATTTT 2830
I I I I I I I I III II II I 

Db 86 GGACTGAA CATG- 

CCTTCCCATCCTTGCCAGGGA 5 4 

Qy 2 831 TCTACTGAGTGTTTGCACTATACTTTCTGGAATCTTATTTAA 

CAAAAATAAAGGGAA 2887 

I I I I I I I I I I I III I I I I I I 

Db 53 GC-ACTAAGAG— AGCAC CTG 

CGTGTGCAAGGCCTGGGCCAGTAGGCC 9 

Qy 2888 AAAATTGC 2895 

II I III 
Db 8 AACAGTGC 1 
Qy 2 TGAAATCAGGAAATCAGGCCGGGCGC

AGTGGCTCATGCCTGTAAGCCCAGCACTTT 5 7 



Db 7 TGGATTC — GCCATCATTTTGCACACTGCGAGGGGCTCC-GTGTGTGCGCCCTG- 

TCTT- 61 

Qy 58 

GGGAGGCGGAGGCGGGTGGATCCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAACAT 117 



Db 62 GTCTGG CC GTC CTCAT- 

CCCTTCCCCCACCCCCT 9 4 



Qy 118 GGTGAAACCCCGTCT — C T AC T AAAAAT AC AAAAAAAAAAT TAG — CTGGGC- 

GTGTTGG 172 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



GCCGCCACTCCGACCCGCTCCCAAAGTGGCTTCACAATAGTCGGTCCTCGGCGGTGTAGG 15 4 

Qy 173 CGG-GAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGA 

ATTGCTTGA 225 

III III I I I I II I II II I I I I I 

Db 155 CTGCGCACCAG— GTCCACACTTAAGCGAAATCAA 

GGAGACCCCCTTCTACTTCT 207 



Qy 226 

ACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTGGGCGA 285

III II I I I I I I I I I III I I I I I I I I 

I I 

Db 208 ACCTTTGGGTTTGGTGCT-CAATGCGAAGC TGC TGCAACTCAGAC- 



Qy 2 86 CAGAGCAAGACTCTCTCAAAAAAAAAAAAAAAGAAAGAAAGAAATCAGAAAATC- 

GAC 3 42 

I II I I I I I I I I I I I I I I I I I I II III II 

I 

Db 2 56 C T AAG T C A- AC T C AT GC AG AAAAAGG AG AAAAG T T — 

TTGGTATACAAATGCTCTCAGTC 312 

Qy 3 43 CACAGTGGTAGCCACCTGGCCTAA TGCTG 

TGTTTTTGTACCT 384 

II I I I I I I I I I I I I I I I I I II I 

I I I 

Db 313 CA 

GCCAGACACCAAGCCGAAAGGTTGTGCTGGCTGCAACCGAAAGATCAAGGACCG 36 8 

Qy 3 85 G ACAGGGGTCACT 

CATTTTAGGCACAACTCCTTCATTCTTTGTGAAATTAG 435 

I I I I I I I II I I I I I I I I I I I I I 

I 

Db 369 

GTATCTTCTAAAGGCACTGGACAAATACTGGCATGAAGACTGCCTGAAGTGTGCCTGCTG 42 8 

Qy 436 TGAGTTTC CTTCTACCCGT- 

CACCAGATTCAATATGTTCTATTA 478 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 429 

TGACTGTCGCTTGGGGGAGGTGGGCTCCACCCTGTACACTAAAGCTAATCTTATCCTTTG 4 8 8 

Qy 4 79 ATACACCGA-TAAC CAC AGGGGAAGGGCACT TGT 

CGCT 515 



Db 489 

TCGCAGAGACTATCTGAGGCTCTTTGGTGTAACGGGAAACTGCGCTGCCTGTAGTAAGCT 5 4 8 

Qy 516 C-TCCC-ACC TGGT TACCACAGTC TCCA-TGGGTCT- 

TT 550 

I I I I I I I I I I I I I I I I I III I I I I 

I I 

Db 549 



CATCCCTGCCTTTGAGATGGTGATGCGTGCCAAGGACAATGTTTACCACCTGGACTGCTT 6 0 8 

Qy 551 TGCC G T G AC C AC AAA TAAAGGAAACA CT 

CATCA 5 83 



Db 609 

TGCATGTCAGCTTTGTAATCAGAGATTTTGTGTTGGAGACAAATTTTTCCTAAAGAATAA 66 8 

Qy 58 4 CTAGTATCT AAGTCGGGC TTTACAGTAA 

CTATGCACCTTC 623 



Db 669 

CATGATCCTTTGCCAGACGGACTACGAGGAAGGTTTAATGAAAGAAGGTTATGCACC — C 72 6 

Qy 62 4 TGTGTGCTTCACCTCACTCTCTACTTCAAACAGCCCATGGAG 

GGAG 669 

I I I I I I I I I I I I I I I I I I I II III 

Db 72 7 CAGG TTCGCTGATCTATCAACATCAC 

C C C AT T AAG AAT AC AG AGC AC T AC AT T 7 79 

Qy 6 70 GTAT-TATTATACTCCTTATG T T G AC AG T G AAG AAT - C T G A — 

GGCC 712 



Db 780 

CTTTTATCTTTTTTGCTCCACATGTACATAAGAATTGACACAGGAACCTACTGAATAGCG 839 

Qy 713 CAGAG AGGTTGGGGACTTG — AGTAAAGTCACACAGCC-CTG- 

AGA 754 



Db 840 

TAGATATAGGAAGGCAGGATGGTTATATGGAATAAAAGGCGGACTGCATCTGTATGTAGT 899 

Qy 755 GG CAGGACCAGGGTTCCATTCCTGCTCT 

ATCCAGTTCCAA — 79 4 

I I I I I I I I I I I III I 

Db 900 

GAAAT TGCCCCAGTT CAGAG T T G AAT G T T T AT TAT T AAAGAAAAAAG TAATGTACATATG 959 

Qy 795 GC CCTTGTGTTTTCCATTATGTTTAGTGCCTCTTTGC 

TAACAGCA 839 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 960 

GCTGGATTTTTTTGCTTGCTATTCGTTTTTGTGTCACTTGGCATGAGATGTTTATTTTGG 1019 
Qy 840 

ACATCTGCAAGATTTGTGTTGGTTTTGATGGAGAACTCTAGCTCATCCACATGCTAGTG- 89 8 



Db 1020 

ACTATTGTATATAATGTATTG T AAT AT T TGAAGC AC AAATGT AAT AC AGT T TTATTGTGT 1079 

Qy 899 -CCCAAGTGGTGGAGGGGCCA CCTCAGCAGGTGGGTTC — 

TGAATGCAGCCAAGGC 951 



Db 1080 TACCATTTG- 



TGTTCCATTTGCTTCTTTGTATTGTTGCATTTAGTACAATCAGTGT 113 4 



Qy 952 TGTCCCCGCAATGGGTG AGACTCGCTCCAACTGCCCGCCCT 

CAGAGCAGGTG 10 03 

II I I I I I I I I I I I I I I I I I 

Db 1135 T- 

TAAACTTACTGTATATTTATGCTTTCTGTATTTACCAGCTATTTTAAATGAGCTGTAA 1193 

Qy 1004 C CTA AGTCCTCCCTG GCACTG 

GCAGG 1029 



Db 1194 

C T T T C T AG TAAAGAAT T G AAAAGC AAAT C TC AC T AATGAT AC AC AG AT AG AT AAAGC AAG 12 53 

Qy 103 0 CCTTACCTC ACATTGCT — AAATTAAAG CAATGCAAT 

TCC 1067 



Db 1254 

T C T AT C AAC AT T AAAAAT AC T AAAAAAT AAAG AC AC AC AC AGAGC AT T T T AG T G AC AT C C 1313 

Qy 106 8 TCT TGGGTAAGAGGAATTCCTCCTTCTT- 

T AC T AAC T GAT C C C C AG 1112 

II I I I I I I I I II I I I I I I I I I I 

Db 1314 ACTACTTATTGCCGCTATGAGTTAGAGTCTATCAGTGTTCTTGTTATAAC 

CCCCTA 1369 

Qy 1113 CAAGGAAATAAAATGTTAGGCTTT — AAAAATCCCTAC 

TTTGTCATATCAGA 1162 



Db 13 7 0 TTTTCAGGGGGTTAAAA — 

AT C AGC T T TAAAAAAAT AC AT AAAAAT T TC ATC T T AAAGC A 142 7 

Qy 1163 

CTATATTCTAAAACTATATTTGAGCGAAACCTGTCATTGCGTCTAATTTCAAATATACAG 12 2 2 



Db 142 8 CTTTCATTTTATACCA- 

AC G T G AAAAG TGCCATTTT T AG AAT AAC T T T AAAGC T T AAC AG 14 86 

Qy 1223 AATCTCC TTAAGAGCTGTTGCCTTA 

TTTTTTTG — 1255 

I I I I I I II I I I I I I I I I I I 

Db 1487 

GTTTCCTTTTAATATCCTTTTTTTGTGTGCTCTTTACTTACACAATGGCTTTGTTTTGCT 15 46 

Qy 12 56 — TAAAGCCTCTCTGACATCAAATGGGGAGAAAT 

GGTGGCAC C 1296 

I I I I I I II I I I I I I II III III 

Db 1547 TTTTCAGCCAC 

ACCCCTTATGTGAACTAGTGCCTTTGGGTATCACGTAAAATTTTT 16 02 

Qy 129 7 TCCA GACACCCTGAAAC 

TACACACCAT 1323 

I I I I I II I I I I I I I I I I I I 

Db 16 03 TCCAAAGGGTTACTTTAAAAATCTGTTAC- 

CACAATTATGAGATGATTTTTAAGTGATAA 1661 



Qy 1324 TTCTTC CCTGCTCAGCT- 

TCTGCTCAGGAGTTCTGTGAG 1361 

I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1662 ATTAAACTTCTTCTTGTATAAATTCTGCCCAGATCTCTCCACAAGAG — 

CTGAGGGTTTC 1719 

Qy 1362 CTATGG GAAGGCCATTGGTTGTATTTGC 

TACTTTTACTTTC 1402 

I I I I I I I I I I I I I I I I I I 

Db 1720 

ATAACTTTATGGCTTAATAAATGTATGACACTGAAAAGATTTGAGTGTGAATCTACTGAA 17 79 
Qy 1403 

ATCTTCCTCTGCTGTAGAGCCATTTAATGTTATTGTCATATGCTGCTGGTGAGGTAAAGG 1462 



Db 1780 ATC 

ACTATAATGCACATTGAAGCTATGATGGTATTTGAGTAGTGAGGTTACTT 1832 

Qy 1463 TGGGTCCGGGTGCCTT CCCAGGGGTT 

AGAGGATGTTCAAAGGGCCGAT 1510 



Db 1833 TTGATCGGAGCAACATAATGCTCATAGAATCTTCTAGAAGAAGAGAAACAAAGG 

GAT 1889 

Qy 1511 TTCAGCAGGAGTTCAGAGGGCTTATGAT GGATGGTGAGAGATT 

TGACA 1558 

I I I I I I I I II I I I I III II I I I I I 

Db 1890 T 

GATAAAATGCTGAGAACTAGTGATTATATATTTTTCTGTATTTACCTGACATTTAT 19 46 

Qy 1559 ACCACCAG — AGCAC 

ATGTGCTCTGACCCTCTCCTGGGCATT 1598 



Db 19 47 TTTAATGTTCAAAAAGTAAACACTTTAAGTTTGATGTGTTTT- 

ACTCTCTCATTGTTTTA 2005 

Qy 1599 GGTTCCTGCTGGTACCGGGCGGTTCAGA — CCTTCA — 

AATAGGTTGCTTTCAAAAGAG- 1653 



Db 2 0 06 AGTAATTGC CAACTCAGAATACATCATTCTTAGGCTG 

AAATTTGT 2050 

Qy 1654 CTTTCAGGCACTTATTGAGAATTAAT-GTTTAAACAGACATAATAGCCT 

AGATGAA 170 8 



Db 2051 CTTTC 

C AT T T T T T AAGG T G AAAT AG T AC T AC C T T ACGTGAT AGC AT AC AAAGAAG AA 210 7 

Qy 1709 CTCCCAAGAGATCTATTAAATCTTGTGGGCTGA 

ATAAATATCTCGTGCAGGACTGT 176 4 



Db 2108 AGCTCTAGAAA 

GAG AAAT T AT GG AG AATGAT TAT T T AAAT T AC AAT T AAGGAAATG A 216 4 



Qy 1765 GCAACAGTAGCCCAGAGCATCC TGCCTGTGGGCAT- 



CCACCTCCCAGGTGAGGGCA 1819 

II I III I I I I I I I I I I I I I I 

I 

Db 2165 GAATATGATCCCCT 

CTTCCGAGTTGCCCACAAACTTGCTTCTTTGCTTTTGCTCCCT 2221 

Qy 182 0 GTGGGAAGCTGGCCCGAC — GGC AGCC AGAAC TTGTTTCT C AC — CTCC- 

CACCAGCA 18 72 

II II III II I I I I I III III III 

Db 2222 GT AAT AG AAC T AC T T T T C A- 

ACAAATCTAATTTTGCACGGCACCGTTAACCATAT 22 75 



Qy 18 73 ACCCCCCAC — CCAACTCTGGGCCCCAGGCACAC 

GAAGCAC 1911 

I I I I I I I I I III II I 

I I I 

Db 2276 

TTTCACTACAGCAAACTTAGTGCTATGGGTTTTCTTTTTCTTTGTTTTTTTCTTGATCAC 2335 

Qy 1912 AAGTCTCAGGGGAC — CATTCCCACATTGGGGGATCCTGAGGGA 

GCCCATCACCGC 1965 

I I I I I I II I I I I I I I I I II I I I I I I 

Db 2336 TTGTAT-AGGAAACAACATTTTC-CAGTGTTATTTGCATATATATTTTGTCCTT- 

CCAA 2391 

Qy 1966 CTCTTGCA T AC AAC T G T C C AC TAG 

GAGGC 199 4 

I I I I I I I I I I III I I I I 

Db 2392 

TATATGCATTACAGATGAAAATTAAATGTTATACCTGAATTCTTGGGTTGGGGCCAAAAT 2 451 

Qy 1995 ACGC CCAGTGTGGG AGAGAT 

GTATGGTCTT 2024 

III I I I I I I I I I I I I I I I 

I 

Db 2452 

ATTAAGCTGAAAATAATGCTGGTGTGGATTTGTTTTAAAACAAAGCTTTATTATGAACAT 2 511 

Qy 2 025 GCCT-TCCACCTGTA-AAAACTGCACATATGCAAGCCATTTGCACTCTGGAAC- 

TGCATG 2081 



Db 2512 

GCATGTGAATCTGGATATTGCCTCTTATTTTTAAGAAAATGGTTCTGTGAAAAGTGAATG 2 5 71 

Qy 2 0 82 CCGTG AAAACTCCTAATGGTGTGGA-ACTT — AGTT TG 

AATT 2120 

II III I I I I I I I III III I I I I II 

I I I 

Db 2572 

ATATGTATTTTTCCAAATGCTTCATGGTTAGGAGTCTTCAAGTTCCATGTTCCCCAGATT 2 631 

Qy 2121 TGAAATC-AC-GCCGCATGCA CAAAGGGA-C 

AGGCCCAGGCCCGA 2162 

I I I I I II I I I I I I I I I I I I I I I I I I 

Db 2 632 TGAGATATACTAAAGAAAGAAATTCAAAAGTAGCTATTTGGGGCCCA — 

C AAAAAT AAC T 26 89 



Qy 2163 
CCCT 2199 



-CCTCAG-GTCATCCGCCCGCTGGCTGCAGA- 



-GCAT — 



1 1 1 1 1 I I I I I II 1 1 1 1 1 1 1 1 1 II 

Db 2690 

ATTATTTTAGCCTTAGAGCCTTACACTTGTTTCATGAAGAGAAAGGACTTGCATAACCAA 2 7 49 

Qy 2200 

GGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTC 2 2 59 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 75 0 AAT AAAC AAAGC AAGAC AAAT T AA AAATATGTGGGG-GAGAGATCAGTGAA — 

AA 2801 

Qy 226 0 GTGCACTTATT-ATGCACCAC CTCCCTT— CAGTCAC 

CAC 2296 

III I I I I I I I I I I I I I I I I III I III 

Db 2802 

GTGGTTTTCTTAATGCAGCCCTGCTGGTCCCCATTAACAATTGCTTGAAATTCACATGGA 2 861 

Qy 229 7 TC CTCTTCCTCCGCCATCCTCATTTA 

TACT GAT TG 2331 

II I I I I III II I I I I I I I I I I 

I I 

Db 2 862 TGTAAAATTATAATTGTCAGGATCTTATTCAG — 

ATGATCTTTTAAGGTTTAACTGGTTT 2919 

Qy 2332 CACACCCCCCGCTC-AAACAACAATGTCCTTATTA — TGATGAC 

CAT 2375 

I I I I I I I I I I I I I I I I II I 

I 

Db 2920 

TGCTTTTGTTTATCTATATGTCAAAATACTTGTAAATTGGGAACAAACTTCTCTCAGCTT 29 79 

Qy 23 76 CTCGTAGTGGT AC AT TCCATTCCTATT T AAGG T AAGC C CAAAG 

CCCACT 2 42 4 

I I I I I I I I II I I I I I I I III II I I I I I 

I I 

Db 29 8 0 CTTGAAGTTGTTCAACTATCCTTGCCACT 

GGAAGACCAAACAAGGTTTTCACTGCT 3 035 

Qy 2425 TTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGC 

GCCTGCGT 2471 

III I I I I I I I I I I I II I I I I III 

Db 3036 

TTTTCTTT T AC AT AAT AT GC T GAG AAT T AT T T C T T ATGC T T T T T AC T AC AAAC AAAAT T A 3 095 

Qy 2472 -TCTTCTGGGTTT GGCTAGATAGGGTTG TGTC 

CCTCTATGGAA 2513 

I I I I I I I I I I I I I I I I III I I I I I I 

I I 

Db 3096 

CTCACCTGGATTAAAGATTAAGGCCTTAATCTGTTTAGATTATCTTTAATCTCCATGAAA 3155 

Qy 2514 TGGAGAGTGATG — TGGGCAAGGGTGTCATTTTCTCG- 

CACAATACAACTCACTGAGGAT 2570 

I I I I II I I I I I I I I I III I I I I I I I I I 

I I 

Db 3156 

TCGTGAAATAAGACAAGAATAGTGTTTCAGCTGTAGGCCATTTTACAGCTAATTGCCCAT 3215 
Qy 2571 

G C T T C T G T AG AAG T GAG AAAC AC GAT GAG T AC AT T C AG AAT T AC AAT AAC T C AC T C T 2 62 7 

I I I I I I I I I II II I I I I I I I I I I I I I I I I I 



Db 3216 AAAT-TGTAGCATTTATTGACCTGA — AGTACTAAGCTAATTGTCTTGACT- 

ACTCAAAG 32 71 



Qy 262 8 -CACTGGGT AAC- 

T T C T C AT GAT AG AT T T G T AT G ATC AAT ACGGGTC T AT T — 2 6 76 

I I I I I I I I I I I I I II I I I I I I II III I 
Db 3272 CCCCTGAATTGTTGTCAACTTTCCCCT — TTGTGTTGTGTAGCCCTAAC — 
GTCATTTAG 3327 

Qy 26 77 -TTTATGTCAACTGAACACTGTAGGGTACCTTCCAGT CTT TTTC 

AAG 2722 

II I I I I II I II I I I I I I I I I I III I I I I 

I 

Db 332 8 CTTGTTGTCTGATGCCTCCAGTAGGACACC- 

TCCGATGGAGCTTTGATTTCTGAGCAGCG 33 86 

Qy 2723 ATTGTTAAATT 

GAG AC AAG T AAT T G AAT AAT TTGTCCTATTTT T A 2767 

I II II II II I I I I I I I I I I I I II 

Db 33 8 7 AAAGCTCCCTTCCTAAGATGCA TCTCGCATAGGCTG- 

CCTATGATGAAGGACCGTGC 3 4 42 

Qy 2 76 8 TTTTAAAAAAAGTGAATGGACTGAAATGTT- 

AAATGTGAATGTACATTTCTTAA 2 82 0 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

I 

Db 3 4 43 ACCTCCACTCCAACAGAGTG-CTGAGTTTAAAAGTTGACCTGTGTTTGTA-ATTTC- 



Qy 2 821 TTGCAATTTTTCTACTGAGTGTTTGCACTATACTTTC- 

TGGAATCTTATTTAACAA 2 8 75 

I I I I II I I I I I I I I I I I I I I I I 



Db 3499 

TTTCATCTTGCTTAATAAATATCTGCTGGATTCTTTCAAAAAAAAAAAAAAAAAAAAAAA 355 8 

Qy 2876 AAATAAAGGGAAAAAATTGCTTGA-CT 2901

I I I I I I I I II I I I I I I I I 
Db 3559 AAAAAAAAGGCCACATGTGCTCGAGCT 3585 
Qy 1 TTGA AATCAGGAA ATCAGGCCGGGCG-CAGTG

GCTCATG— 3 8 

I I I I I I I I I I I I I I I I I I I I I I 

Db 3537 

TTGAAAGAATCCAGCAGATATTTATTAAGCAAGATGAAAGTGAAATTACAAACACAGGTC 3 4 7 8 

Qy 39 — CCTGTAAGCCCAGCACTTTG GGAGG CGG 

AGGCGG — 72 

I I I I I I I I I I I I I I I I I I I I III II 

Db 3477 

AACTTTTAAACTCAGCACTCTGTTGGAGTGGAGGTGCACGGTCCTTCATCATAGGCAGCC 3 418 
Qy 73 GTG-GATCCAT- 

TGAGGTCAGGAGCTCAAGACCAGCCTGGTCAACATGGTGAAACCC 12 7 



Db 3 417 TATGCGAGATGCATCTTAGGAAGGGAGCTTTCG — CTGCTCAGAAATCA 

AAGCTC 3365 

Qy 12 8 CGTC T C T AC T AAAAAT A- 

CAAAAAAAAAATTAGCTGGGCGTGTTGGC 173 

III I I I I I I I I I I I I I I II I I I I I 

Db 336 4 CATCGGAGGTGTCCTACTGGAGGCATCAGACAACAAGCTAAAT- 

GACGTTAGGGCTACAC 33 06 

Qy 174 GGGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAG-GTGG 

GAGAATT-G 22 0 



Db 33 05 AAC AC AAAGGGGAAAGT TG — ACAACAATTCAGG- 

GGC T T T GAG TAG T C AAG AC AAT TAG 32 49 

Qy 221 CTT-GAAC— CCGG GAGGCGGAGGTT 

GCAGTGAGCTGAGATTGCACCAT 266 



Db 32 48 CTTAGTACTTCAGGTCAATAAATGCTACAATTTATGGGCAATTAGCTGTAA- 

AA 3196 

Qy 26 7 TGCACTCCAGCCTGGGCGACAGAGCAAGAC TCTCTC- 

AAAAAAAAAAAAAAAGAAAG 322 



Db 3195 TGGCCTACAG- 

C T G AAAC AC TATTCTTGTCTTATTTCAC GAT T TC ATGGAGAT TAAAGAT 313 7 

Qy 323 AAAGAAATCAGAAAATCGACCACAGTGGTAGCCACCTG — 

GCCTAATGCTGTGTTTTTGT 380 

II I I I I I I I III II I III 

I I 



Db 3136 AATCTAAACAGATTA-AGGCCTTAAT-CTTTAATCCAGGTGAGTAATTTTGT- 

TTGTAGT 3080 



Qy 381 

ACCTGACAGGGGTCACTCATTTTAGGCACAACTCCTTCATTCTTTGTGAAATTAGTGAGT 4 4 0 

I II I I I I I I I I I I II I I I I I I I I I I 

Db 3 0 79 AAAAAGCATAAG-AAATAATT 

CTCAGCATATTATGTAAAAGAAAAAGCAGTGAAA 3026 

Qy 441 TTCCT TCTACCCGTCAC 

CAGATTCAA T 467 

II I I I I I I I I I I I I I I I 

I 

Db 3025 

ACCTTGTTTGGTCTTCCAGTGGCAAGGATAGTTGAACAACTTCAAGAAGCTGAGAGAAGT 2 966 

Qy 468 ATGTTC 

TATTAATACACCGATAACCACAGGGGAAGGGCACTTGTCG 513 

I I I I I I I I I I I I I I I I I I I III I I I I 
Db 2965 TTGTTCCCAATTTACAAGTATTTTGACATATAGATAAACA- 
AAAGCAAAACCAGTTAAAC 29 0 7 

Qy 514 CTCTCC CACCTGGTTA CC— ACAGT 

CTCCATGGGTCTTT — 550 

II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2906 

CTTAAAAGATCATCTGAATAAGATCCTGACAATTATAATTTTACATCCATGTGAATTTCA 2 8 4 7 

Qy 551 TGCCGT GACCA-CA 

AATAAAGGAAACACTCATCACTAGTATC- 591 

II I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2846 

AGCAATTGTTAATGGGGACCAGCAGGGCTGCATTAAGAAAACCACTTTTCACTGATCTCT 2 7 8 7 

Qy 592 TAAGTCGGGC 

TTTACAGTAACTATGCACCTTCTGTGTGC 63 0 

I Ml II I I I I I I I I I I I I I I I I 

Db 2786 

CCCCCACATATTTTTAATTTGTCTTGCTTTGTTTATTTTGGTTATGCAAGTCCTTTCT-C 2 72 8 



Qy 631 TTCACCTCACT CTCTACTTCAAACA 

GCCCATGG 663 

I I I I II I I I I I I I I I 

Db 2727 

TTCATGAAACAAGTGTAAGGCTCTAAGGCTAAAATAATAGTTATTTTTGTGGGCCCC 2 6 71 

Qy 66 4 AGGGAGGTATTATT- 

ATACTCCTTATGTTGACAGTGAAGAATCTGAGGCCCAGAGAGGTT 72 2 



Db 2670 

AAATAGCTACTTTTGAATTTCTTTCTTTAGTATATCTCAAATCTGGGGAACATGGAACTT 2 611 

Qy 723 GGGGAC TTGAGTAAAGTCA 

CACAGCCC— 7 49 

I I I I I I I I I I I I II 

Db 2610 

GAAGACTCCTAACCATGAAGCATTTGGAAAAATACATATCATTCACTTTTCACAGAACCA 2 551 



Qy 75 0 TGAGAGGCAGGACCAGGGTTC- 

CATTCCTGCTC 781 

I I I I I I I I II I I I I I I I I I I I I 

Db 2550 

TTTTCTTAAAAATAAGAGGCAATATCCAGATTCACATGCATGTTCATAATAAAGCTTTGT 2 491 

Qy 782 TATCCAGTTCCAAGCC — CTTGTGTTTTC — 

CATTATGTTTAGTGCCTCTTTGC 831 

I II I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 2 49 0 T T T AAAAC AAAT C C AC AC C AGC AT - T AT T T T C AGC T T AAT AT T T T G - 

GCCCCAACCCAAG 2 433 

Qy 832 TAACAGCAA CATCTGCAAGATTTGTGTTGG 

TTTTG 866 

I I I I I I I I I I I I I I I I I I I I 

Db 2432 

AATTCAGGTATAACATTTAATTTTCATCTGTAATGCATATATTGGAAGGACAAAATATAT 23 73 

Qy 86 7 AT G — G AG AAC T C T AGC T C A TCCACATGCTAGTGCCCAAG 

TGGTG 909 

III I III II I I II I I I I I I I I I I I 

Db 2372 

ATGCAAATAACACTGGAAAATGTTGTTTCCTATACAAGTGATCAAGAAAAAAACAAAGAA 2313 

Qy 910 GAGGGGCCACCTCAGCAGGTGGGTT-CTG-AATGCAGCCAAGG CTGT- 

CCCCGCAA 962 



Db 2312 

AAAGAAAACCCATAGCACTAAGTTTGCTGTAGTGAAAATATGGTTAACGGTGCCGTGCAA 2253 

Qy 963 TGGGTGAGAC-TCGCTCCA — ACTGCCCGC 

CCTCAGAGCAGGTGCCT 10 06 



Db 2 2 52 AATTAGATTTGTTGAAAAGTAGTTCTATTACAGGGAGCAAAAGCAAAGAAGCAAGT 

TT 2195 

Qy 1007 AAGTCCTCCCTGGCACTGGCAGGCCTTACCTCA CATTGC 

T AAAT T AAAGC A — A 1058 



Db 2194 

GTGGGCAACTCGGAAGAGGGGATCATATTCTCATTTCCTTAATTGTAATTTAAATAATCA 2135 
Qy 1059 TGC 

AATTCCTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAGC 1113 



Db 213 4 TTCTCCATAATTTCTCTT — TCTAGAG CTTTCTTCTTTGTATGCT-ATCAC 

GT 2085 

Qy 1114 AAGGAAATAAAATGTTAGGCTTTAAAAATCCCTACTTTGTC — 

ATATCAGACTA 1165 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 2084 AAGGTAGTACTAT-TTCACCTTAAAAAATGGAAA 

G AC AAAT T T C AGC C T AAG AAT G 2 03 0 



Qy 1166 TATTC- 



TAAAACTATATTTGAGCGAAACCTGTCA 1198 

I I I I I I I I I I I I I III I I I I II 

Db 2029 

ATGTATTCTGAGTTGGCAATTACTTAAAACAATGAGAGAGTAAAACACATCAAACTTAAA 19 7 0 

Qy 1199 TTGCGTCT AATTTCAAAT— ATACAG- 

AATCTCCTTAAGAGC 1237 



Db 1969 

G T G T T T AC x T T T T G AAC AT T AAAAT AAATGTC AGGT AAAT AC AGAAAAAT AT AT AAT C AC 1910 

Qy 1238 T-GTTGCCTTATTTTTTTGTAAAGCC 

TCTCTGACATCAAATGGGGAGAAATGG 12 89 



Db 1909 TAGTT — CTCAGCATTTTATCAATCCCTTTGTTTCTCT-TCTTC- 

TAG 1866 

Qy 129 0 T GGC AC C T C C AG AC AC C C T G AAAC T AC AC AC C A 

TTTCTTCCCTGCTCAGCTTCTGCT 1346 



Db 1865 AAGATTCTATGAGCATTATGTTGCTCCGATCAAAAGTAACCTCACTACTCAAATACC- 

AT 1807 

Qy 13 47 CAGGAGTTCTGTGAGC — TATGGGAAGGCCATTGGTTGTATTTGCT- 

ACTTTTACTTTCA 1403 



Db 18 06 CATAGCTT C AAT G T GC AT TAT AG T GAT T T C AG TAG 

ATTCACACTCAAATCTTTTCA 1751 

Qy 140 4 TCTTCCT-CTGCTGTAGAGCCATTTAATGTTATTGTCATATGCTGC 

TGGTGAGGTAA 1459 

I I I I II I I I I I I II I II I I I I III 

Db 175 0 GTGTCATACATTTATTAAGCCATAAAGTTATGAAACCCTCAGCT- 

CTTGTGGAGAGATCT 1692 

Qy 146 0 AGGTGGG TCCGGGTGC CTTCCCA- 

GGG 1485 



Db 1691 

GGGCAGAATTTATACAAGAAGAAGTTTAATTTATCACTTAAAAATCATCTCATAATTGTG 1632 

Qy 1486 GTTAGAGGATGTTCAAAGGGCC 

GATTTCAGCAGGAGTTCAGAGGGCTTATG 1536 



Db 1631 

GTAACAGATTTTTAAAGTAACCCTTTGGAAAAAATTTTACGTGATACCCAAAGGCACTAG 15 72 

Qy 153 7 AT G G AT GG T GAG AG AT T T G AC AAC C — ACCAGAGCACATGTGCTC- 

TGACCCTCTCC 1590 



Db 1571 

TTCACATAAGGGGTGTGGCTGAAAAAGCAAAACAAAGCCATTGTGTAAGTAAAGAGCACA 1512 



Qy 1591 TGGGCATTGGTTCCTGCT — GGTACCGG GCGGTTCAGACCTTC — 

AAATAGGTTGC 16 42 

I I I I I I I I I I I I I I I I I I I I 

Db 1511 

CAAAAAAAGGATATTAAAAGGAAACCTGTTAAGCTTTAAAGTTATTCTAAAAATGGCACT 1452 

Qy 16 43 TTTCA AAAGAGC T T T C AG — 

GCACTTATTGAGAATTAATGTTTAA 16 85 



Db 1451 TTTCACGTTGGTATAAAATGAAAGTGCTTTAAGATGAAATTTTTATGTATTTTT- 

TTAAA 1393 

Qy 16 86 ACAGACATAATAGCCTAGATGAACTCCCAAGAGATC 

TATTAAATCTTGTGGGCTGAA 1742 



Db 1392 GCTGATTT 

TTAACCCCCTGAAAATAGGGGGTTATAACAAGAACACTGAT 13 4 4 

Qy 1743 TAAATAT — CTCGT- 

GCAGGACTGTGCAACAGTAGCCCAGAGCATCCTGCCTGTGGGCAT 1799 



Db 13 43 AGACTCTAACTCATAGCGGCAATAAGTAGTGGATGTCACTAAAATGCT — 

CTGTGTGTGT 1286 

Qy 1800 C— CACCTCCCAG-GTGAGGGCAGTGGGAAG-CTGGC CCG 

ACGGCAG 18 42 



Db 1285 

CTTTATTTTTTAGTATTTTTAATGTTGATAGACTTGCTTTATCTATCTGTGTATCATTAG 1226 

Qy 18 43 CCAGAACTTGTTTCTCACCTC-CCACCAGCAACCCCCCACCCAACT 

CTGGGCC 1894 

III I I I I I I I I II I I I I I I I III I 

Db 1225 TGAGAT- 

TTGCTTTTCAATTCTTTACTAGAAAGTTACAGCTCATTTAAAATAGCTGG 117 0 

Qy 1895 CCAGGCACACGAAGCACAAGTCT-CAG- 

GGGACCATTCCCACATTGGGGGATCCTGAGGG 1952 



Db 1169 - T AAAT AC AG AAAGC AT AAAT AT AC AGT AAGT T T AAAC AC T GAT T G 

TACTAAATG 1116 

Qy 1953 AGCC CATCACCGCCTCTTGCATACAACTG — TCCACTAGGAGGC 

ACGCCCAGT 2 0 03 



Db 1115 

CAACAATACAAAGAAGCAAATGGAACACAAATGGTAACACAATAAAACTGTATTACATTT 1056 
Qy 2004 GTGGGAGAGATGT — 

ATGGTCTTGCCTTCCACCTGTAAAAACTGCACATATGCAAGCCAT 2 061 



Db 1055 G T GC T T C AAAT AT T AC AAT AC AT T AT AT AC AAT AGTCC AAAAT AAAC AT C T - 

CATGCCAA 99 7 



Qy 2 062 TTGCACTCTGGAAC TGCATGCCGTGAAAACT — 

CCTAATGGTGTGGAACTTAGTT- 2114 



Db 996 GTG-ACACAAAAACGAATAGCAAGCAAAAAAATCCAGCCATAT- 

GTACATTACTTTTTTC 939 

Qy 2115 TGAATTTGAAATCACGCCGC-ATGCACAAAGG GACAGGCCCAGGC 

CCG 2161 



Db 938 

TTTAATAATAAACATTCAACTCTGAACTGGGGCAATTTCACTACATACAGATGCAGTCCG 8 79 

Qy 2162 ACCT CAGGT CATCC-GCC 

CGCTGGCTGCAGAGCATCCCT 2199 



Db 8 78 CCTTTTATTCCATATAACCATCCTGCCTTCCTATATCTACGCT — 

ATTCAGTAGGTTCCT 821 

Qy 22 0 0 GGGAGCCAAGGCGAG GCCCGTGGAGC — 

CTGAGCTTTGTGTAGCTCGAGCTTTGT — 2 2 52 

I I I I I I I I I I I I I I III III 

Db 820 — 

GTGTCAATTCTTATGTACATGTGGAGCAAAAAAGATAAAAGAATGTAGTGCTCTGTAT 763 

Qy 2253 GTAGCTC- 

GTGCACTTATTATGCACCACCTCCCTTCAGTCA 2292 



Db 762 

TCTTAATGGGGTGATGTTGATAGATCAGCGAACCTGGGGTGCATAACCTTCTTTCATTAA 7 03 

Qy 2293 CCACTCCTCTTCCTCCGCCATCC TCAT-TTATACTGATTGCACAC- 

CCCC 2340 



Db 702 

ACCTTCCTCGTAGTCCGTCTGGCAAAGGATCATGTTATTCTTTAGGAAAAATTTGTCTCC 6 43 

Qy 23 41 C GC T C AAAC AAC AAT G T CC T TAT TAT GAT G AC CATC 

ACAT- 239 0 



Db 6 42 AACACAAA-ATCTCTG — 

ATTACAAAGCTGACATGCAAAGCAGTCCAGGTGGTAAACATT 5 86 

Qy 2391 -TCCAT TCCTATTTAAGGTAAGCCCAAAGCCCACTTTTGG- 

ATTT 2433 



Db 585 

GTCCTTGGCACGCATCACCATCTCAAAGGCAGGGATGAGCTTACTACAGGCAGCGCAGTT 52 6 

Qy 2 43 4 TCTCGACTGTCCGAGAA 

AAGTTGTGTAAGCGCCTGCGTTCTTCTGGGTTTGGC — 2 486 

I I I I I I I I I I I I I I I I I I I 
Db 52 5 TCCCGTTACACCAAAGAGCCTCAGATAGTCTCTGCGACAAAG 



-TCGTAGTGGT — 
I I I I I I I 



GAT AAGAT T AGC T T 470 



Qy 2 48 7 TAG ATAGGGTTGTGTCCCTCTATGGAATG-GAGAGTGA-TGTGGGCA- 

AGGG 2 535 



Db 469 

TAGTGTACAGGGTGGAGCCCACCTCCCCCAAGCGACAGTCACAGCAGGCACACTTCAGGC 410 
Qy 2536 

TGTCATTTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAACACGA 2 595 



Db 409 AGTC TTCATGC-CAGTATTTGTC-CAG TGC — CTTTAGA AGATAC- 

CGG 366 

Qy 2 596 T— GAGTACATTCAGAATTACA ATAACTCACTCTC-ACTGGGTAACT-TCT — 

CATG 2646 



Db 365 TCCTTGATCTTTCGG — 

TTGCAGCCAGCACAACCTTTCGGCTTGGTGTCTGGCTGGACTG 3 0 8 

Qy 26 47 ATAG-ATTTGTATGATCAATACGGGTCT- 

ATTTTTATGTCAACTGAACACTGTAGGGTAC 2704 

I I I I I I I I I I I I I I I II III I I I I I I I 
Db 307 AGAGCATTTGTAT-ACCAAAACTTTTCTCCTTTTTCTG- 

CATGAGTTGACTTAGGCGTGT 250 

Qy 2 705 C TTCCAGTCTTTTTCAAGATTGTTAAATTGAGACA- 

AGTAATTGAATAATTT 2 755 



Db 2 49 CTGAGTTGCAG-CAGCTTCGC- 

ATTGAGCACCAAACCCAAAGGTAGAAGTAGAAGGGGGT 192 

Qy 2 756 GTCCTATTTTTATTTTAA 

AAAAAGTGAATGGACTGAAATGTTAAATGTGAATGT 28 09 

I I I I III I I I I I III I I I I I 

Db 191 CTCCTTGATTTCGCTTAAGTGTGGACCTGGTGCGCAGCCTACACCG 

CCGAGGACCG 136 



Qy 2810 ACATTTCTTAATTGCAATTTTTCTA 

CTGAGTGTTTGCA 2847 

II I I I I I I I I I II I I I I I 

Db 135 ACTATTGTGAA — 

GCCACTTTGGGAGCGGGTCGGAGTGGCGGCAGGGGGTGGGGGAAGGG 7 8 



Qy 2 8 48 CTATACTT-TCTGGAATCTTATT- 

TAACAAAAATAAAGGGAAAAA — A 2 891 



ATGAGGACGGCCAGACAAGACAGGGCGCACACACGGAGCCCCTCGCAGTGTGCAAAATGA 1 8 



Qy 2892 TTGCTTGAC 2900
III I 
Db 17 TGGCGAATC 9
Qy 2 TGAAATCAGGAAATCAGGCCGGGCGCAGTGGCTCATGC-CTGTAAGCCCA

GCAC 54 



Db 2 TTACTTCTGCTCTAAAAGCT — 

GCGGAATTCCTCGAGCACTGTTGGCCTACTGGGTGTAC 59 

Qy 55 TTTGGG AGGCGGAGGCGGGTGGATCC — ATT GAGGTCAGGAGCTC 

AAGAC 102 

I I I I II III I I I I I III III I I I I I 

I I I I 

Db 6 0 

TGTGGAGACTGTCAAAGTCTCCCGGAGCCCAATTTCCGGAAGCGGTGAGTTCTGAAAGAA 119 
Qy 103 

CAGCCTGGTCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCT- 161 



Db 12 0 GTTCCTG— CACCGTAGT TTCCCAAGTCTGC 

GAATCCCCAACCATGAGCGCCTC 171 

Qy 162 GGGCGTGTTGGCGGGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAG- 

AATTG 220 



Db 172 GGGCGTACTG TCCTTTA — CCCA 

GCAAGGATGGGAGCAGGTG 211 

Qy 221 

CTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTG 2 8 0 

III III III I I I I I I I III I I I I I I 



1 1 1 1 

Db 212 CTGGCCAAAGTGAAACGG GCTGTGGTT-TACC — TGGACGCC- 

GCCT- 254 

Qy 281 GGCGAC- 

AGAGCAAGAC T C T C T C AAAAAAAAAAAAAAAGAAAGAAAGAAATC AGAAAAT C 339 

I I I I I I I I I I I I I II 

I I I 

Db 2 55 -GCGCCGAGAGCCTG CACT 

GGGGCTGCGGATC 2 85 

Qy 340 

GACCACAGTGGTAGCCACCTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCA 399 

III I I Mill I I I I I I I I I I I I I 

Db 2 86 CACC CGTCTCCTGG AGGCGGTG 

GGGGGCCCTGA 318 

Qy 40 0 TTTTAGGCACAACTCCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTA- 

CCCGTCACC 458 

II I I I I I I I I I I I I I I I I I I 

Db 319 CTGT CACCTGC GAGAG 

TTCGAGCCCGACG — 3 47 

Qy 459 

AG AT T C AAT AT G T T C T AT T AAT AC ACCGAT AACC AC AGGGGAAGGGC AC TTGTCGCTCTC 518 

I I I I II I III I I I I I I I I I II I 

I I 

Db 3 48 CAATTGGTGGTG GAGCCAAGCAGCCCA- 

AGGCAGTGTTTGTGCTGAGCTG 396 

Qy 519 

CCACCTGGTTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAGGAAACACT 57 8 

II III III I I I I I I I I I I I I II I II 

I I I I 

Db 397 CCTGCTGAAAGGC-CGG--ACCGTGGAGATCCT- 

ACGGGACATCATCTGCCGCAGTCACT 452 

Qy 5 79 CATCACTAGTATC- 

TAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTGTGCTTCACCT 63 7 

I I I I I I I I III I I I I I II I III I 

I I I I I 

Db 453 TCCAGTATTGTGTGGTGGTCACAACCGTGA GCCAC-GCTG 

TCCACCT 49 8 

Qy 63 8 CACT-CTCTACTT CAAACAGC CC 

ATGGAGGGAGGT AT TAT TAT AC T 6 82 

III II II I I I I I II I I I I I I I I 

Db 499 CACAGCTAATCATGTCCCAGCGGCGGCAGCGGCCGAGATGGAGGGG 



Qy 6 83 CCTTATGTT G AC AG T G AAG AAT C T G AGGC C C AG AG AG G T T G G G - 

GACTTGAGTA 735 

I I I I I I I I I I I I II I I II I I I I I I I I I I I 

Db 55 0 GCCGGTGTTCGAGCAGCTGGAGGAGAAGCTGTG 

TGAATGGATGGGCAACATGAACT 6 05 



Qy 736 

AAGTCACACAGCCCTGAGAGGCAGGACCAGGGTTCCATTCCTGCTCTATCCAGTTCCAAG 795 



I I I I III I I I I I Ml I I I I I I I I I I II 

Db 6 06 ACACGGCC GAGG-TGTTCCA- 

TGTCCCGTTATTGCTTGCCCCTGTTGCTCC 654 

Qy 796 CCCTTGTGTTTTCCATTATG- 

TTTAGTGCCTCTTTGCTAACAGCAACATCTGCAAGATTT 854 

II III II I I I I I I I I I I I I I I I I I I I 

Db 655 

CCACTTTGCCTTGACTCCAGCTTTTGCATCCCTTTTCCCACTGCTACCCCAG 7 06 

Qy 855 

GTGTTGGTTTTGATGGAGAACTCTAGCTCATCCACATGCTAGTGCCCAAGTGGTGGAGGG 914 
I I I I I I I I I II I I I I I I I 



CTTAATAGCGCCC G 732 

Qy 915 

GCCACCTCAGCAGGTGGGTTCTGAATGCAGCCAAGGCTGTCCCCGCAATGGGTGAGACTC 9 7 4 

II I I I I I III I I I I I I I I I I I I 

Db 733 ACCG GACAAGAGGAAGCTGG — GAAGCC 

TGGGTGATGTGG 770 

Qy 9 75 GCTCCAACTGC-CCGCCCTCAGAGCAGGTGCCTAAGTCC 

TCCCTGGCACTGGCAGGC 103 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 

I I I I I 

Db 771 ACTCC-ACTACGCTAACCCCAGAGCTGCTGCTGCAGATCAGATGCCTAG 

TGTCAGGC 826 

Qy 1031 CTTACCTCACATTGCTAAATTAAAGCAATGC AATTCCT 

CTTG 1072 

III III II II I I I I I I I I I I 



CTCAGTTCTCTGTGTGAACATTTAGGAGTACGGGAGGAGTGTTTTGCTGTAGGTTCCTTA 8 86 

Qy 10 73 GGTAAGAGGAATTCCTC-CTTCTTTACTAACTGAT CCCCAGCA 

AGGAAAT 1121 

I I I I I I I I I I I II I II I I I I I I I 

Db 887 AGTCAGG 

TCATCGCTGCGGATCTGGCCAATTATGCCCCTGCAAAGAACAGGAAGA 9 41 



Qy 1122 AAAATGTT — 

AGGCTT TAAAAAT CCCTACTTTGT C AT ATC AGAC TAT AT T C TAAAAC TAT 1179 

I I I I I I I I I I II I I I I III I I I I I I 

Db 9 42 AGACTGCTGCAGGC 

AGGGCATCAGTGGTTTTTGTGGACAGAACCCTGGATCTC- 99 4 

Qy 1180 ATTTGAGC- 

GAAACCTGTCATTGCGTCTAATTTCAAATATACAGAATCTCCTTAAGAGCT 123 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 995 ACAGGAGCAGTTGGACATCATGGAGACAACTT AGT AG AG AAG AT CAT T T C - 

AGCA 1048 

Qy 1239 

GTTGCCTTATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGAAATGGTGGCACCTC 12 9 8 

I I I I I I I I I I I I I I I I I I I I I I II 



Db 1049 CTTCCC- 

TAACATG 1093 



-CAG-CTCCCAGGCCACACAAATGATGTGATGGT- 



Qy 1299 

CAGACACCCTGAAACTACACACCATTTCTTCCCTGCTCAGCTTCTGCTCAGGAGTTCTGT 135 8 



Db 109 4 ATAGCGCTC ACTGCACTCCA 

TACTGAGGAGGAAAATTAT 1132 

Qy 1359 

GAGCTATGGGAAGGCCATTGGTTGTATTTGCTACTTTTACTTTCATCTTCCTCTGCTGTA 1418 



Db 1133 AA — TGTGGTTGCACCA — GGCTGTCTTT-CACAATTCAGTGACA CCACAGC— 

CA 1181 

Qy 1419 GAGCCATTT AATGTTATTG-TCA-TATGCTGCTGGTGAGGTAAAGGTGG — 

GTCCG 1470 



Db 1182 AAGCCCTATGGGAAGCTTTACTGAACACTAAGC — 

ACAAAGAGGCAGTGATGGAAGTTC- 123 8 

Qy 1471 

GGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTTCAGAGGG 153 0 

I I I I I I I I I I I I I I I I I I III III 

Db 1239 GGAGACATC TAGTGGAAG — CGGCAAGCAGAGAAAACCTGCCAATCA- 

AG— 1285 

Qy 1531 C T T AT G AT GG AT GG T GAG AG AT T T G AC AAC C AC C AG 

GC 1577 



Db 12 86 ATGA-GTATGGGGAGA GTCA — 

CACCGGGACAGCTCATGTCCTATATTCAGC 1334 

Qy 1578 TCT GACC CTCTCCTGGG-CATTGG- 

TTCCTGCTGGTACCGG 1616 



Db 1335 

TCTTCAAGAACAACCTCAAAGCTCTAATGAATCATTGTGGCCTCCTCCAGCTTGGACTGG 139 4 

Qy 1617 — GCGGTTCAGACCTT CAAATAGGTTGCTTTCAAAAGAG 

CTTTCAGGCAC 166 4 

I I I I I I I I I I III III III II I I I I I 

Db 1395 CCACAGCT C AAAC G T T G AAAC AC C C AC AG AC T G C 

CAAGTGGGACAACTTTCTGGC — 1449 

Qy 1665 T T AT T G AG AAT T AAT G T T T AAAC AGAC AT AAT AGCC T AG AT G A- AC T C C C AAG — 

AGATC 1721 



Db 1450 -TTTTGA AAGGCT 

CCTTCTTCAGAGCATTGGGGAGTCAGCAATGTCCGTTG 1499 

Qy 1722 

TATTAAATCTTGTGGGCTGAATAAATATCTCGTGCAGGACTGTGCAACAGTAGCCCAGAG 17 81 
I I I I I I I I I I I I III I I I I I II I I I I 



AGCACATGT- 

III I I I I I 



Db 150 0 TGTTAAATC AGCTG- 

CTGTAACCCAGAG 15 42 



-CT-GCCCATGATTAAGC — 



Qy 1782 CATCCTGCCTGTGGGCATCCACCTCCCAGGTGAGGGCAGTGGGA- 

AGCTGGCCCGACGGC 18 40 

I I I I I I I I I I I I I I I I 

Db 15 43 -AACCAAC— GAGGACTACAGCC 

CTGAGGAACTGCTGATCCTTC 1583 



Qy 18 41 AGCCAGAACTTGTTTCTCACCTCCCACCAGCAACCCCCC 

ACCCAACTCTGGGCCC 1895 

III I I I I II III I I I I I 
Db 158 4 — TCATA 

TATATTTATTCTGTCACTGGAGAGCTCACGGTAGACAAAGACCTGTG 1635 



Qy 1896 

CAGGCACACGAAGCACAAGTCTCAGGGGACCATTCCCACATTGGGGGATCCTGAGGGAGC 1955 

I I I I I I I I I I I I I I I I I I I I III II I I I I I 

Db 1636 -TGAAGCA-GAAGAAAAAG — TCAAGAAAGCATTGGCTCA 

GGTCTTCTGTGAGGAA 16 8 7 

Qy 1956 CC ATCACCGCCTCT— 

TGCATACAACTGTCCACTAGGAGGCACGCCCAGTGTGGGAG 2 010 

I II I I I I I I I I I I I I I I I I I I I II II 

Db 16 8 8 TCTGGATTGTCACCTTTGCTGCAAAAAATTACGGACTGGGA — 
CTCTTCAA 1736 

Qy 2011 AGATGTATGGTCTTGCCTTCCACCTGT-AAAAACTGCACATATGCAAGCCATTTG- 

CA 2066 

I III I I I II I I I I I I I I I I I II III 

I I 

Db 173 7 TTAATCTGACATTTCACAAATCCAAAATTGC- 

CGTGGATGAACTCTTTACTTCA 1789 



Qy 2 06 7 C-TCTGGA-ACTGCATGC- 

CGTGAAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGA 2123 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I I 

Db 179 0 CTTCGGGATATTGCTGGAGCTCGGAGTCTCC TGAAAC — 

AGTTTAAGTCTGT 1839 



Qy 212 4 AATCACGCCGCATGCACAAAGGGA- 

CAGGCCCAGGCCCGACCTCAGGTCATCCGCCCGCT 2182 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 18 40 AT ATGTTCCTGGAAATCATACCCA — CCAGGCATC TTATAAGCC 



Qy 2183 

GGCTGCAGAGCATCCCTGGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGTGTAGCT 2 2 42 

I I I I I I III I I I I I I I I I I I I 

Db 1884 TGTTGAAGCAAGT- 

TGTGGAGGAAATATTTCATCCCGAGAGGCCAGA 192 9 



Qy 2243 

CGAGCTTTGTGTAGCTCGTGCACTTATTATGCACCACCTCCCTTCAGTCACCACTCCTCT 23 02 
II II III I I I I I I I I I I I I 

I I I 

Db 1930 



TTCCGTTGATATTGAACACATGTCTTCAGGCCTCACTGATCT 19 71 



Qy 2303 TCCT 

CCGCCATCCTCATTTATACTGATTGCACACCCCCCGCTCAAACAACAATGTC 235 8 

II II I I I I I I I I I I I I I I I I I 

I I 

Db 1972 CCTTAAAACTGGATTTAGCATGT-TCATGAAGGTGAGCCGGC — CTCA 

TC 2018 

Qy 2359 

CTTATTATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAAGGTAAGCCCAAAG 2 418 

I I III III III I I I I I I I I I I I I I 

Db 2 019 C TAGTGACTACCCCCTC 

CTGATCCTCTTTGTGGTAGG 2055 

Qy 2 419 CCCACTTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGC- 

GCCTGCGTTCTTCT 2477 

III II I I I I I III I I I I I II I I I I I 

I 

Db 2 056 TGGGG TC-ACAGTCTCTGAA 

GTGAAAATGGTCAAAGATCTTGT 2 09 7 

Qy 2478 

GGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGGGTG 2 53 7 

I I I I I I I I I I I I I I I I I I 

Db 2 09 8 GGCAT C GTTGAAGCC AGGAAC 

CCAGGTAA 2126 

Qy 253 8 TCATTTTC T CGC AC AAT AC AAC T 

CACTGAGGATGCTTCTG T AG AAG T G AG AAAC A 2592 

III I I I I I I I II II I I I I I I I I I I I I I 

I I I 

Db 2127 TCGTGCTGTC- 

CACACGACTCCTGAAGCCACTTAACATTCCTGAGCTGTTATTTGCAACT 2185 

Qy 2 593 C G AT GAG T AC AT T C AG AAT T AC AAT AACTCACTCTCACTGGGTAA 

CTTCTCAT 2645 

I I I I I I I I I I I I II I I I I I I I I II 

I I I I 

Db 2186 

GACCGACTGCATCCAGACCTTGGCTTCTGAGCATCCGCTAAGAAGATAAGACCTACTCAA 2 2 45 

Qy 2646 

GATAGATTTGTATGATCAATACGGGTCTATTTTTATGTCAACTGAACACTGTAGGGTAC- 2 7 0 4 

I I I I I I I I I I I I I I I I I I I I III I I 

I 

Db 2246 GCTGGAAATG CCGATGC AATTTTCTGCCACC 

ACTCCAAATACTCCT 2291 

Qy 2705 CTTCCA— 

GTCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCC 2 759 

I III III I I II I I I I I I I I I I I I I I I I III 

I I 

Db 22 92 CCACAACCAGCGTCCCTGTCAC TAATTGCGAGAA-TGATGGAAT — TCTG- 

CC 2340 

Qy 2 76 0 TATTTTTATTTTAAAAAAAGTGAATGGAC- 

TGAAATGTTAAATGTGAATGTACATTTCTT 2818 

I I I I I I I I I I I I I I I I I I I I I I 

I I I 



Db 23 41 TGAAGGGTCTTGATACCTACTCAGTGAG- 

GTACTTTGCTT 2379 

Qy 2819 - 

AATTGCAATTTTTCTACTGAGTGTTTGCACTATACTTTCTGGAATCTTATTTAACAAAA 2 8 7 7 
I I I I I I I I I I II 

I I I I 

Db 23 8 0 GGATTGCTGTGATTCT 

TAAAAAAAAAAAA 2408 

Qy 2878 ATAAAGGGAAAAAATTGCTTGA-CT 2901

I I I I I I II I I I I I I I I 
Db 2409 AAAAAAGGCCACATGTGCTCGAGCT 2433
Qy 8 CAGGAAATCAGGCCGGGCGCA-GTGGC TCATGCCTGTAAG

CCCAGCAC 5 4 



Db 2437 CTG 

CAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTAAGAATCACAGCAA 23 83 

Qy 55 TTTGGGAGGCGGAGGCGGGTGGATC- 

CATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCA 113 



Db 23 82 TCC AAGCAAA GTACCTCACTGAG— TAGGT-ATCAAGACC- 

CT 2344 



Qy 114 

ACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGC 173 



1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 23 43 TCA-GGCAGAATTCCATC 

ATTCTC 2321 

Qy 174 

GGGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTGCTTGAACCCGGGA 233 

II I I I I III I I I I I I I I I I I I I I I I 

Db 232 0 GCAATTAGTGACAGGGACGCTGGTTGT 

GGAGGAGTATTTGGA 2 2 79 

Qy 23 4 GGCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCA- 

CTCCAGCCTGGG 2 82 

I I I I I I I I I I I I I I I I III I I I I I I I I I 

Db 22 78 GTGGTGGCAG AAAATTGCATCG — 

GCATTTCCAGCTTGAGTAGGTCTTAT 2231 

Qy 283 

C G AC AG AGC AAG AC T C T C T C AAAAAAAAAAAAAAAGAAAGAAAG AAAT C AG AAAAT C G AC 3 42 

I I I I I I I I I I III III I I I I I I I I I I 

Db 2230 CTTCTTAGC-GGATGCTC 

AGAAGCCAAGGTCTGGATGCAGTCGGTCAGTTG — 2181 

Qy 3 43 CACAGTGGTAGCCAC- 

CTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCATT 4 01 

III III I I I I I I I I II II II I I I I I I I 

I 

Db 2180 CAAA TAACAGCTCAGG AATG TTAAGTGGCT- 

TCAGGAGTCG T 2140 

Qy 402 TTAG GCACAACT-CCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTAC- 

CCGTCA 456 

II I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2139 GTGGACAGCACGATTACCTGGGTTCCTGG 

CTTCAACGATGCCA 2 09 7 

Qy 457 CCAGATTCAATATGTTCTATTAATACACCGATAACCACAGGGGAAGGGCACT- 

TGTCGCT 515 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 096 CAAGAT CTTTGACCATTT T C AC T T C AG AG 

ACTGTGAC 2 06 0 

Qy 516 CTCCCACCTGGTTACCACA GTCTCCATGGGTCTTTTGCCGTGACCACAA — 

ATA 56 7 

I I I I I I I I I I I I I II I I I I II I I I I 

I I 

Db 2 059 — CCCACC TACCACAAAGAGGATCAGGAGGGG 

GTAGTCACTAGGATG 2 015 

Qy 56 8 AAG — GAAACACTCATCACTAGTAT-CTAAGTCGGGCTTTACAGT- 

AACTATGCACCTTC 623 

II I I I I I I I I I I I I I I I I I I I I I I I II II I 

I 

Db 2014 AGGCCGGCTCAC- 

CTTCATGAACATGCTAAATCCAGTTTTAAGGAGATCAGTGAGGC— C 195 8 

Qy 624 

TGTGTGCTTCACCTCACTCTCTACTTCAAACAGCCCATGGAGGGAGGTATTATTATACTC 6 83 

II II I I I I I I I I I III I I I I I I I I I I I I 



Db 1957 TGAAGACATGTGTTCAATATCAACGGAATCTGGCCTCT — CGGGATGAAATATT- 

TCCTC 1901 



Qy 684 CTTATGTTG 

ACAGTGAAGAATCTGAGGCCCAGAGAGGTTGGGGACTTGAGTAAAG 73 8 

I I III I I I I I II I I I I I III II I 

Db 1900 

CACAACTTGCTTCAACAATGGCTTATAAGATGCCTGGTGGGTATGATTTCCAGGAACATA 18 41 

Qy 739 T C AC AC AGC C C T G AGAGGC AGGACC AGGG T TCCATTCC- 

TGCTCTATCCAGTTCCAAGCC 79 7 



Db 18 40 T ACAGACTT AAACTGTTTCAGGAGAC TCCGAGCTCCAGCAATATC- 

CC 1794 

Qy 79 8 CTTGTGTTTTCCATTATGTTTAGTG — 

CCTCTTTGCTAACAGCAACATCTGCAAGATTTG 855 



Db 1793 GAAGTG AAGTAAAGAGTTCATC CACGGCAA-TTTTG 

GATTTG 1753 

Qy 856 

TGTTGGTTTTGATGGAGAACTCTAGCTCATCCACATGCTAGTGCCCAAGTGGTGG 910 



Db 1752 T G AAAT G T C AG AT T AAT T G AAG AG T C C C AG 

TCCGTAATTTTTTGCAGCAAAGGT-G 169 8 

Qy 911 AGGGGCCA CCTCAGCAGGTGGGTTCTG 

AATGCAGCCAAGGCTGT 954 

I III I I I I I I I I I III I I I I I I I 

Db 169 7 ACAATCCAGATTCCTCA-CAG — 

AAGACCTGAGCCAATGCTTTCTTGACTTTTTCTTCTG 16 41 

Qy 955 CCCCGCAATGG GTGAGACTCGCTCCAACTGCCCGCC- 

CTCAG 995 



Db 16 40 CTTCACACAGGTCTTTGTCTACCGTGAG CTCTCCAG- 

TG AC AG AAT AAAT AT AT AT G 1585 

Qy 996 AGCAGGTGCCTAAGTCCTCCCTGGCACTGGCAGGCCTTACCTCACATTG- 

CTAAAT 1050 



Db 158 4 AGAAGGATCAGCAGTTCCTCAGGGCT GTAGTCCT 

CGTTGGTTCTCTGGGT 1535 

Qy 1051 TAAAGCAATGCAATTCCTCTTGGGTA — AGAGGAATT 

CCTCCTTCTTTACTAACTG 1104 

I I I I II I I I I I I I I I II I I I I 

Db 153 4 TACAG GC— 

TTAATCATGGGCAGCAGCTGATTTAACACAACGGACATTGCTGAC— 14 83 

Qy 1105 ATCCCCA— GCAAGGAAATAAAATGTTAGGCTTTAAAAATCCC-TACTTTGT 

CATA 1157 



Db 1482 -TCCCCAATGCTCTGAAGAAGG 

AGCCTTTCAAAAGCCAGAAAGTTGTCCCACTTG 1429 



Qy 1158 TCAGACTATATTC T AAAAC T AT AT T T G AGC - 

GAAACCTGTCATTGCGTCTAATTTCAAAT 1216 

I I I I I I I II I I I I I I I I I I I I I II 

Db 142 8 GCAGTCTGTGGG-TGTTTCAACGTTTGAGCTGTGGCCAGTCCAAGC- 

TGGAGGA 13 7 7 



Qy 1217 ATACAGAAT-CTCCTTAAGAGCTGTTGCCTTATTTTTTTGTAAAGC 

CTCTCTGACAT 1272 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 13 76 G G C C AC AAT GAT T C AT TAG AGC T - T T G AGG T - 

T G T T C T T G AAG AGC T G AAT AT AGG AC — 1321 

Qy 12 73 CAAATGGGGAGAAATGGTGGCAC- 

C T C C AG AC AC C C T GAAAC TACACACCATTTCTTCCC 1331 

I I I I I I I I I I I I I I I II III I 

I 

Db 1320 ATGAGCTGTCCCGGTGTGACTCTCC — 

CCATACTCATCTTGATTGGCAGGTTTTCTC 1266 

Qy 1332 TGCT— CAGCTTCTGCTCAGGAGTTCTGTGAGCTATGGGAAGGCCAT- 

TGGTTGTATT 1386 

I I I I I I I I I I II II III I I I I I I I I 



Db 1265 TGCTTGCCGCTTC — CACTAGATGTCTCCGAACTT- 

CCATCACTGCCTCTTTG 1216 



Qy 13 8 7 TGCTACTTTTACT- 

TTCATCTTCCTCTGCTGTAGAGCCATTTAATGTTATTGTCATATGC 14 45 



Db 1215 TGCTTAGTGTTCAGTAAAGCTTC CCATAGGGC- 

TTTGGCTGTGGTGTCA C 1167 



Qy 1446 TG- 

CTGGTGAGGTAAAGGTGGGTCCGGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGG 15 0 4 

II I I I I I III I I I I I I I III I I I I I I I I I 

Db 1166 T G AAT T G T G AAAG AC AG CCTGGTGCAACCACA TTATA — 

ATTTTCCT 1122 



Qy 1505 GCCGATTTCAGCA-GGAGTTCAGAGGGC- 

TTATGATGGATGGT GAG AG AT T T G AC AAC C A 1562 

II I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1121 -CC TCAGTATGGAGTGCAGTGAGCGCTATCATG- 

AACCA 10 83 



Qy 1563 

CCAGAGCACATGTGCTCTGACCCTCTCCTGGGCATTGGTTCCTGCTGGTACCGGGCGGTT 162 2 
I I I I I I I I I II I I I I I I I I I I I 

Db 10 82 TCACATCATTTGTG TGG CCTGGG 

AGCTGGGGAAGTGCTGAA 10 42 



Qy 1623 

CAGACCTTCAAATAGGTTGCTTTCAAAAGAGCTTTCAGGCACTTATTGAGAATTAATGTT 16 82 
I I I I I I III I I I I I 

Db 10 41 ATGATCTTC 

TCTACTAAGTTGTCT 1018 



Qy 1683 

TAAACAGACATAATAGCCTAGATGAACTCCCAAGAGATCTATTAAATCTTGTGGGCTGAA 17 42 

I I I I I III II I I I I I I I I I I I I I I I 

Db 1017 CCATGATGTCC-AACTG — CTCCTGTGAGATCCA 

GGGTT 982 

Qy 17 43 TAAATATCTCGTGCAGGACTGTGCAACAGTAGCC-CAGA- 

GCATCCTGCCTGTGGGCATC 1800 

I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 981 CTGT-CCACAAAAACCACTGATGC— CCTGCCTGC- 

AGCAGT 9 44 

Qy 1801 CACCTCCCAGGTGAGGGCAGTGGGA-AGCTGGCCCGA- 

CGGCAGCCAGAACTTGTTTCTC 1858 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 9 43 CTTCTTCCTGTTCTTTGCAGGGGCATAATTGGCCAGATCCGCAGCGATGACCTGACT- 



Qy 1859 ACCTCCCACCAGC — AACCCCCCACCC AACTCTGGGCCCC- 

AGGC 1900 

I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 884 

AGGAACCTACAGCAAAACACTCCTCCCGTACTCCTAAATGTTCACACAGAGAACTGAGGC 82 5 

Qy 19 01 ACACGAAGCA-CAAGTCT-CAGGGGACCATTCCCACATTGGGGGATCCTGAG- 

GGAG 1954 

I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I I 

Db 82 4 CTGACACTAGGCATCTGATCTGCAGCAG 

CAGCTCTGGGGTTAGCGTAGTGGAG 7 72 

Qy 1955 — CCCATCACC GCCT-CTTGCATACAACTGTC CACT AGGAG— 

GC 1994 

I I I I I I I I I I I I I I I I I I I I III I I I I I 

I I 

Db 771 

TCCACATCACCCAGGCTTCCCAGCTTCCTCTTGTCCGGTCGGGCGCTATTAAGGAGGTGC 712 

Qy 1995 ACGCCCAGTGTGGGAGAGATGTATGGTCTTGCCTTCCAC — 

CTGTAAAAACTGCACATAT 2 052 

II II I I I I II II I I III I I I I I I I I I 
Db 711 ACATCC 

TGGGGTAGCAGTGGGAAAAGGGATGCAAAAGCTGGAGTCAAGGCAAAGTG 656 

Qy 2 053 GCAAGC — CATTTGCACTCTGGAACTGCATGCCGTGAAAACTCCT- 

AATGGTGTGGAACT 2109 

I III II III I I I I I I I I I I I I I I I I I I I I 

I 

Db 655 GGGAGCAACAGGGGCAAGCAATAACGG GACATG- 

GAACACCTCGGCCGTGTAGTTCA 600 

Qy 2110 T AGT T TGAAT T TGAAATCACGCCGC ATGCACAAAGGGACAGGCC- 

CAGG 2157 

I III I I I I I I I I I I I I I I I I I I I I I 

I 

Db 599 TG — TTGCCCATCCATTCACACAGCTTCTCCTCCAGCTGCTCGAA — 

CACCGGCTGCTGC 544 



Qy 2158 CCCGACCTCAGGTCATCCGC CCGCTGGCTGCAGAGCATCCCTGGGA 

GCCAA 2208 



Db 5 43 CCCTCCATC TCGGCCGCTGCCGCCGCTGG — 

G AC AT GAT T AGC T G T G AGG T GG AC A- 490 

Qy 22 09 GGCGAGGCCCGTGG 

AGCCTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTCGTG-C 2 2 63 



Db 489 - 

GCGTGGCTCACGGTTGTGACCACCACACAATACTGGAAGTGACTGCGGCAGATGATGTC 431 

Qy 226 4 ACTTATTATGCACCA CCTCCCTT 

CAGTCACCACTCCTCTTCCTCCGCCATCC 2 315 

III I I I I I I II I I I I I I I I I I I II 

Db 43 0 CCGTAGGAT- 

CTCCACGGTCCGGCCTTTCAGCAGGCAGCTCAGCACAAACACTGCC 3 76 

Qy 2316 TCATTTATACTGATTGCACACCCCCCGCTCAAACAACAATGTC 

CTTATTATGATGAC 2 3 72 



Db 375 TTGGGCTGCTTG- 

GCTCCACCACCAATTGCGTCGGGCTCGAACTCTCGCAGGTGAC 321 

Qy 23 73 CATCTCGTAGTGGTACATTCC-ATTCCTATTTA- 

AGGTAAGCCCAAAGCCCACTTTTGGA 2430 



Db 32 0 AGTC AGGGCCCCCCACCGCCTCCAGGAGACGGGTGGATCCGCAGCCCC — 

AGTGCA 26 7 

Qy 2 431 TTTTCTCG — ACTGTCCGAGAAAAGTTGTGTAAGCGCCTG — 

CGTTCTTCTGGGTTTGGC 2486 



Db 266 GGCTCTCGGCGCAGGCGGCGTCCAG GTAAACCACAGCCCGTTTCAC 

TTTGGC 215 

Qy 2487 

TAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGGGTGTCATTTTCT 2 5 46 

II II I I I I I I I I I I I I I I I I 

Db 214 CAGCAC CTGCTCCCATCCTTG CTGGGTAAAG — 

GACAGT A 177 

Qy 2547 CGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAAC-ACGATG- 

AGTACAT 26 0 4 



Db 176 CGCCCGAGGC-GCTCATGGTTGGGGATTC- 

GCAGACTTGG G AAAC TACGGTGCAG G AAC T 119 

Qy 26 05 T C AG AAT T AC AAT AAC T C AC TC TC AC T GGG T AAC T — 

TCTCATGATAGATTTGTATGATC 2662 



Db 118 TC TTTCAG-AACTCAC-CGCTTCCGGAAATTGGGCTCCGGGAGACTTTGACAG- 



TC 66 



Qy 2663 

AATACGGGTCTATTTTTATGTCAACTGAACACTGTAGGGTACCTTCCAGTCTTTTTCAAG 2 72 2 

III I I I I I I I I I III I III I I I I I I I 

Db 65 TCCACAGTACACCCAGTAGGCCAACAGTGCTC-GAGGAATTCCG — CAG- 

CTTT 16 

Qy 2723 ATTGTTAAATTGAG-ACAAGTAA 2744

I III I I I I I I I 
Db 15 TAGAGCAGAAGTAA 2 



RESULT 7 

US-10-478-914-20 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2

; SEQ ID NO 20

LENGTH: 2495

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-478-914-20 

Query Match 31.4%; Score 5410; DB 1; Length 2495; 

Best Local Similarity 45.7%; Pred. No. 0; 

Matches 1375; Conservative 0; Mismatches 956; Indels 678; 
Gaps 181; 

Qy 4 AAATCAGGAAATCAGGC CGGGCGCAGTGGCTCATGCCTGTA- 

AGCCCAGCACTT 56 

I II I I I I III I I I I I I I I I I I I I I II 
Db 2 ACTGTTGGCCTACTGGCTACTATCAGGATC-GTGGCTATT — 

T T G AAG AGC T GAT C AC C A 5 8 

Qy 57 TG— 

GGAGGCGGAGGCGGGTGGATCCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAA 114 

II I I I I I I I II I I I I I I I I I I I I I I II I II 

Db 59 TGTTGGAAGCAGC-AC TGGGAC — TTGAG — C — GAGCTC ACATG 

GGAATG 102 

Qy 115 CATGGTGAA — 

ACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGG 172 



Db 103 T T T AC T G AAT T AGC T AT TC T AT AC TC T AAAT T T AAGC - 

C T C AG AAAAT — G 150 



Qy 173 

CGGGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTGCTTGAACCCGGG 232 

I I I I I I II I I I I III III II II III III 

Db 151 AGGGAGCACCTGG AGCTGTTCTGGT — CTAGAGT— GAATATT 

CCC— 192 

Qy 233 

AGGCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTGGGCGACAGAGCA 2 92 

I I I I I I I I I I I I I I I III I I I I I I I I I 

I 

Db 193 AAGGTGCTAAGAGCTGC AGAACAA — GCTCATCT — TTGGG 

CAGAACT 236 

Qy 293 

AGACTCTCT C AAAAAAAAAAAAAAAG AAAG AAAG AAAT C AG AAAAT C G AC CACAGTGGT- 351 

I III I III I II I I I III I III I I I I II 

Db 237 GGTGTTTTTGTATGACAAGTATGAAGAATATGATAATGCCATAATT —

ACCATGATGAAT 294

Qy 352 -AGCCACCTG — GCCT — AATGCTGTGTTTTTGTA 

CCTGACAGGGGTCACTCAT 400

I I I I I I I I I I I III II II I I I I I III I 

I I 

Db 295 CATCCAACTGATGCCTGGAAAGAAGGGCAATTCAAAGATATCATTACCAAGGTTGC- 

CAA 353 

Qy 401 TTTAGGCACAACTCCTTCATTCTTTGTGAAATTAGTGAGTT-TCCTTCTAC — 

CCGTCAC 457 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 354 TGT-GGAACTA-TACTACA GAGCAATAC

AGTTCTACTTAGAATTCAAGCCT 402 

Qy 458 

CAGATTCAATATGTTCTATTAATACACCGATAACCACAGGGGAAGGGCACTTGTCGCTCT 517 

III I I I I I I I I I I I I I I I I II I I I I I I I I 

I 

Db 403 CTGTTGTTAAATGATTTGCTGATGGTGCTGTCTCCACGGTTGGATCACACTCGT- 

GCAGT 461 

Qy 518 

CCCACCTGGTTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAGGAAACAC 577

I III III I I I I II I I I I I I I I I I I I I 

Db 462 C AATTATTTCAG CAAGG TTAAACAGCTACCACTGGT 

GAAAC— 502 

Qy 578 TCATCACTAGTATCTAAGTCGGGCTTTACAGTAACTATGCACC — TTCTGTG — 

TGCTTC 633 

I I I I I II I I I I I I I III I I I I I I II 

I I 

Db 503 CGTATTTG 

CGTTCAGTTCAGAACCATAACAACAAATCTGTGAATGAATC 551 

Qy 634 

ACCTCACTCTCTACTT C AAAC AGCCC ATGGAGGGAGGT AT TAT TATACTCCTTATGTTGA 693 

I II III II III I I I I I I I I I I I III II 

I 

Db 552 ATTGAACAATCTTTTTATTACA GAAGAAG — ATTATCAGGCTC 

TGCGAA 598



Qy 694 CAGTGA-AGAATCTGAGGCCCAGAGAGGTTGGGGACTTGAGTAAAGTCACAC-

AGCCCTG 751 



Db 599 CATCAATAGATGCTTATGAC 

AACTTTGACAATATCTCGCTTGCTCAG 645

Qy 752 AG AGGCAGGACC AGGGTTC- 

CATTCCTGCTCTATCCAGTTCCAAG 795 



Db 646 CGTTTGGAAAAACATGAACTCATTGAGTTCAGGAGAATTGCTGCT-TATCTC-

TTCAAAG 703

Qy 796 CC — CTTGTGTTTTCCATTATGTTTAGTGCCTCTTTGCT — 

AACAGCAACATCTGCAAGA 851 

II II I I I I I I I I I I I I II I I I I 

Db 704 GCAACAATCGCTGGAAACAGAGTGTAGAG

CTGTGCAAGAAAGACAGCCTTTACAA — 758

Qy 852 

TTTGTGTTGGTTTTGATGGAGAACTCTAGCTCATCCACATGCTAGTGCCCAAGTGGTGGA 911 



Db 759 GGATGCAATGCAGTATGCTT-CTGAATCTAAAGATACTG 

AATTGGCTGA 806

Qy 912 GGGGCCACCTCAGCAGGTGGGTTCTGAATGCAG — 

CCAAGGCTGTCCCCGCAATGGGTGA 969 

I I I I I I II I I I I I I I I I I I I III II I 

Db 807 -AGAACTCCT — GCA-GTGGTTTTTGCAGGAAGAAAAAAGAGAGT 

GCTTTGG 854 

Qy 970 GACTCGCTCCAACTGCCCG CCCTCAGAGCAGGTGCCT-

AAGTCCTCCCTGGCACTG- 1024 



Db 855 AGCTTG-TCTGTTTACCTGTTACGATCTTTTAAG — 

GCCAGATGTCGTCCTAGAAACTGC 911 

Qy 1025 — GCAGGCCTTACCTCACATTGCTAAATTAAAGCAATGC 

AATTCCTCTTGGGTAAGA 1079



Db 912 ATGGAGGCACAATATCA TGGA: 

TTGCCATGCCCTATTTCATCCAGGTCATGA 964 

Qy 1080 GGAATTCCT 

CCTTCTTTACTAACTGATCCCCAGCAAGGAAATAAA 1124



Db 965 AGGAGTACTTGACAAAGGTGGATAAATTAGATGCT-TCAGAATCACTG — 

AGAAAAGAAG 1021 

Qy 1125 ATGTT-AGGCTTTAAAAATCCCTA — CTTTGTCAT — ATCAGAC 

TATATTCTAAAA 1175 



Db 1022 AAGAACAAGCTACAGAGA-

CACAACCCATTGTTTATGGTCAGCCCCAGTTGATGCTGACA 1080



Qy 1176 CTATATTTGAGCGAAACCTGTCATTGCGTC 

TAATTTCAAATATACAGAA 1224

I I I I II I I I I III III I I I I I I 

Db 1081 GCAGGACCCAGTGTT 

GCCGTCCCTCCCCAGGCACCTTTTGGTTATGGTTATACCGCA 1137

Qy 1225 TCTCCTTAAGAGCTGTTGCCTTATTTTTTTGTAAAGCCT — CTCTG 

ACATCAAATGG 1279

I I I I I I I III I I I I I I I I I I I I I I I 

I I 

Db 1138 CCACCGTATGGAC AGCC

ACAGCCTGGCTTTGGGTACAGCATGTGA 1182 

Qy 1280 GGAGAAATGGTGGCACCTCCAGACACC

CTGAAACTACACACCATTTCTTC 1329

I III III III I I I I I I I I I I I I I I I I I I 

I 

Db 1183 GATGAAGCGCT-GATCCTGTAGTCACCTATTTTCGTACTGAAAC— ATCGTCTTTAC- 



Qy 1330 

CCTGCTCAGCTTCTGCTCAGGAGTTCTGTGAGCTATGGGAAGGCCATTGGTTGTATTTGC 1389

I I I I I I I I I I I I I III I I I I I III 

I I I 

Db 1239 ACTTCTCAGTTTATAATGGGG GAAAAC AGGCAAC 

GTGTTCTTGT 1282 

Qy 1390 TACTTTTACTTTCATCTTCCTCTGCTGTAGAGCCATTTAATGTTATTGTC-

ATATGCTGC 1448 

I I II I I I I I I I I I I I I II I I I I I I I I I 

Db 1283 AACCTTTA-TTTCA 

TGAAGGACTTCTTTTTGTTTCTAACTATAAACT — 1328

Qy 1449 

TGGTGAGGTAAAGGTGGGTCCGGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCC- 1507

I I I I I III I I I I I I I I 

I I 

Db 1329 TGGATC ACCT ATGTTAAAA — 

CCT 1350 

Qy 1508 GATTTCA

GCAGGAGTTCAGAGGGCTTATGATGGATGGTGAGAGATTTGAC 1557

I I I I I I II III I I I I I I I I I I I I I I I 

Db 1351 TATTTCACATTCCACATCATTTTAGA — 

ATTTATTTTCGAAGGGGAATAGTTTCAATGTT 1408 

Qy 1558 — 

AACCACCAGAGCACATGTGCTCTGACCCTCTCCTGGGCATTGGTTCCTGCTGGTACCG 1615 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1409 TTATTCACTTGGGCTTTTTTTCT TCCCCCTCT TTCTT 

TAAAG 1450 

Qy 1616 

GGCGGTTCAGACCTTCAAATAGGTTGCTTTCAAAAGAGCTTTCAGGCACTTATTGAGAAT 1675

I I I I I I I I I I I I I I I I I II I I I I I I I I I II 

Db 1451 AACTGCTCA-ATATTC-AATCTGTTG-TGAAGAACCTGATTT GCACT — 

CTG 1497 



Qy 1676 

TAATGTTTAAACAGACATAATAGCCTAGATGAACTCCCAAGAGATCTATTAAATCTTGTG 1735 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1498 TAGTGTTTAAAGAAACAAAGAAACTCTAA

TATTGAATCT 1536 

Qy 1736 

GGCTGAATAAATATCTCGTGCAGGACTGTGCAACAGTAGCCCAGAGCATCCTGCCTGTGG 1795 

I I I I I I I I I I I III I I I I I II II I I I I 

Db 1537 — CTTAA ATTTAGTGTA TGT-

AAACAGCTTACAAATACGTATTGTCTAAAT 1584 

Qy 1796 

GCATCCACCTCCCAGGTGAGGGCAGTGGGAAGCTGGCCCGACGGCAGCCAGAACTTGTTT 1855 

I I I I I I I I I I I I I I I I I II I I I I I 

I 

Db 1585 GCATTTAAAT — CTGTTTTATTCAAAGAAAAGCTAAAGCAAAAAC 

ACTGGCAT 1635 

Qy 1856 CTCACCTCCCAC CAGCAACCCCCCACCCAACTCTGGGCCCCAGGCACA- 

CGAAG 1908 

I I I I II III I I I I I I I I I I I I I I I I I 

I 

Db 1636 ATGACCATGCAAGACTGTCAGTGCCAACAAAGACAACACTAATC 

AGCACATCGTA- 1690

Qy 1909 CACAAGTCT-CAGGGGACCATTCCCACATTGGGGGATCCTGAGGGAGCCCATCACCG-

CC 1966 

III I I I I I I I I I I I I I I I II II I I I I 

I 

Db 1691 CACTGGATTGCAGTG CTTCCCAGATTATTGAAAAATG 

TTACAGACA 1736

Qy 1967 TCTTGCATAC AACTGTCCACTAGGAGGCACGC

CCAGTGTGGGAGAGATGTAT 2018 

I I I I I I I I I I I II I I I I I I II II II 
Db 1737 ACTTGCCTGATTTTTAAATGAGCG-
TAAAAGGCCCTCTAACCTATGCAGG 1785 

Qy 2019 GGTCTTGCCTTCCA 

CCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGGA 2073

I I I I III I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1786 TTTCC — CCATTATGCATATAGAAAATGCTAGTATG

TTTTGCTCACTTCA 1833 

Qy 2074 

ACTGCATGCCGTGAAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGAAATCACGCCG 2133 

III III I I I I I I I I I I I I I I I I I I 

I 

Db 1834 TATGTAACAGGTGC CCTTATGTTGTG CT— GT ATCCTG-

TG 1871 

Qy 2134 

CATGCACAAAGGGACAGGCCCAGGCCCGACCTCAGGTCATCCGCCCGCTGGCTGCAGAGC 2193 

II I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1872 CTTTTTCTGTGGGACCATTCCATTCAGGAGCAAAG AGCAC-

CATGATTCC 1920



Qy 2194 

ATCCCTGGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGTGTAGCTCGAGCTTTGTG 2253
I I I I I I II I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 1921 AATCTTGTGTGTGTTTACTA-ACCCTT 

CCCTGAGGTTTGTGTATGTTGGATATTGTG 1976 

Qy 2254 TAG — CTCGTGCACTTATTATGC — ACCACCTCCCTTCAGTCACC— 

ACTCCTCTTCCTC 2307 

I II I I I I I I I I I I I I I I II I I I I I I I 

I I 

Db 1977 

GTGTTTTAGATCACTGAGTGTACAGAAGAGAGAAATTCAAACAAAATATTGCTGTTCTTC 2036
Qy 2308 

CGCCATCCTCATTTATACTGATTGCACACCCCCCGCTCAAACAACAATGTCCTTATTATG 2367

I I I I I I I I I I II I I I I I I I I II 

I I 

Db 2037 AGTTTT 

GTTTGTGGAATTTGAAATTACTCAAATTTAAAATAAATTACTGGACTGTG 2092

Qy 2368 ATGACCATCTCGTAGTGG

TACATTCCATTCCTATTTAAGGTAAGCCCAAAGCCCAC 2423

I I I I I I I I III I I I I I III I I I I 

I I 

Db 2093 GAAATAACATAGAATTGAAGTTTTAATTAAATACCACT

CAAACGAAAAGAACA- 2145

Qy 2424 

TTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGCCTGCGTTCTTCTGGGTTT 2483

I I I I I I II I I I I I I I I I I I I I I 

Db 2146 — GTAGTTTTT GTAGTTTTATATTGGATACTGAG- 

GCATTAGGG — A 2187 

Qy 2484 GGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAG-

GGTGTCATT 2542 

III I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 2188 GGCATGAAAGG AAGAGGAATGAGGATTGA 

GACATGTGAAGACATT 2232

Qy 2543 

TTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAACACGATGAGTAC 2602
I III III I I I I I I I I I I I I I I I II I I I I I 

I I I I 

Db 2233 GT GCATTATA TCAATGTGCATTC — CTGTAGT — TCATTAACAAG

GTAC 2277 

Qy 2603 

ATTCAGAATTACAATAACTCACTCTCACTGGGTAACTTCTCATGATAGATTTGTATGATC 2662

I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 2278 ATGCAATAGT-CTAAAGAACCAGAGTCACTACT-ATAGTGGCTTAACAT —

TT 2326 

Qy 2663 

AATACGGGTCTATTTTTATGTCAACTGAACACTGTAGGGTACCTTCCAGTCTTTTTCAAG 2722

III I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 2327 AATCTGTCTCCAATATTTTAACCAAGTGACACCG—

GGGTTTTTATCGAAGCATTTCAC- 2383



Qy 2723 

ATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTTTAAAAAAAGTGA 2782

I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I 

Db 2384 TTAAATGA-ACAAATCATGGC TGT —

TATATTAACTTGAAATAAAATAT 2429

Qy 2783 ATGGACTGAAATGTTAAATGTGAATGTACATTTCTTAAT-

TGCAATTTTTCTACTGAGTG 2841

II I I I I I I I I I II II I I I I I I I I I I I 

Db 2430 ATTTA — AACATGTAAAAAAAAAAAAAAAAAGGCCACATGTGC —

TCGAGCTGCAGGTCG 2485



Qy 2842 TTTGCACTA 2850 

I III 

Db 2486 CGGCCGCTA 2494 
Qy 16 CAGGCCGGGCGCA-GTGGC

TCATGCCTGTAAGCCCAGCACTTTGGG 60

III I I I I I I I I I I I I I I I I I I I I I 

Db 2336 

CAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTGTATTCTCAGAGCT 2282



Qy 61 

AGGCGGAGGCGGGTGGATCCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAACATGGT 120

III I I I I I I I I I I I I I I I I I I I 

I 

Db 2281 -GCCAG GAGTGCATC GAGC-



CTGTAATTTCCTGTTCTC T 2245 



Qy 121 GAA- 

ACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGCGGGAGC 179 



Db 2244 GAATCCCCCATCT-TTCTGCAGCTCCA AGCTTTGTGT

CCCACAGC 2201

Qy 180 CTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTGCT- 

TGAACCCGGGAGGCGG 238

I I I I I I I I I I I I I I I I I I I I II 

I I I I 

Db 2200 CTGT-GACTCTGT GCTAA 

CAAATCGCTATTGTCCAGTGGGGCGA 2158 

Qy 239 A-GGTTGCAGTGAGCTGA— GATTGC 

ACCATTGCACTCCAGCCTGGGCGAC 286

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2157 ATGGTGGCTG- 

GAACTAAAGAATTGCTGTCTGGTTTCTATTCAAATCCAGGTAGCGAGAT 2099



Qy 287 AGAGCAA — 

GACTCTCTCAAAAAAAAAAAAAAAGAAAGAAAGAAATCAGAAAATCGAC — 342
I I I I I I I I I I I I I 

Db 2098 ATATGAATGGACT-TTTCGA

ATCGTCATGTGAATAACGTCTG 2058



Qy 343 CACAGTG-

GTAGCCACCTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCATT 401

III I I I I I I I I I I I I I I I I I I I I I I 

Db 2057 CTCGGCATGAAGGCTCAGAGCC — ATGCT

AGGAAGGATTAACTCG — 2015



Qy 402 

TTAGGCACAACTCCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTACCCGTCACCAGA 461 

I I I I I III I I I I I I I I I I I I I I I I II III 

I I 

Db 2014 -TAGGCTGACCACTAACATCCTTTGTGGTACGAGGGAGAAACAT 

TCCCAAGT 196 4 



Qy 462 TTCAATATGTTCTATTAATACACCGATAACCACAGGGGAAGGGCACTTG — 

TCGCTCTCC 519 



Db 1963 ATCATTTTATTC 

ACACTTAATTTTCTATCC 1934 

Qy 520 

CACCTGGTTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAGGAAACACTC 5 79 
II III I I I I I I I I 

Db 1933 CA TAC 

CCCCAAAATAAGG 1916 

Qy 580 

ATCACTAGTATCTAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTGTGCTTCACCTCA 639 
I I I I I I I I I I I II I 

Db 1915 CTAGCTATTTAATTAGTTGG CC 

GCTT 1890 



Qy 640 

CTCTCTACTTCAAACAGCCCATGGAGGGAGGTATTATTATACTCCTTATGTTGACAGTGA 699 
I I I I I I I I I I I I I I I I 

I I 

Db 1889 TTCTCTTAAT 

TTTTAGTGTTTCTGTTGATAATG- 1857 



Qy 700 

AGAATCTGAGGCCCAGAGAGGTTGGGGACTTGAGTAAAGTCACACAGCCCTGAGAGGCAG 759 

I I I I I I I I I I II I I I I I I I I I 

Db 1856 TGTGAGTTTGGGAAAATGC — 

TAAGTAGCTTTTCACTTA 182 0 



Qy 76 0 GACCAGGGTTCCATTCCTGCTCTATCCAG-TTCCAAGCCCTTGTGTTTTCCATTAT- 

GTT 817 

I I I I III II I I I I I I I I I I I I I I I III I I I I I 

Db 1819 GAACACTGTTATTTT CTCTTTAAAGTTTTCTA — CCTTACATTT 

ATTATAGCA 1769 



Qy 818 

TAGTGCCTCTTTGCTAACAGCAACATCTGCAAGATTTGTGTTGGTTTTGATGGAGAACTC 8 7 7 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 176 8 TAGT-TATCTTT ATAGCAT-AGATGCAGAA 

AGTAAGAGAGAGC 172 8 



Qy 878 

TAGCTCATCCACATGCTAGTGCCCAAGTGGTGGAGGGGCCACCTCAGCAGGTGGGTTCTG 93 7 

I I I I I I II I I I I II 

Db 172 7 TTGTTTTTTCA AGAAAAC AAC C C 

TTTAA 1700 

Qy 93 8 AATGC- 

AGCCAAGGCTGTCCCCGCAATGGGTGAGACTCGCTCCAACTGCCCGCCCTCAGA 996 

I I I I I I I I III I I I I I I I I I I I I I I III 

Db 1699 AATACTTTCCAA CCCATGAAGGG-AAAAATC-CTCCTTTTTTC — 

CCC 1656 

Qy 99 7 GCAGGTGCCTAAGTCCTCCCTGGCAC-TGGCAGGCCTTACCTCACATTG- 

CTAAATTAAA 1054 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

I 

Db 1655 -CAAGTGCATT CTACTTATTACTTTGCA 

TTTTTCTCCCAAAGTCCAAATTTAT 1604 



Qy 1055 GCAATGCAATT — 

CCTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAG 1112 

I I I I I I I I II I I I I I I I I I III 

Db 1603 

GCAAAGAAAATAGAAACAAGTTCAAATGCAATGCATTAACCAAATAAAACAAGTCTCCTT 1544

Qy 1113 CA AGGAAATAAAATGTTAGGCTTTAAAAATCCCTACTTTGTCA- 

TATCAGACTATA 116 7 

II I I I I I II I I I I I I II III II II 
Db 15 43 CAAATTAGGAACCAACC TAAGCATTTGTAA 

AG T G T AGC AG AAT C AG AAT T C T 1492 



Qy 1168 

TTCTAAAACTATATTTGAGCGAAACCTGTCATTGCGTCTAATTTCAAATATACAGAATCT 12 2 7 



II I I I I I I I I I I I I I I I I I I I I I I I I III

Db 1491 T T TAAAAAT T AGAT T T GGAACC TGAAC T AT AT AAT T CAT AAT -



Qy 122 8 CCTTAAGAGCTGTTG — 

CCTTATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGAA 12 85 

III I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1446 CATTT — TTCTGTGGAAAATTATTTTATCTTT CTC-CTGTATACCT- 

GAAAAA 1398 



Qy 1286 

ATGGTGGCACCTCCAGACACCCTGAAACTACACACCATTTCTTCCCTGCTCAGCTTCTGC 13 45 

III I I I I I I I I I I I I I I I I I I I I III 

I 

Db 139 7 ATG TCCATA-GGCTTAAAGGGTCATGCTTTTACATTCCTTC 

CATAT-C 1352 

Qy 13 46 TCAGGAGTTCTGTGAGCTATGGGAAGGCCATTGGTTGTATTTGCTACTTTTACTT — 

TCA 1403 

III I I I I I I I I I II III I I I I I I I I III III 

I I 

Db 1351 AC AG — G T AC TAT G AAG T AAGG — 

AGACTTTTAGGTTTCTTTTTGTCTTAAACTCAGACA 12 96 

Qy 1404 TCTTCCTCTGCTGTAGAG 

CCATTTAATGTTATTGTCATATGCTGCTGGTGAGGTAAA 146 0 

III I I I I I I I I III II I I I I I I I I I I I 

I 

Db 1295 G C T T T G T AAGC AG TAG T G T G TAG AT T AC AAG AG T TAG AC AAA 

AGCAGGCGCGACTGA 1239 

Qy 1461 

GGTGGGTCCGGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGA 152 0 

I III I I I I I I I I I I I I I I I I I I I I I II 

Db 123 8 GAAGAGTT — GGTG GGGG — AGAAG CTTGGGGC — ACTTC — 

CTGTC 1200 



Qy 1521 GT TC AGAGGGC T T ATGATGGATGGTGAGAGAT T TGAC A- 

AC C AC C AG AGC AC AT G T GC T C 15 79 



Db 1199 AC TCA ACACATTCCAGATCACTAAAAAATTTCCACACCCTCTG- 

CATTCCCCC 1148 



Qy 158 0 TGACCCTCTCCTGGGCATTGGTTCCTGCTGGTACCGGGCGGTTCAGA — 

CCTTCAAATAG 1637 

I I I I I I I I I I I I I I I I I I I I I 

Db 1147 TTGCCCACTCC AGTTC 

CCGGTATTTTCTGATTCCATATGTTGT 1105 



Qy 163 8 GTTGCTTTCAAAAGAGCTTTCAGGCACTT — 

ATT G AG AAT T AAT G T T T AAAC AG AC AT AA 1695 

II I I I I I I I II I I I I I II I I I I I I 

Db 1104 

GGTATTTACCATACTTCTCTCCCTCACTAGGCTCTGGCAAGACTGCTTCAGAGGGGATGC 10 45 



Qy 1696 TAGCC — 

TAGATGAACTCCCAAGAGATCTATTAAATCTTGTGGGCTGAATAAATATCTCG 1753 



II I I I I I II I I I I I III II I I I I II I I I I

Db 10 44 AT T C C T T TAG AT - T GC AC AAAGCGGAGC TGGG AAA- 

ATGGCTGGCAGTTTCAGAATCTAG 987 

Qy 175 4 T GCAGGACTGTGCAACAGTAGCCCAGAGCATCC 

TGCCTGTGGGCATCCAC 18 03 

I I I I I I I I I I I III I I I I I I I I I III 

I 

Db 9 86 TCACGATCGCACGCATGAGCACCT CACA CATCCATGTCCCT 

ACCCGC 9 40 

Qy 1804 

CTCCCAGGTGAGGGCAGTGGGAAGCTGGCCCGACGGCAGCCAGAACTTGTTTCTCACCTC 1863 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 939 CCCCCCGCT C CCGCCCCTGCAGCTGGCTGACC 

TGTCTCACCCA 89 7 

Qy 186 4 CCACCAG-CAACCCCCCACCCA— AC— TCTGG 

GCCCCAGGCACACGAAGCACAAG 1914 

I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 896 CTGCTGGCCTATCGAACGGCCAGGACTGTCTGGTTTTGGCTCGTGC 

CTTTGTCCATG 840 

Qy 1915 TCT — 

CAGGGGACCATTCCCACATTGGGGGATCCTGAGGGAGCCCATCACCGCCTCTTGC 19 72 

III I I II II I II I II I I I I I I I I I I I I 

I I 

Db 839 TCTGGCTTAGTTCCTCTCTGTCTATGCTTGCCTCT ACCCC— 

CACCGCCCCAGGC 78 7 

Qy 1973 - 

ATACAACTGTCCACTAGGAGGCACGCCCAGTGTGGGAGAGATGTATGGTCTTGCCTTCC 2 031 
I I I I I I I III III II I I I I I I I I I 

I I I I 

Db 786 GGCACAAGTGT T T GGC C AC AC AAAAC T AG AG AT AG AAAAG - 

GTGGTAAAAACTTCA 732 

Qy 2 032 ACCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGG 

AACTGCATGCCGTG 2 0 86 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 731 AACTTTTCTAAATTCTC CAA — 

CAGTTTATTTCTTGTGAATTTCTTCCTTCTTTA 679 

Qy 2087 

AAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGAAATCACGCCGCATGCACAAAGGG 2146 

I I I I I I I I I III I I I I I III III I II 

I I 

Db 6 78 AAT AC T C CAT T T TAAG- AAAAC AAA AAAATTAATTATCTAAAGGCA 

AAGAATGG 626 

Qy 2147 ACAGGCCCAGGCCCGACCTCAG-GT — 

CATCCGCCCGCTGGCTGCAGAGCATCCCTGGGA 22 03 

III I I I I I III II I I I I I I I I I I I I I 

Db 625 AAAG CAACCTTTGTGTTCCTTATAATAACTGACTTCATAAC- 

TCTCT 580 



Qy 2204 GCCA-AGGCG

AGGCCCGTGGAGCCTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTC 2259

III III II I I I I I I I I I III I I I I I 

I 

Db 579 -CCAGCTGCGTTATGGGATGTGTATAAAAAGCTTC-TGTT-CT- 

GAGAACAAAGGAGCAC 52 4 

Qy 2260 

GTGCACTTATTATGCACCACCTCCCTTCAGTCACCACTCCTCTTCCTCCGCCATCCTCAT 2319 
I I I I I III I I II I I I I I I I I I I I I 

I 

Db 523 G T G C AG AAAT G AG AC G AAAAAAT C C - AC T G AC AG TAT T C C - 

AT T AC AC AAAT T AC T T AAA 466 

Qy 2320 

TTATACTGATTGCACACCCCCCGCTCAAACAACAATGTCCTTATTATGATGACCATCTCG 23 79 

II I I II I I I I I I I I I II I I I I I I I I 

Db 465 AGATTTTAGT — CAAGCCCC 

TCAACAGATTCAATTTTAAAATGGCTTT 42 0 

Qy 2380 

TAGTGGTACATTCCATTCCTATTTAAGGTAAGCCCAAAGCCCACTTTTGGATTTTCTCGA 2 439 

I I I I II I I I I I I I I II I I I I I I II 

I I I 

Db 419 TAGTTAAAAA AAAAAAATTGAAAGT — GCTTA C C C AG T AAAAG A — 

ACCGA 3 73 

Qy 2440 C— 

TGTCCGAGAAAAGTTGTGTAAGCGCCTGCGTTCTTCTGGGTTTGGCTAGATAGGGTT 2 49 7 

I I I I III III I I I I I I I I I I I I I I I I 

I 

Db 3 72 AGTAGTCC-TGAACTGTTACGTAAG AC T T T T T AC AG 

TTGGATC T 330 

Qy 2498 

GTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGGGTGTCATTTTCTCGCACAATACA 2557 

III I I I I I I II I I I I I I I I I I I I II III 

I I 

Db 329 TTGT — CAAAAGGGGATGG-GGGTGATGGGAGAAAG CA GCA- 

ACG 289 

Qy 2 55 8 AC— TCACTGAGGAT 

GC T T C T G T AG AAG T G AG AAAC AC GAT GAG T AC AT TC AGAAT T 2 612 

II III III I I I I I I I I I I I I I I I I I 

I I 

Db 288 ACAATCAAAAAAGTTCGAGCTGCTGTGG C T AGAGGAC AAC - 

TTCTGTGTT 240 

Qy 2613 

ACAATAACTCACTCTCACTGGGTAACTTCTCATGATAGATTTGTATGATCAATACGGGTC 2 6 72 
III I I I I I I I I I I I I I I III 

I 

Db 239 TCCAGATAGGATTCTTGCTG 

TAGAAATGGAACTTCCAGCCAGCAC 195 

Qy 2673 

TATTTTTATGTCAACTGAACACTGTAGGGTACCTTCCAGTCTTTTTCAAGATTGTTAAAT 2 732 
III I I I I I I I I I I I I I I I 

I I I I I I 

Db 19 4 AG CATCCTGTCCCAGTAGAGAA ATGAGTTTGTCA 

GTTAAAA 154 



Qy 2733 

TGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTTTAAAAAAAGTGAATGGACTG — 2 79 0 

I I I I I I I I I I I I III I II II I I I I I 

Db 153 C-AAAAAAAAAATTAGATACTGGAACCCAGGCT 

AGACGAGGTATTGAACCGCG 102 

Qy 2 791 -AAATGTTAAATG 

TGAATGTACATTTCTTAATTGCAATTTTTCTACTGAGTGTT 2 8 43 

I II II II I I I I I I I I I I I I I I I I 

Db 101 CCAGATTTCCTTGCAGCCCTGTCTGCTCAGCTC- 
GCATTCCAGTAGGCCAACAG 49 

Qy 2 8 44 TGCACTATACTTTCTGGAATCTTATTTAACAAAAATAAAGGGAAAAAATTGCTTGAC 

2900 

I I I I I III I I I I I I I I I I I I I I I I I I I I I 

Db 4 8 TGCTCGAGGAATTCC-GCAGCTTTTAGAGCAGAA GTAACACTTCCGT-AC 

1 
Qy 8

CAGGAAATCAGGCCGGGCGCAGTGGCTCATGCCTGTAAGCCCAGCACTTTGGGAGGCGGA 67



Db 2495 CTAG

CGGCCGCGACCTGCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTA 2441

Qy 68 GGCGGGTGGATCCA-TTGAGGTCAGGAGCTCA-AGACCAGCC-

TGGTCAACATGG 119 



Db 2440 CATGTTTAAATATATTTTATTTCA —

AGTTAATATAACAGCCATGATTTGTTCATTTAAG 2383 



Qy 120 

TGAAACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGCGGGAG- 17 8 



Db 2382 TGAAA 

TGCTTCGATAAAAACCCCGGTGTCACTTGGTTAAAATATTGGAGACAGA 232 9 

Qy 179 CCTGTAGTCCCAGCTACACGGGAGGCTAAGG 

TGGGAGAATTGCTTGAACCCGG 231 

III I I I I I I I I I I I II I II I I I I I I 

Db 232 8 TTAAATGTTAAGCCA- 

CTATAGTAGTGACTCTGGTTCTTTAGACTATTGCATGTACC 2 2 73 

Qy 232 GAGGCGGAGGTTG-CAGTGAGCT — 

GAGATTGCACCATTGCACTCCAGCCTGGGCGACAG 2 88 

III I I I I I I I I I I I I I I I I I I II 

Db 2 2 72 TTGTTAATGAACTACAGGAATGCA-CATTGATATAATGCAC- 

AATGTCTT 2225 

Qy 289 

AGC AAG AC T C T C T C AAAAAAAAAAAAAAAG AAAGAAAGAAAT C AG AAAAT C G AC C AC AG T 3 4 8 



Db 222 4 CACATGTCTCAATC CTC 

ATTCCTCTTCCTT 2195 

Qy 349 

GGTAGCCACCTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCATTTTAGGC- 4 0 7 



Db 2194 TCATGCCTC CCTAATG CCT— CA 

GTATCCAATATAAAACT 2157 

Qy 408 ACAACTCCTTC- 

ATTCTTTGTGAAATTAGTGAGTTTCCTTCTACCCGTCACCAGATTCAA 466 

I I I I III I I I I I I I I I I I I II I I I I I I 

Db 2156 ACAAAAACTACTGTTCTTTTCG-TTTGAGTGGTATTTAATTAAAACTTCA 

ATTC— 2104 

Qy 467 

TATGTTCTATTAATACACCGATAACCACAGGGGAAGGGCACTTGTCGCTCTCCCACCTGG 52 6 



Db 2103 TATGTTATTT CCACAG 

TCCAGT 2 0 82 

Qy 527 

TTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAGGAAACACTCATCACTA 5 86 



Db 2081 AATTTATTTTAAATTTGAGTAATTT- 

CAAATTCCACAAACAAAACTGAAGAACAGCAATA 2 023 

Qy 58 7 GTATCTAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTGTGCT 

TCACCTCACTCT 6 43 



Db 2 022 TTTT GTTTGAATTTCTCTCTTCTGTACAC — 

TCAGTGATCTAAAACACCACAATAT 1969 



Qy 6 44 C T AC T TC AAAC AGCCCATGGAGGGAGGTATTATT AT AC TC - 

CTTATGTT G AC AG T G AAG A 702 

II I I I I II I I I I I I I I I I I I I I I III II 

I I 

Db 196 8 C C AAC AT AC AC AAAC C T C — AGGGAAGGGTTAGTAAACACACACAAGAT TG— 

GA 1918 

Qy 7 03 ATC-TGAGGCCCAGAGAGGTTGGGGACTTGA 

GTAAAGTCACACAGCCCTGAGAGGCA 75 8 

I I I I I III I I I I I I II I I I I I I I I II 

I I I 

Db 1917 ATCATGGTGCTC TTTGCTCCTGAATGGAATGGTCCCACAG 

AAAAAGCA 1870 

Qy 759 

GGACCAGGGTTCCATTCCTGCTCTATCCAGTTCCAAGCCCTTGTGTTTTCCATTATGTTT 818 

I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1869 CAGG ATACAGCACAA — CA — TAAGGGCACCTG TTACA- 

TATGAAG 1829 

Qy 819 AGTGCCTCT-TTGCTAACAGCAAC- 

ATCTGCAAGATTTGTGTTGGTTTTGATGGAGAACT 876

III I I I I I I I I I I I I I I I I I I I I I 

Db 182 8 T G AGC AAAAC AT AC T AGC AT T T TC T AT ATGC A 

TAATGGGGAA — 178 7 

Qy 877 

CTAGCTCATCCACATGCTAGTGCCCAAGTGGTGGAGGGGCCACCTCAGCAGGTGGGTTCT 936 

I I I I I I I I I I I I I I III I II 

Db 1786 ACCTGC AT AGGT T AG AGGGC C T T T T AC GC T CAT 

TTAA 1750 

Qy 93 7 GAATGCAGCCAA GGCTGTCCCC GCAATGGG- 

TGAGACTCGCTCCAACTGCCC- 987 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 17 49 AAAT-CAGGCAAGTTGTCTGTAACATTTTTCAATAATCTGGGAAGCACTGCAAT — 

CCAG 1693 

Qy 988 -GCCCTCAGAGCAGGTGCCT-AAGTCCTCCCTGGCACTGGCAGGCCTTACCT- 

CACAT-T 10 43 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 1692 TGTACGATGTGCTGATTAGTGTTGTCTTTGTTGGCACTGACA- 

GTCTTGCATGGTCATAT 1634 

Qy 1044 

GCTAAATTAAAGCAATGCAATTCCTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACT 1103 

II I I I I I I I I I I I I I I I I 

Db 1633 GC CAGTGTTTTTGCTTTAG 

CTTTTCTTTGA 1604 

Qy 1104 GATCCCCAGCAAGGAAATAAAATG 

TTAGGCTTTAAAAATCCCTACTTTGTCATATCA 1160 

III I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 16 03 ATAAAACAG ATT T AAAT GC AT T TAG AC AATACGTA- 

TTTGT A 1563 



Qy 1161 G AC T AT AT T C T AAAAC T AT AT T T G AGC G AAAC C - 

TGTCATTGCGTCTAATTT 1211 



Db 1562 AG C T G T T T AC AT AC AC T AAAT T T AAG AG AT T C AAT AT TAG AG TTTCTTTGTTTC 

TTT 1506 

Qy 1212 CAAATATACAGAAT-CTCCTTAAGAGCTGTTGC — CTTATT — 

TTTTTGTAAAGCCTCTC 1266 



Db 1505 AAACACTACAGAGTGCAAATCAGGTTCT-TCACAACAGATTGAATATTGAGCAG- 

TTCTT 1448 

Qy 126 7 TGACATCAAATGGGGAGAAATGGTGGCACCTCCAG 

ACACCCTGAAACTACACACCA 132 2 



Db 1447 TAAAGAAAGAGGGGGAAGAAAAAAAGC- 

CCAAGTGAATAAAACATTGAAACTATTCCCC- 1390 

Qy 1323 TTTCTTCCCTGCTCAGCTTCTGCTCAGGAGT — 

TCTGTGAGCTATGGGAAGGCCATTGGT 13 8 0 



Db 1389 -TTCGAAAAT 

AAATTCTAAAATGATGTGGAATGTGAAATAAGGTTTTAACATAGGT 1335 

Qy 13 81 TGTATTTGCTACTT-TTACTTTCAT CTTCCTCTGCT GT 

AG AGC 1422 



Db 1334 

GATCCAAGTTTATAGTTAGAAACAAAAAGAAGTCCTTCATGAAATAAAGGTTACAAGAAC 12 75 

Qy 1423 -CATTTAATGTTATTGTC ATATGCTGC-TGGTGAGGTAAAGGTG 

GGTCC 1469 



Db 12 74 ACGTTGCCTGTTTTCCCCCATTATAAACTGAGAAGTG- 

GG T AAAG AC GAT G T T T C AG T AC 1216 

Qy 1470 G GGTGCCTTCC CAGGGGTT AGAGGATGTTCAAAGGGCC- 

GATTT 1512 



Db 1215 

GAAAATAGGTGACTACAGGATCAGCGCTTCATCTCACATGCTGTACCCAAAGCCAGGCTG 1156 

Qy 1513 CAGCAGGAGTTCAGAGGGCTTATGATGGATGGTGAGAGATTTGACAACCACCAGA — 

GCA 1570 

III II I I I I I I I I I I I I I I I I I I I 

Db 1155 TGGCTG TC CATACGGTG 

GTGCGGTATAACCATAACCAAAAGGTG 1112 

Qy 1571 CATGTGCTCTGAC— CCTCTCCTGGGCATTGGTTCCTGCTG 

GTACCGGGCGGTTCAG 1625 



Db 1111 CCTGGGGAGGGACGGCAACAC — 

TGGGTCCTGCTGTCAGCATCAACTGGGGCTG 1060 



Qy 1626 ACCTTCA — AATAGGTTGCTTTCAAAAGAGCT — TTCAGGCACTTAT- 

TGAGAATTAATG 16 8 0 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I 

I I 

Db 1059 AC C AT AAAC AAT G G G T T G - 

TGTCTCTGTAGCTTGTTCTTCTTCTTTTCTCAGTGATTCTG 10 01 

Qy 16 81 T T T AAAC AG AC AT AAT AGCC T AGAT 

GAACTCCCAAGAGATCTATTAAATCTTGTG — 1735 

I I I I I I I I I I I I I I I I I I I II 

Db 1000 

AAGCATCTAATTTATCCACCTTTGTCAAGTACTCCTTCATGACCTGGATGAAATAGGGCA 9 41 

Qy 1736 -GGCTGAATAAAT-ATCTCGTGCAGGACTGTGCAACAGTAGCCCAGA — 

GCATCCTGCCT 1791 

III III I I I I I I I I I I II I I I I I II I I I I 

I I I I 

Db 9 40 TGGCAAAATCCATGATATTGTGCC — TCCATG 

CAGTTTCTAGGACGACATCTGGCCT 8 86 

Qy 1792 

GTGGGCATCCACCTCCCAGGTGAGGGCAGTGGGAAGCTGGCCCGACGGCAGCCAGAACTT 1851 

II II I I I I I I III I I I I I III I 

I I I 

Db 8 85 TAAAAGAT CGTAACAGGTAA — ACAG ACAAGCTCCAAAGCACTC-- 

TCTT 840 

Qy 1852 GTTTCTCACCTC CCAC — 

CAGCAACCCCCCACCCAACTCTGGGCCCCAGGCACAC 1904 

I I I II I I I I I I I I I I I I I I I I I I I 

Db 839 

TTTTCTTCCTGCAAAAACCACTGCAGGAGTTCTTCAGCCAATTC A 795 



Qy 1905 

GAAGCACAAGTCTCAGGGGACCATTCCCACATTGGGGGATCCTGAGGGAGCCCATCACCG 196 4 
III II I I I I I I I I I I I I I I I I I I I II 

Db 79 4 G T AT C T T TAG AT T C AG AAG — CA-TACTGCATT 

GCATCCTTGTAAAGGCTGTCT 744 



Qy 1965 CCTCTTGCATACAACTGTCCACTAGGAGGCACGC 

CCAGTGTGGGAGAGATGT 2 016 

I I I I I I I I I I I I I I I I I I II II II I I I I I 

I 

Db 7 43 -TTCTTGC— ACAGCTCTACACTCTGTTTCCAGCGATTGTTGCCTTTG- 

AAGAGATAAGC 688 

Qy 2017 ATGGTCTTGCCTTCCACCTGTAAAAACTGCACATATG 

CAAGCCATTTGCACTCT 2 0 70 

I I I I I I I I I I III III I I I I I I I I 

I 

Db 687 A 

GCAATTCTCCTGAACTCAATGAGTTCATGTTTTTCCAAACGCTGAGCAAGCG 635 



Qy 2 0 71 GGA-ACTG-CA TGCCGTGAAAACTCCTAATGGTGT GGAACT — 

TAGTTTGA 2117 

I I I I I I I I I I I II I I I I I I I I I I II 

Db 63 4 AGATATTGTCAAAGTTGTCAT — 



AAGCATCTATTGATGTTCGCAGAGCCTGATAATCTTC 577 



Qy 2118 ATTTGAAATCACGCCGCATGCACAAAGGGACAGGCCCAGGCCCGACCTCA- 

GGTCATCCG 2176 

I I I I I I I I II I I I I II I I I I I I I I I I II 

I 

Db 5 76 TTCTGTAAT-AAAAAGATTGTTCAATGATTCATTCACAGATTTGTTGTTATGGT — 

TCTG 520 

Qy 2177 

CCCGCTGGCTGCAGAGCATCCCTGGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGT 2 236 

III III III II II I I I I I I III II 

Db 519 — AACTGAACGCA AATAC GGTTTCA CCAGTGGTAGCTGT — 

TTAAC 478 

Qy 2237 GTAGCTCGAGCTTTGTGT- 

AGCTCGTGCACTTATTATGCACCACCTCCCTTCAGTCACCA 2 2 95 

I I I I I I I II I I I I I III III III I I I I 

I I 

Db 477 CTTGCT-GAAATAATTGACTGCACGAG TGTGATCCA — 

ACCGTGGAGACAGCA 42 8 

Qy 2296 

CTCCTCTTCCTCCGCCATCCTCATTTATACTGATTGCACACCCCCCGCTCAAACAACAAT 2355 

I I I I I I I I I I I I I I I I I I I I 

Db 42 7 C CATCAGCAA — ATCATTT 

AACAACAGA 402 

Qy 2356 

GTCCTTATTATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAAGGTAAGCCCA 2 415 

I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 401 GGCTTGAAT TCTAAGTAGAACTGT — 

ATTGCTCTGTAGTATAGTTCCA 356 

Qy 2416 AA— 

GCCCACTTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGCCTGCGTTC 2 4 73 

I I I I I I I I I I III III I I I I I I 

I I I 

Db 355 CATTGGCAACCTTGGTAAT GATATCTTTGAA TTGC- 

CCTTCTTTC 312 

Qy 2474 

TTCTGGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAG 2 533 

III I I I I I I III III III I I I I I I I I 

Db 311 — CAGGCATCAGTTGGAT — GATT CATC ATG 

GTAATTATGGCAT- 2 72 

Qy 2534 

GGTGTCATTTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAACAC 2 593 

I I I I I I I I I I I I I I II I I I III I I 

I 

Db 2 71 — TATCATATTCT T CAT AC T T G T C A- T AC AAAAAC AC C AG T 

TCTGCCCAAA 224 

Qy 2 59 4 GATGAGTACATTCAGAATTACAATAACTCACT- 

CTCACTGGGTAACTTCTCATGATAGAT 2 652 

I I I I I I I I I I I I I I II I I II I I I I I 

Db 223 GATGAGCTTGTTCTGC 

AGCTCTTAGCACCTTGGGAATATTCAC 181 



Qy 2 653 TTGTATGATCAATACGGGTCTATTTTTATGTCAACTGAACA- 

CTGTAGGGTACCTTCCAG 2 711 

I I I I I I I I I I I I I I I I 

Db 180 TCTAGAC CAGAACAGCTCCAGG— 

TGCTCCC— 152 

Qy 2712 

TCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTTT 2 7 71 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 151 TCATTTTC-TGAGGCTTAAATTTAG AGT-ATAGAATAG 

CTAATT 110 

Qy 2 7 72 AAAAAAAGTGAATGGACTGAAATGTTAAATGTGAATGTACATTTCTTAAT — 

TGCAATTT 2 829 



Db 109 C AG T AAAC AT T C — CCATG 

TGAGCTCGCTCAAGTCCCAGTGCTGC T 66 

Qy 2 83 0 TTCTACTGAGTGTT-TGCACTATACTTT CTGGAATCTTATT- 

TAACAAAAATAAAGG 2884 



Db 65 

TCCAACATGGTGATCAGCTCTTCAAAATAGCCACGATCCTGATAGTAGCCAGTA G 1 1 

Qy 2885 GAAAAAATTG 2894 

I I I I I I 
Db 10 GCCAACAGTG 1 
Qy 17

AGGCCGGGCGCAGTGGCTCATGCCTGTAAGCCCAGCACTTTGGGAGGCGGAGGCGGGTGG 76 
I I I I I I I I I I I I I I I I I I I I I I 

Db 2 AGGCC-TGTACGGAAG 

TGTTACTTCTGCTCTAAAAGCTGCGGA A 44 



Qy 77 

ATCCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAACATGGTGAAACCCCGTCTCTAC 136 

III I I I I I III I I I I I I I I I I I I 

I 

Db 45 TTCC TCGAGCACTGTTGGCCT-ACTGGAGTGC- 

TGAAGTAGGCGCGGACGTGC 95 



Qy 13 7 TAAAAATACAAAAAAAAAATTAGCTGG- 

GCGTGTTGGCGGGAGCCTGTAGTCCCAGCTAC 195 

II I I I I I I I I I I I I I I I I I 

I 

Db 96 — CCGGTGC 

CTGGCGCGTGGTAGCAGGCGCCCGGTGCCCCGGC — C 137 



Qy 196 ACGGGAGGCTAAGGTG 

GGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGA 250

I I I I I I I I I I I I I III I I I I 

I 

Db 138 GGCGAAGACCATGGCGTTCATGGTGAA GACCATGGTGG-

GCGGCCA 182 

Qy 251 GCTGAGATTGCACCATTGCACT-

CCAGCCTGGGCGACAGAGCAAGACTCTCTCAAAAAAA 309 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 183 GCTGA AGAACCT 

CACTGGGAGCCTGGGAGGCGGCG 217 



Qy 310 

AAAAAAAAGAAAGAAAGAAATCAGAAAATCGACCACAGTGGTAGCCACCTGGCCTAATGC 369 

I I I I I III I I II I I I I II I II I I I I I 

I I 

Db 218 -AGGATAAGGGAGATGGGGACAAG TCGGCAGCCGAAG CTCAGGGCATGA- 

GC 267 

Qy 370 

TGTGTTTTTGTACCTGACAGGGGTCACTCATTTTAGGCACAACTCCTTCATTCTTTGTGA 42 9 

II I I I I I I I I I I II I I I I I I I I I 

I I 

Db 26 8 CG-GGAGGAGTAC GAGGAGTATC AGAAGCAACTCGTGGA 

AGAGA 310 



Qy 430 

AATTAGTGAGTTTCCTTCTACCCGTCACCAGATTCAATATGTTCTATTAATACACCGATA 489

I I I I II I I I I I 

I 

Db 311 AGATGGAG CGG 

GATGCACAGTTC 333 

Qy 490 

ACCACAGGGGAAGGGCACTTGTCGCTCTCCCACCTGGTTACCACAGTCTCCATGGGTCTT 5 49 

I I I I I I I I I I I I I I I I I I I I I I III II 

Db 33 4 A-CACAGAGGAAGG CAGAGCGGG CCACA— CTGC— 

GGAGCCA 3 71 



Qy 550 

TTGCCGTGACCACAAATAAAGGAAACACTCATCACTAGTATCTAAGTCGGGCTTTACAGT 6 09 
I I I I I I I I I I I I II I I I I I I III 

I I I I 

Db 3 72 CTTCCGAG ACAAATACCGG CTACCCAAGAA CGAG 

AC AG- 409 

Qy 610 

AACTATGCACCTTCTGTGTGCTTCACCTCACTCTCTACTTCAAACAGCCCATGGAG 665 

I I I I I II II III II I I I I I 

Db 410 ATGAGAGC — CAGATC 

CAGATGGCAGGTGGAGACGT 4 43 

Qy 666 GGAGGTATTATTATACTCCTTATGTTGACAGTGAAGAATCTGAGG 

CCCAGAGAGGT 721 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 444 GGAG CTGCCCCGGGAGCTGGCCAAGATG-ATC- 

GAGGAGGACACAGAGGAGG 493 

Qy 72 2 TGGGGACTTGAGTAAAGTCACACAG-CCCTGAGAGGCAGGACCAGGGTTCC 

ATT 774 

III I II I I IN II II I I I I I I I I I I I I 

Db 49 4 AGGAG GAGAAGGC-CTCAGTCCTTGGGCAGCTGG- 

CCAGCCTTCCTGGCTTGAA 545 

Qy 775 CCT — 

GCTCTATCCAGTTCCAAGCCCTTGTGTTTTCCATTATGTTTAGTGCCTCTTTGCT 832 

III I I I I I I I I I I I I I I I III II II III 

I 

Db 546 CCTGGGCTCACTCAAGGACAAGGCCCAGG CCACACTG GGGGATCT 

CA 592 

Qy 833 

AACAGCAACATCTGCAAGATTTGTGTTGGTTTTGATGGAGAACTCTAGCTCATCCACATG 892 

III I I I I I I III II I I I I I I I I I I I I I 

I 

Db 593 AGCA — ATCAGCTG — AGAAGTGTCACGTCAT GTGACCACT 

TCCCCGGG 637 

Qy 893 CTAGTGCCCAAGTGGTGGAGGGGCCACCTCAGCAGGTGGGTTCTGAATG CAGC— 

CA 947 

II I I I I I II I I I I I I I I I I I I I I I I I III 

I 

Db 63 8 GTTACCCACTG — GGCTGGGCCCC 

CATGAGGGCTAAGAGTGTGTCAACTTCC 687 



Qy 948 AGGCTGTCCC 

CGCAATGGGTGAGACTCGCTCCAACTGCCCGCCCTCAGAGCAGGTGC 10 0 4 

III I I I I I I I I III I I I I I I I I I I I I 

I 

Db 688 AGG — GACCCATACTCCATTTG-GGGCTTTGTTTCCCTTGCC — CCATC- 

CTAGTTC 73 8 

Qy 1005 CTAAGTCCT 

CCCTGGCACTGGCAGGCCTTACCTCACATTGCTAAATTAAAGCAA 105 8 

I I I I I I I II III I I I I I I III III 

Db 739 C-AAGACCTTTCCCATCCATGCCC 

CAAGCCTATCTTCTGGTTTC 781 



Qy 1059 

TGCAATTCCTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAGCAAGGA 1118 
I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 782 TTCCTCTCCGCTGGGAGTAAAGTCCCCATCTTCACT 

CTACCCTTCAGG— 829 

Qy 1119 AATAAAATGTTAGGCTTTAAAAATCCCTACTTTGTCATATCAGACTAT 

ATTCTA 1172 

I II I I I I II I I I I I I I 

I 

Db 830 ACCCTCCCCAC

CAGCTCAGCCTGTGGAGGCCTCCCA 865 

Qy 1173 AAACTAT ATTTGAGCGAAACCT 

GTCATTGCGTCTAATTTCAAATATACAGAA 1224

I I I I III II III I I I I I I I I I I I I 

I 

Db 866 AGATTGTAGGAATAGGCCCATCCCTCTCTGGCCATGGC- 

CCCAAGTTCCTGCACACAGGA 924

Qy 1225 

TCTCCTTAAGAGCTGTTGCCTTATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGA 1284
III I I I I I I I I I I III 

I 

Db 925 GCACCCACAGAG 

AGACACACACAGGACACAA A 956 



Qy 1285 AATGGTGGCACCTCCAGACACCCTGAAACTACACACCATTTCTTCCCTGCTCAG-

CTTCT 1343 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 957 ACCCCTGGCACGTTCAGAGA — CAGAAGCCACAGAC 

AC AT C C C GGC AC AG AC AG AC 1010 



Qy 1344 

GCTCAGGAGTTCTGTGAGCTATGGGAAGGCCATTGGTTGTATTTGCTACTTTTACTTTCA 1403
I I I I I I I I I I I I I I I I 

I I 

Db 1011 AC AC AC G AGGCCA 



GCTCCCTTGCGTGTC- 1038 



Qy 1404 

TCTTCCTCTGCTGTAGAGCCATTTAATGTTATTGTCATATGCTGCTGGTGAGGTAAAGGT 1463 
I I I I I I I III III 

Db 1039 -CAGCCCCTCCAG — ACACCA 

CCACT 1061 

Qy 1464 

GGGTCCGGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTT 1523 

II I II III I I I I I I I III II I I I I I I I 

Db 1062 CAGAAAC — TCTGAGAG AGAGC ATGGGC AG AC AC C C 

TCAGCAG A 1103 

Qy 152 4 CAGAGGGCTTATGATGGATGGTGAGAGATTTGACAACCACC AGAGCACATG — 

TGCT 1578 



Db 1104 CAGGAGGCCT 

GAGTTCCAGTCTCCACCTTTATTGTTCTTGAAAGCC 1149 



Qy 1579 CTGACCCTCTCCTGGGCATTGGTTC — 

CTGCTGGTACCGGGCGGTTCAGACCTTCAAATA 1636 

I I I I I I I I I I I I I III I III I II I 

Db 1150 CCTGCTCTCT-CTGAGCCTTATTTCATCATCTGTAAAATGGGAAT 

GTCCT 1198 



Qy 1637 

GGTTGCTTTCAAAAGAGCTTTCAGGCACTTATTGAGAATTAATGTTTAAACAGACATAAT 1696 

I II I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1199 GAATGACTTCTAAGGCTCTTTCTGGC TTGAACTGTC- 

AG 1236 



Qy 1697 

AGCCTAGATGAACTCCCAAGAGATCTATTAAATCTTGTGGGCTGAATAAATATCTCGTGC 1756 

I I I I I I I I I I I II III 

I I 

Db 1237 AGCCAAGCCCACATCCC TC 

CTTGGGC 1262 



Qy 175 7 AGGACTGTGCAACAGTAGC — 

CCAGAGCATCCTGCCTGTGGGCATCCACCTCCCAGGTGA 1814 

III I I I I I I II I I I I I I I I I I I III I I I I 

I I 

Db 1263 AGG GCAGCAGCTGCTGCCACAGCCTCCAGC GGCTGCCAC- 



Qy 1815 GGGCAGTGGGAAGCTGGCCCGA CGGCA 

GCCAGAACTTGTTTCTCA 1859 

I I I I I I I I I I I I I III I I I I I I I I 

Db 13 05 GGGCTCTGGG-AGCCGGAGCGATGCTGTGTGAGAGGCAGAGTGCCA- 

AGGATGAAGCT-G 1361 



Qy 186 0 CCTCCCACCAGCAACC— CCCCA- 

CCCAACTCTGGGCCCCAGGCACACGAAGCACAAGTC 1916 

II I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 1362 GCACTGAACAGTAAGCGGCTCCAGGCCTCCTCTGGGCCCAGGGCCC — 

AGCCAATTTC 1417 

Qy 1917 TCAGGGGACCATTCCCACATTGGGGGATC-CTGAGGGAGCCCATCAC- 

CGCCTCT 1969 

I I I I I I I II I I I I I III I I I I I 

I I I I I 

Db 1418 T GTTCTGTTCC — TGTAGAACGCTCTCT 

GGATTCCATAGCTGGAATCTCCTCT 1468



Qy 1970 

TGCATACAACTGTCCACTAGGAGGCACGCCCAGTGTGGGAGAGATGTATGGTCTTGCCTT 2029

III I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1469 — CTTAGCTCAGTGAAAAATAAAAATC-CCAAATG GTGTG — 

CCTACCTT 1513 

Qy 2030 CCACCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGGAACTGCATG

CCG 2084 

II I I I I I I II III I I I I I I I I I I I 

I I I 

Db 1514 CCCAC — TTCTTACTG GCTT-CCA 

GGAGTCTTGGAGTTCATAGCCCCCCG 1560



Qy 2085 TGAAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGAAATCACGCC

GCATGCAC 2140 

I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1561 AGCCTGC-CTTAAAGGGGT-GTCCTCC ACCCCCCACCTACAGCTT- 

CAC 1606 

Qy 2141 A— AAGGGACAGGCCCAGGCCCGACCTCAG-GTCATC 

CGCCCGCTGGCTGCAGAGC 2193 

I I I I I I I I I I III II III I I I I I I III 

Db 16 0 7 AGGAGGGGAGAGGGCAT — CCAGTGCTAGGAGTAGAAGTGTCTCCAGCT — 

CTG 1656 

Qy 219 4 ATCCCTGGGAG-CCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGTG TAGCTC- 

GAGCT 22 48 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1657 TTCTCTTGGGGCCCTGGGTGAAG 

GTGGGGTCTGGGGCTTATGAAATAGGTCTGGGCT 1713 

Qy 2249 TTGTGTA-GCTCGTGCA — CTTATTATGC ACCA CCTCC CTTCA- 

GT 2290 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1714 

TTGAGGAGGATGGAGCAGCCTCATTATGTGGGGAAGATGGGGCCTCTGGGGCGTCACTGA 1773

Qy 2291 CACCACT CC TCT — 

TCCTCCGCCATCCTCATTTATACTGATTGCACACC- 2337 

I I I I I II II I I I I I III I II I I I I I 

I I 

Db 1774 

GACCACAGGTGGGGCCGGGGCTGGACCGCAGCTGTCTTGGGTGCCTGTGCCTGCACCCCT 1833 

Qy 233 8 CCCCGCTCAAACAAC AATGTCCTTAT — 

TATGATGACCATCTCGTAGTGGTACATT 2 391 

I I I I I I II I I I I I I I I I I I I I I I I 

I 

Db 183 4 CCTCACCCTAGAGACG G AAG AT G T GC AAAAAG AAAG AAG 

GAAGGGCAAC — T 18 83 

Qy 2392 

CCATTCCTATTTAAGGTAAGCCCAAAGCCCACTTTTGGATTTTCTCGACTGTCCGAGAAA 2 451 
I I I I I I I I I I I I II I I I III I I I I I 

Db 1884 GCATTCC AGCCC CACACTGT— GAT 

GACTTT 1912 

Qy 2452 

AGTTGTGTAAGCGCCTGCGTTCTTCTGGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGG 2 511 

I I I I I I I I I I I I I I I I 

Db 1913 GAGCCT — GTCCTT 

TCCCTCCT 1932 

Qy 2512 

AATGGAGAGTGATGTGGGCAAGGGTGTCATTTTCTCGCACAATACAACTCACTGAGGATG 2 5 71 
I I I I I I I I I I I I I III 

Db 1933 TGAGCC TGTC — TTGCTTG 

TCCC 1953 



Qy 2572 

CTTCTGTAGAAGT GAGAAACAC GAT GAG T AC AT T C AG AAT T AC AAT AAC TCACTCTCACT 2631 

I I I I I I I I I I I I I I I I I I I I 

Db 1954 CTGTAAAA TGAACAGTCC 

CCCTCTCCCCCAAA 1985 

Qy 2632 

GGGTAACTTCTCATGATAGATTTGTATGATCAATACGGGTCTATTTTTATGTCAACTGAA 2 691 

I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 19 86 TAGTAATAATACATGTT 

TCAAAGGGTGACCATTTATAAAGCATATGA- 2 032 

Qy 2692 

CACTGTAGGGTACCTTCCAGTCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATA 2 751 

II I I I I I I I I I I I I I I I 

Db 2 033 C AAAC CAT AT C AA TAAAT 

GTAA 2054 

Qy 2752 

ATTTGTCCTATTTTTATTTTAAAAAAAGTGAATGGACTGAAATGTTAAATGTGAATGTAC 2 811 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2055 CTCATTCTTTTAAAAAAAAAAAAAAAAGGCC 

ACATGTGCTCGAGC 2 099 

Qy 2812 ATTTCTTAATTGCAATTTTTCTACTGAGTGTTTGCACTATAC 2853

I I I I III I I I I I I 

Db 2100 TGCAG-GTCGCGGC CGCTAGAC 2120
Qy 7 TCAGGAAATCAGGCCGGGCGCAGTGGCTCATGCCTGTAAGCCC

AGCACTTTG 58

II I I I I I I I II I I I I II I III 

I I I 

Db 2119 TCTAG 

CGGCCGCGACCTGCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTA 2065

Qy 59 GGAGGCGGAGGCGGGTGGATCCATTGA 

GGTCAGGAGCTCAAGACCAGCCTGGT 111 

II II II II I I I I I I I I I III II 

I I I I 

Db 2064 AAAGAATGA GTTACATTTATTGATATGGTTTGTCATATGCTTTATA-

AATGGT 2013

Qy 112 

CAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTG 171 
I I I I I I I I I I I I I I I I I II III 

I I 

Db 2012 CACCCT-TTGAAA

CATGTATTATTACTATTTGGGGGAGAGGGGGACTGTTCATTTTA 1957

Qy 172 

GCGGGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTGCTTGAACCCGG 231 

III I II I I I I I I I I II II II I I 

I 

Db 1956 CAGGGGACAAGCAAGACAGGCT-CAAGGAGGGAAAGGACAGGCTCAAAG — 

TCATCACAG 19 0 0 

Qy 232 GAGGCGGAGGTTGCAGTGAGCTGAGATTGCACC 

ATTGCACTCCAGCCTGGG 2 82 

I I I I I I I I I I I I II III I I I I I I I 

Db 1899 TGTGGGGCTG-GAATGCAGT 

TGCCCTTCCTTCTTTCTTTTTGCACATCTTCC 18 49 

Qy 283 

C G AC AG AGC AAG AC T C T C T C AAAAAAAAAAAAAAAG AAAG AAAG AAAT C AG AAAAT C G AC 3 42 
I I I I I I I I I I I III 

Db 1848 GTCTCT 

AGGGTGAGGAGGGGTGCAGG 1823 

Qy 343 

CACAGTGGTAGCCACCTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCATTT 4 02 

I I I I I I I I I I I I I I I I I I I I 

Db 1822 CACAG GCACCCAAGACAGCTGCGGT 

CCAGCCC 1791 

Qy 403 

TAGGCACAACTCCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTACCCGTCACCAGAT 462 
I I I I I I I I III I I I I III I I I I I I I I 

I 

Db 179 0 -CGGCCC— CACCTGTGGTCTCAGTGACGCCCCAGAGGCCCCATCTTCCC- 

CATAA 173 7 

Qy 463 

TCAATATGTTCTAT T AAT AC ACCGAT AACC AC AGGGGAAGGGC AC TTGTCGCTCTCCC— 520 

II I I I I I I I I I I I I I I II III 

Db 1736 TGAGGCTGCTC CATCCTCCTCAAAGCCCA 

GACCTATTTCATAAGCCCCA 16 8 8 



Qy 



521 - 



ACCTGGTTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAGGAAACACTC 5 79 

III I I I I I I I I I I I I II II I I I I III 

I I I I I 

Db 16 8 7 GACC CCACCTTCACCCAGGGCC CCAAGAGAACAGA- 

GCTGGAGACACT- 16 41 

Qy 580 

ATCACTAGTATCTAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTGTGCTTCACCTCA 639 
I I I I II I III I I I I I I 

I I I I 

Db 16 40 TCTA CT CCTAGCA CTGGATGC 

CCTCT 1615 

Qy 640 CTCTCTACT- 

TCAAACAGCCCATGGAGGGAGGTATTATTATACTCCTTATGTTGACAGTG 69 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1614 C-CCCTCCTGTGAAGCTGTAGGTGGGGGGTGG AGGACACCCCTTT 

AAGG 1567 

Qy 699 AAGAATCTGAGGCCCAGAGAGGTTGGGGACTTGAGTAAAGTCACACAGCCCTGAG- 

AGGC 75 7 

II I I I I I I I II I I I I I I I I I I I II III 

I I I 

Db 1566 CAGGCTCGGGGGGCTA-TGAACTCCAAGACTCCTG- 

G AAG C C AG T AAG AAG T G G G AAG G T 1509 

Qy 758 AGGACCAGGGTTCCATTCCTGCTCTATCCAGTTCCAAGCCCTT 

GTGTTTTCCAT 811 

I I I I I I I I I I I I I I III III I I I I I 

Db 150 8 AGGCACA 

CCATTTGGGATTTTTATTTTTCACTGAGCTAAGAGAGGAGATTCCAGC 1454 
Qy 812 

TATGTTTAGTGCCTCTTTGCTAACAGCAACATCTGCAAGATTTGTGTTGGTTTTGATGGA 8 71 

I I I I III I I I I I I I I I I I I I I III 

Db 1453 TATG GAATC CAGAGAGCG — 

TTCTACAGGAACAGAACAGAAATTG 1411 

Qy 8 72 GAACTCTAGCTCATCCACATGCTAGTGCCCAAGTGGTGG — 

AGGGGCCACCTCAGCAGGT 929 

III II I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1410 GCTGGGCC CTGG-GCCC-AGAGGAGGCCTGGAGCCGCTT — 

ACTGTT 1368 

Qy 93 0 GGGTTCTGAATGCAGCCAAGGCTGTC 

CCCGCAATGGGTGAGACTCGCTCCAACTG 984 

III I I I I I 1 I I II I I I I II I I I I I I I 

I I 

Db 136 7 CAGTGCCAGCTTCATCCTTGGCACTCTGCCTCTCACAC 

AGCATCGCTCCGGCT- 1315 

Qy 985 CCC— GCCCTCAG- 

AGCAGGTGCCTAAGTCCTCCCTGGCACTGGCAGGCCTTACCTCAC 10 4 0 

III I I I I I I I I I I I II I III I I I I I I I I 

Db 1314 CCCAGAGCCCACAGTGGCAGCCGCTGGA GGCTGTGGCAG 

CAGC— 1272 



Qy 



10 41 ATTGCTAAATTAAAGCAATGCAATTCCTCTTGGGTAAGAGGAATTCCTCCTT — 



CTTTAC 1098 



Db 12 71 — TGCT GCCCTGC CCAAGGAGGGATGTGGG 

CTTGGCT C 1236 

Qy 1099 TAACTGATCC — 

CCAGCAAGGAAATAAAATGTTAGGCTTTAAAAATCCCTACTTTGTCAT 1156 

I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1235 TGACAGTTCAAGCCAGAAAG AGCCTTAGAA 

GTCAT 12 01 

Qy 1157 

ATCAGACTATATTCTAAAACTATATTTGAGCGAAACCTGTCATTGCGTCTAATTTCAAAT 1216 
III I I I I I I I I I II II 



Db 12 0 0 TCAGGAC ATTC CCATTTT AC AG AT GAT G 

AAAT 1169 

Qy 1217 

ATACAGAATCTCCTTAAGAGCTGTTGCCTTATTTTTTTGTAAAGCCTCTCTGACATCAAA 12 76 

I I I I I I I I I I I I I I II III 

I I I 

Db 116 8 A AGGCTCAGAGAGAG C AG G G G C T T 

TCAAGAACAATAAA 1131 

Qy 12 77 TGGGGAGAAATGGTGGCACCTCCAGACACCCTGAAACTACACACCATTTCTTCCC- 

TGCT 1335 

I I I I I I I I I I I I I I I I I I I III I I I I I I I I 

I I I I 

Db 113 0 GGTGGAGAC TGGAAC — TCAGGCCTCCTG — 

TCTGCTGAGGGTGTCTGCCCATGCT 10 79 

Qy 1336 

CAGCTTCTGCTCAGGAGTTCTGTGAGCTATGGGAAGGCCATTGGTTGTATTTGCTACTTT 1395 

I I I I I I I I I I I I I I I I I I II I I I I I I III 

Db 10 78 C TCT-CTCAGAGTTTCTGAGTGGTGGTGTCTGG — AGGGGCTGGACACGC- 

AAGGG 1027 

Qy 1396 TACTTTCATC TTCCTCTG- 

CTGTAGAGCCATTTAATGTTATTGTCATATGCTGCTGG 1451 

II III I I I I I I I I I III III I I I I I I 

I I I 

Db 1026 AGCTGGCCTCGTGTGTGTCTGTCTGT 

GCCGGGATGTGTCTGTGGCTTCTGTCTCTGA 970 

Qy 1452 TGAGGTAAAGG TGGGTCC 

GGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGG 1504 

I III I I I I I I I I I I I I I I I I I I I I I 

Db 969 ACGTGCCAGGGGTTTTGTGTCCTGTGTGTGTC-TCTC 

TGTGGGTGCTC 923 

Qy 15 05 GCCGATTTCAGCAGGAGTTCAGAGGGCTTATGATGGATGGTGAGAGATTTGACAA- 

CCA 1562 

II I I I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 922 -CTGTGT GCAGGA — ACTTGGGGCC 

ATGGCCAGAGAGGGATGGGCCTATTCCT 8 73 



Qy 1563 

CCAGAGCACATGTGCTCTGACCCTCTCCTGGGCATTGGTTCCTGCTGGTACCGGGCGGTT 162 2 

II I I I I III II III II I I I I I I I I I I I 

I 

Db 8 72 ACA ATCTTGGG AGGCCTCCAC — AGGC — TG AGCTGGTG — 

GGGAGGGT 83 0 



Qy 1623 C AG AC C T T C AAAT AGG T T GC T T T C AAAAG AG — 

CTTTCAGGCACTTATTGAGAATTAATG 16 8 0 

I I I I I I I I I I I I I I I I III III 

I 

Db 829 C CTGAAGGGTAGAGTGAAGATGGGGACTTT ACTCCCAGCGGA- 

GAGG 78 4 



Qy 1681 

TTTAAACAGACATAATAGCCTAGATGAACTCCCAAGAGATCTATTAAATCTTGTGGGCTG 17 4 0 

I I I I II I I I I I I I I I I I I I I I I I I I I 

I 

Db 7 83 AAG AAAC C - AG AAG AT AGGC T TGGGG CATG- 

GATGGGAAAGGTCTTG G 73 8 

Qy 1741 AATAAATATCTCGTGCAGGAC-TGTGCAACAGTAGCCCA GAGCAT 

CCTGCC 1790 

II III III I I I I I I I I I I I I I I I 

Db 73 7 AA 

CTAGGATGGGGCAAGGGAAACAAAGCCCCAAATGGAGTATGGGTCCCTGGA 685 



Qy 1791 TGTGGGCATCCACCT — CCCAGGTGAGGG 

CAGTGGGAAGCTGGCCCGACGGCA 18 41 

I I I I I III III I I I I I I I I I I I I I I I I I I 

I 

Db 684 AGTTGACACACTCTTAGCCCTCATGGGGGCCCAGCCCAGTGGGTAAC CCCGG- 



Qy 18 42 G CCAGAACTTGT TTCTCACCT- 

CCCACCAGCAACCCCCCACCCAACTCTGGG 1892 

I II I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 628 

GTGGTCACATGACGTGACACTTCTCAGCTGATTGCTTGAGATCCCCCAGTGTGGCCTGGG 569 

Qy 1893 CCCCAGGCACACGA AGCACAAG-TCT CAGGG GACCA — 

TTCCCACATTG 1938 

II I I II I I I I I I I I I I I I I I I I I I I I I 

Db 568 — 

CCTTGTCCTTGAGTGAGCCCAGGTTCAAGCCAGGAAGGCTGGCCAGCTGCCCA 516 

Qy 1939 GGGGATCCTGAGGGAGCCCATCACCGCCTCTTGCATACAAC — 

TGTCCACTAGGAGGCAC 1996 

III I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 515 -AGGA — CTGAGG CCTTCTCCTCCTCCT CCTCTGTGTCCTC 

CTC 475 



Qy 1997 

GCCCAGTGTGGGAGAGATGTATGGTCTTGCCTTCCACCTGTAAAAACTGCACATATGCAA 2 056 
III I I I I I I I I I I I I I I I 

Db 474 GATCA 

TCTTGGCCAGCTCCCGGGGCAGCTCCACGTCT 438 



Qy 2057 

GCCATTTGCACTCTGGAACTGCATGCCGTGAAAACTCCTAATGGTGTGGAACTTAGTTTG 2116 
III III I I I I I I I I I I II I I I I I I I I I I 

I I I I 

Db 43 7 -CCACCTGCCATCTGGATCTGGCTCTCAT-CTGTCTCGTTCTTGGGTAG- 

CCGGTATTTG 381 

Qy 2117 

AATTTGAAATCACGCCGCATGCACAAAGGGACAGGCCCAGGCCCGACCTCAGGTCATCCG 2176 
I I I I I I I I I I III I I I I I I I I I I 

I I 

Db 380 TCTCGGAAGTGGCTCCGC AGTG TGGCCCG — CTCTG 

CCT 344 

Qy 2177 CCCGCTGGCTG — CAGAGCATCCCTGGGAGCCA-AGGCGAGGCC-CGTGGAGC — 

CTGA- 2229 

I I I I I II I I I I I I I I I I III I I I I I I II 

I I I I 

Db 3 43 TCCTCTGTGTGAACTGTGCATCCC 

GCTCCATCTTCTCTTCCACGAGTTGCTTCTGAT 287 

Qy 2230 

GCTTTGTGTAGCTCGAGCTTTGTGTAGCTCGTGCACTTATTATGCACCACCTCCCTTC-A 2 2 8 8 

II I I I I II I I I I I I I I I I II I I I I I 

Db 2 86 ACTC CTCGTACTCCTCCCGGCTCATGCCCTGA 

GCTTCGGCTGCCGACTT 238 

Qy 22 89 GTCACCACTCCTCTTCCTCCGCCATCCTCATTTATACTGATTGCACACCCCCC — 

GCTCA 2346 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 23 7 GTCCCCA TCTCCCT TATCCTC GC- 

CGCCTCCCAGGCTC- 201 

Qy 2347 

AACAACAATGTCCTTATTATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAAG 2 4 06 
I I I I I I I I I I I I I I II I I I I 

I 

Db 200 CCAGTGAGGTTCTTCAGCTGGCCGCC CA— 

CCAT G 16 8 

Qy 2407 

GTAAGCCCAAAGCCCACTTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGCC 2 466 

II I I I I II II I I I I I I I I I I I 

I I I I I 

Db 16 7 GTCTTCACCATGAACGCCATGG TCTTCGCCGGCCGGGGCACCG 

GGCGCC 119 

Qy 2467 

TGCGTTCTTCTGGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGT 2 52 6 
I I I I I I I I III II 

Db 118 TGC-TACCAC-GCGCCAGGC 

ACC 9 8 

Qy 2527 

GGGCAAGGGTGTCATTTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGA 2 5 86 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 9 7 GGGCACG TCCGCGCCTACTTCAGCACTCC-AGTAGGC 

CAACAGTG- 5 4 



Qy 2587 

GAAACACGATGAGTACATTCAGAATTACAATAACTCACTCTCACTGGGTAACTTCTCATG 2 6 46 
I I I I I I I I III 

Db 53 CTCGAGGAATTC CGCAG 

CTTTTAGAG 2 8 

Qy 2647 ATAGATTTGTATGATCAATACGGGTCTA 2674

III I II I I I I I I I I 

Db 27 C-AGAAGTAACACTTCCGTACAGGCCTA 1
Qy 84

GAGGTCAGGAGCTCAAGACCAGCCTGGTCAACATGGTGAAACCCCGTCTCTACTAAAAAT 143 



Db 2198 GCGACCTGCAGCTCGAGCACA

TGTGGCCTTTTTTTTTTTTTTTTT 2154 

Qy 144 ACAAAAAAAAAATTAGCTGGGCGTGTTGGCGGGAGCCTGTAGTCCC- 

AGCTACACGGGAG 202



Db 2153 TT 

TTTTTTATGGTGCCAATTTTAAATAGTTTTATTTAAG 2115 

Qy 203 

GCTAAGGTGGGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGCA 2 62 
I I I I I I I II II I I I I I I 

I I I I 



Db 2114 AC ATTGCATTTTCC- 

ACTTACAATACAGTGTTTATAAAGTGCA 2 073 



Qy 263 C— 

CATTGCACTCCAGCCTGGGCGACAGAGCAAGACTCTCTCAAAAAAAAAAAAAAAGA 319 

I I I I III I I I I I I I I I I I I II I 

I I I 

Db 2 0 72 ATGTTATTTCCTTCC — CCTGTGCATATGTTCCATATTC 

AAG T AT T GAGA 2 025 

Qy 320 

AAGAAAGAAATCAGAAAATCGACCACAGTGGTAGCCACCTGGCCTAATGCTGTGTTTTTG 3 79 
II I I I I I I III I I I I II I 

I I I I I 

Db 2 02 4 ATGCC CAGTAACTT-ACTA TAGCAGCTTAAC 

TTTTTA 1989 



Qy 380 TACCTG — ACAGGGGTCACTC — 

ATTTTAGGCACAACTCCTTCATTCTTTGTGAAATTAG 435 

I I I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 19 8 8 AAAC TGCC AC AGAAT T TGC TACGAAT T T AGG TCCTTCA- 

AATGTTTTAAATG 1938 



Qy 436 TGAGTTTCCTT- 

CTACCCGTCACCAGATT C AAT AT G T T C T AT T AAT AC AC C G AT AAC C AC 49 4 

III I I I I I III I I I I III I I I I I III 

I 

Db 193 7 TGTGGAACAATGCTA — CATC TACACTTGGTTGGCTTAATCAACC- 

TCTTC 1890 



Qy 495 AGGGGAAGGGCACT— 

TGTCGCTCTCCCACCTGGTTACCACAGTCTCCATGGGTCTTTTG 552 
I II I I I I I I I III III II 
Db 18 89 AATGG-TGGGCCCTGAGGAAGCACCACCAGAGGG- 
AGGAGCT 1850 



Qy 553 

CCGTGACCACAAATAAAGGAAACACTCATCACTAGTATCTAAGTCGGGCTTTACAGTAAC 612 
II I I I II I I I I I I I I I I I I I 

Db 18 49 CC ACCAC CAGGAAATCCCC 

CAGGCATT 1823 



Qy 613 

TATGCACCTTCTGTGTGCTTCACCTCACTCTCTACTTCAAACAGCCCATGGAGGGAGGTA 6 72 
I I I I I I I I I I I I I I I I I 

Db 1822 CCTCCTG GCAT-GCCTC 

CTGCAC 1801 



Qy 673 

TTATTATACTCCTTATGTTGACAGTGAAGAATCTGAGGCCCAGAGAGGTTGGGGACTTGA 732 
I I I I I I I I I I I I I I I I I I I 

Db 1800 -TCTGGTACAGCTT GGTGATGA 

TGGGGTTGCAAACTT — 1765 



Qy 733 GTAAAGTCACACAGCCCTG — AGAGGCAGGACCAGGGTTCCATTCCTGCT 

CTATCCA 78 7 

I I I I I I I I I I I I II III I I I I I I I 

Db 176 4 TCTC-CAGCTCTTTCTGTTGATGTTCAA — ATTC — 

TTCCTTCTCAGCAGTCTG 1716 



Qy 788 GTTC CAAGCCC- 

TTGTGTTTTCCATTATGTTTAGTGCCTCTTTGCTAACAGCAACA 8 42 

III I I I I I I III I I I I I I I I I I I I I I I I 

Db 1715 ATTCTTATCAAGCCAGTTGATAATTTCATTA CACTTGTC 

CAGAATCT 1669 

Qy 8 43 TCTGCAAGATTTGT GTTGGTTTTG — 

ATGGAGAACTCTAGCTCATCCACATGC 893 

I I I I I I I I I III I I I I I I I I I I I I I I I II 

Db 166 8 TCTG TTTGTCCTCATCGTTAATCTTGCCTTGAAG TTTCTCATCTTCA- 

AC 1620 

Qy 89 4 TAGTGCCCAAGTGGTGGAGGGGCCACCTCAG CAGGTGGGTTC- 

TGAATGCAGCCAAG 9 49 

III I I I I I I I II I I I I I I I I I I I I I I II 

Db 1619 AGTTGCTTTCATGTTGAAGG 

C AT AGG AC T CAAG T GAAT T C T T GG AT G AC AC CT — 1567 

Qy 950 GCTGTCCCCGCAATGGGTGAGACTCG-CTCCA 

ACTGCCCGCCCTCAGAGCAGG 1001 

I I I I I I I II III I I I I I I I I I III I 

I I 

Db 1566 — TGTCCCT-C TGCTTCTCATCTTCAGCTTTGTACTTCTCAGCTTC 

CTGG 1520 

Qy 10 02 TGCCTAAGT 

CCTCCCTGGCACTGGCAGGCCTTACCTCACATTGCTAAATTAAAG 1055 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1519 ACCATACGTTCAATGTCTTCCTTGCTCAAACGGCCCTTGTC 

ATTAGTGATAG 146 8 

Qy 1056 

CAATGCAATTCCTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAGCAA 1115 

I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1467 TAAT-CTTGTTCTCTT TTCC-CGTACT 

CTTGTCCACAGCAG 142 9 

Qy 1116 

GGAAATAAAATGTTAGGCTTTAAAAATCCCTACTTTGTCATATCAGACTATATTCTAA — 1173 

I I I I III III I I I I I I I I I I I I I I I I I 

Db 142 8 AG AC AT T GAG TAT AC — C AT TGG 

CATCAATGTCAAAAGTGACTTCAATCTGAGG 1377 

Qy 1174 

AACTATATTTGAGCGAAACCTGTCATTGCGTCTAATTTCAAATATACAGAATCTCCTTAA 1233 

I I I I I I I I I II I I I I I I I I I I I I I II 

Db 13 76 AAC-ACCTCGGGGTGCAGGAGGT — ATGCCTGTGAGTTCAAACTTG 

CCAAGC 132 8 

Qy 1234 

GAGCTGTTGCCTTATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGAAATGGTGGC 12 93 
I I I I I I I I I I I I I I I I I 

I 

Db 1327 AGGTTGTTATC 

CTTTGTCATGGCACGCT C 1299 



Qy 129 4 ACCTCCAGACACCCTGAAACTACACACCATTTCTTCCCTG- 

CTCAGCTTCTGCTCAGG 1350 

I I I I I I I I I I I I I I I I I I I I III 

Db 129 8 GC C T T C AT A- AAC C T G AAT AAGC AC ACC AGG 

CTGGTTGTCAGAATAGGTAGTGA 12 46 

Qy 1351 AGTTCTGTG— AGCTATGGGAAGGCCATTGGTTGTATTT- 

GCTACTTTTACTTTCATCTT 1407 

I I I I I I I I III I I I I I I I I I I I I I I I I 
Db 1245 AGGTCTGTGTCTGCTT — GGTAGG — 
AATGGTGGTATTACGCT 1207 



Qy 140 8 CCTCTGCTGTAGAGCCATTTAATGTTATTGTCAT- 

ATGCTGCTGGTGAGGTAAAGGTGGG 1466 

I I I I II II I I I I I 

Db 12 06 TGATGAGGA — CA 

G T C AT G AC T C C AC C AG 118 0 



Qy 1467 

TCCGGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTTCAG 152 6 

I I I I I I I I I I I I I I I I I I I II II I I I I I 

I I 

Db 1179 — CAGTTTCAATACCAAGGGAAAGAGGA-GT GACATC— CAAGAG- 

CA- 1138 



Qy 1527 

AGGGCTTATGATGGATGGTGAGAGATTTGACAACCACCAGAGCACATGTGCTCTGACCCT 1586 
II II III I I I I I I I I I I I I I I I I 

Db 113 7 GCAAATCTTGAACATTCTCAGACTTGT CTCCAG 

ACAAGATGGCTG 1093 



Qy 1587 

CTCCTGGGCATTGGTTCCTGCTGGTACCGGGCGGTTCAGACCTTCAAATAGGTTGCTTTC 16 46 
I I I I I I I I I I I III I III I I I I I I 

Db 1092 — CCTGGACAG CTGC ACCATAAGCAACAG — 

CTTCATCAGGGTTGAT 1050 



Qy 1647 

AAAAGAGCTTTCAGGCACTTATTGAGAATTAATGTTTAAACAGACATAATAGCCTAGATG 17 06 

I I I I I I I I III I I I I I I 

I 

Db 10 49 GCTCTTATT CAGTTCTTTTC- 

CATTGAAG 1022 



Qy 1707 

AACTCCCAAGAGATCTATTAAATCTTGTGGGCTGAATAAATATCTCGTGCAGGACTGTGC 1766 
I I I I III I I I I I I I I I I I I I I I I I 

I 

Db 1021 AAGTCTTGGAGAAGCTTCTGAATCTTG-GGGAT 

AC GAG TAG AAC C 9 78 



Qy 176 7 AACAGTAGCCCAGAGC ATCCTGC — 

CTGTGGGCATCCACCTCCCAGGTGAGGGCAGT 1821 

III I I I I I I I I I I I I I I I III I II I I I I 

Db 9 77 ACCA AC C AG G AC AAT AT CAT G AAT C T G T G ACTTGTCTAGT- 

TTGGCATC 93 0 



Qy 1822 GGGAA — 



GCTGGCCCGACGGCAGCCAGAACTTGTTTCTCACCTCCCACCAGCAACCCCCC 18 79 
III III I I I I I I I I I II II I 

Db 929 TCGAAGGGCTTTCTCTACTGGGTCCAGG 

GTGCCACGGAACAGGTCAGCATTCAGTT 874 



Qy 1880 

ACCCAACTCTGGGCCCCAGGCACACGAAGCACAAGTCTCAGGGGACCATTCCCACATTGG 1939 
I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 8 73 CTTCAAATC-GGGCAC — GGGTAATGGAG GTAT- 

AGAAGTCGATTCCTTCAT 826 



Qy 1940 

GGGATCCTGAGGGAGCCCATCACCGCCTCTTGCATACAACTGTCCACTAGGAGGCACGCC 1999 
I I I I I I I I I I I I I I I I I II 

Db 825 AG AG AG AAT C GAT C T C AATACTGGCC 

TGGGTGC 793 



Qy 2000 

CAGTGTGGGAGAGATGTATGGTCTTGCCTTCCACCTGTAAAAACTGCACATATGCAAGCC 2 059 

I I I I I I I I II III I I I I I I I I I I I III I 

I 

Db 792 TGGAAGAGA GGGTACGCTTAGCACGTTCACAAGCAGTACGGAGGC — 

GTC 745 



Qy 2060 ATT- 

TGCACTCTGGAACTGCATGCCGTGAAAACTCCTAATGGTGTGGAACTTAGTTTGAA 2118 
I I I I I I I I II II II I I I I I I 

Db 744 TTACAGCTCTCTTGTTCTCACTGATGT CCTTCTTAT 

GCTTGCG 702 



Qy 2119 TTTGAAATCACGCCGCATGCACAAAGGGACAGGCC 

CAGGCCCGACCTCAGGTCATCC 2175 

I I I I I I I III I I I I I III II 

I I I 

Db 701 CTTAAACTCA GCA ATAAAATGGTTGACCATTCGG- 

TTGTCAAAATCTTCTCC 651 



Qy 2176 GCCCGC-TGGCTGCAGAGCATCC — 

CTGGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCT 2232 

III III II I II I I I I I I II II II I 

I 

Db 650 ACCCAAGTGG GTGTCTCCAGCT-GTAGACTTGAC CTCAAAGATTC- 



Qy 2233 

TTGTGTAGCTCGAGCTTTGTGTAGCTCGTGCACTTATTATGCACCACCTCCCTTCAGTCA 2 2 92 
I I I I I I I II III I I I I II I I I I I I I I I 

I I I I 

Db 604 TC CTCAA TAG T G AGG AT T G -AC AC AT C AAAAG TGCCACCTCCC- 

AGGTCA 55 7 

Qy 2293 C CACTCCTCTTCCTCCGCCATCCTCATT TATACTGATTGCA — 

CACCC 2338 

III II I I I I I I I I I I I II II II III 

I 

Db 556 

AAGATGAGCACGTTTCTTTCTGCTCCAACCTTTTTGTCTAAGCCGTAAGCAATAGCAGCA 49 7 



Qy 2339 CCC 

GCTCAAACAACAATGTCCTTATTATGATGACCATCTCGTAGTGGTACATTC— 2392 



I 1 1 1 1 1 I III II 1 1 1 1 1 1 1 1 I I I 1 1 I I 

Db 496 GCAGTTGGCTCATTAATAATTCTAAGTACATTGA-GACCAGC 

AATAGTTCCAGCAT 4 42 

Qy 2393 CATT CC TATTTAAGGTAA GCCCAAAGCCCACT — 

TTTGG 2429 

III II I I I I I I I I I III I I I I I 

I I I I 

Db 441 

CTTTGGTAGCCTGACGCTGAGAGTCATTAAAGTAAGCTGGCACTGTGACCACAGCATTGG 3 82 
Qy 2430 

ATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGCCTGCGTTCTTCTGGGTTTGGCT-A 2 4 8 8 

I III I I I I III I II I I I I I I I I I I I I 

I 

Db 381 TAACAGTCTTCCCAAGGTAGGCT TCTGCAATTTCCTTCATC 

TTTGTCAGA 332 

Qy 2 489 GATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGGGTGTCATTT — 

TCT 2546 

III I I I I I I I I I I I I I I I 

I I I 

Db 331 ACCATAGAAGACACCTCCTCTGGATAGAAG 

CTTTTGGTCT 292 

Qy 2547 CGCAC AATACAACT 

CACTGAGGATGCTTCTG T AG AAG T G AG AAAC AC GAT GAG 2 599 

III I I I I I I I III III III II I I I I 

Db 291 CTCCCTTGTATTCTACTTGGACCTTGGGCCTGC — CAGCATCATT 

CACCAC 2 43 

Qy 2600 T AC AT T C AG AAT T AC AAT AAC TCA-CTCT C AC T GGG T AAC T T C - 

TCATGATAGATTTGTA 2657 

III II I I I I III II I I I I I III I I I I I I I I 

I I 

Db 2 42 — CATAAAGGG — 

CCAATGTTTCATATCAGACTGGACAACAGCATCATCAAATCTGCGTC 18 7 

Qy 2658 TGATCAATACGGGTCTATTTTTATGTCAACTGAACACT 

GTAGGGTACCTTCCAG 2 711 

I I I I III I I I I I I I I I I I I I I I I I I I I I 

Db 186 CAATCAG-ACG TTTGGCATC- 

AAAAACTGTGTTGGTGGGGTTCATTGC— 141 

Qy 2712 

TCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTTT 2 7 71 

I I I I I I I I I I I II II II II II I 

I 

Db 140 AACTTGATTCTTTGCGGCA — TCACCGATCAA CC — 

GTTCAGTGTC 99 

Qy 2 7 72 AAAAAAAGTGA-ATGGACTGAAAT- 

GTTAAATGTGAATGTACATTTCTTAATTGCAATTT 2 829 

I I I I I I I I I I I I I I I I II II III I 

I I I I I I 

Db 9 8 CGTAAAGGCGACATAGCTTGGAGTGGTTCGGTTTCCCTGATCAT 

TGGCAATTA 46 



Qy 2 83 0 T-TCTACTGAGTGTTTGCACTATACTTTCTGGAATCTTATTTAACAAAAATAAAGGG 

2885 



Db 45 TCTCGACT- 
Qy 16

CAGGCCGGGCGCAGTGGCTCATGCCTGTAAGCCCAGCACTTTGGGAGGCGGAGGCGGGTG 7 5 
I I I I I III II III I I I I I I I II 

Db 1 CCTACTGGTC-TTGTG TGGGTGTTT-TCCAGCAC 

GGAAAAGTCGAGAT 46 

Qy 76 GAT— 

CCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAACATGGTGAAACCCCGTCTC 133 



Db 4 7 AATTGCCAATGA — TCAGGGAAACCGAACCA CTCCAAG 

CTATGTCGC 91 

Qy 13 4 TACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGG- 

CGGGAGCCTGTAGTCCCAGC 192 

III III I I I I I I I I I I I I II 

Db 92 — CTTTACGGACA CTGAACG-GTTGATCGG 

TGATGCCGCAAA 13 0 

Qy 193 

TACACGGGAGGCTAAGGTGGGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGC 2 52 
I I I I I I I I I I I I I I I I I 

Db 131 GAATCAAGTTGCAA 

T G AAC C C C AC C AAC AC AG x T T T T GAT G 171 



DB 1; Length 2198;
845; Indels 843; 



Qy 



2 53 TGAGATTGCACCATTGCACTCCAGCCTGGGCGACAGAGC — 



AAGACTCTCTCAAAAAAAA 310 

III II II I I I I I 

Db 172 CCA AACGTCTGATTGGACG- 

CAGATTTGATGATGCTGTTGT 211 



Qy 311 

AAAAAAAG AAAG AAAG AAAT C AG AAAAT C G AC C AC AG T GG T AGC C AC C T G G C C T AAT G C T 3 7 0 
II I I I I I I I I I I I I I I I I II 

I 

Db 212 CCAGTCTGATATGAAACATTGGCCCTTTATGGTGG TG- 

AATGAT 254 



Qy 371 

GTGTTTTTGTACCTGACAGGGGTCACTCATTTTAGGCACAACTCCTTCATTCTTTGTGAA 43 0 
I I I I I I I I III I I I I I I 

I 

Db 2 55 G CTGGCAGG CCCA AGGTCCAA 

GTAGA 280 



Qy 431 AT — 

TAG T GAG TTTCCTTC T AC C C G T C AC C AG AT T C AAT ATGT TC T AT T AAT AC AC C GAT 4 8 8 

II I I I I I II I I I I I I I I I I I I I I I I 

I I 

Db 281 ATACAAGGGAGAGACCAAAAGCTTCTATCCAGAGGAGGTGTCTTCTATGGTT- 

CTGA- 336 

Qy 489 

AACCACAGGGGAAGGGCACTTGTCGCTCTCCCACCTGGTTACCACAGTCTCCATGGGTCT 54 8 

I I I I II II I III I I I I I I I I I I I I I III 

Db 33 7 CAAAGATGAA-GGAAATTG- 

C AG AAG CCTACCTTGG G AAG AC T G T T AC C A 384 



Qy 549 

TTTGCCGTGACCACAAATAAAGGAAACACTCATCACTAGTATCTAAGTCGGGCTTTACAG 6 0 8 

I I I I I I I I I I II II I I I I I III I I I I I I 

Db 3 85 -ATGCTGTGGTCACAGTGCCAGCTTACTTTAATGAC TCTCAGCGTCAGGC- 

TACC- 43 7 



Qy 609 

TAACTATGCACCTTCTGTGTGCTTCACCTCACTCTCTACTTCAAACAGCCCATGGAGGGA 66 8 
II I I I I I I I I I I I III I I I I I I II 

Db 43 8 -AAAGATGCTGGAACTAT-TGCTG 

GTCTCAATGTACTTAGAATTATTAATG 486 



Qy 669 

GGTATTATTATACTCCTTATGTTGACAGTGAAGAATCTGAGGCCCAGAGAGGTTGGGGAC 72 8 
I I I I I I I I I I I I I I I I I I I I I 

Db 48 7 AGCCAACTGCTGCTGCTATTGCTTACGG CTTAG — 

ACAAAAAGGTTGGAGCA 536 



Qy 72 9 T T GAG T AAAG T C AC AC AGC 

CCTGAGAGGCAGGACCAGGGTTCCATTCCTGCTCTA 783 

I I I I I I I I I I I I I I I I I I I 

I 

Db 53 7 GAAAGAAACGT-GCTCATCTTTGACCTGGGAGGTGG- 

CACTTTTGATGTG 58 4 



Qy 78 4 TCCAGTTCCAAGCCCTTG 

TGTTTTCCATTATGTTTAGTGCCTCTTTGCTAACAGCA 839 

III III I III II II I I I I I I I I I III 



Db 585 TCAA — TCCTCACTATTGAGGATGGAATCTTTGAGGTCAAGT-CTAC — AGCT 

GGAG 636 



Qy 8 40 ACATCTGCAAGATTTGTGTTGGTTTTGATGGAGAACTCTAGCTCATCCA 

CATGCT- 89 4 

I I I I I I III I I I I I I I II I I I I I I I 

I I I I 

Db 63 7 ACACCCACTTGGGTGGAGAAGATTTTGA 

CAACCGAATGGTCAACCATTTTATTGCTG 693 

Qy 895 AGT GCCCAAGTGGTGGAGGGGCCACCTCAGCAGGTGGGTTC- 

TGAATGCAGCCAAG 9 49 

III I I I I I I II I I I I I I I I I I I I II III 

I I I 

Db 69 4 AGTTTAAGCGCAAGCATAAGAAGG AC AT C A G T G AG AAC AAG AG AG C T G - 

TAAG 745 

Qy 950 

GCTGTCCCCGCAATGGGTGAGACTCGCTCCAACTGCCCGCCCTCAGAGCAGGTGCCTAAG 10 09 

I I I I I I I I I I I I I II III II I I I I I 
Db 746 AC-GCCTCCGTACTGCTTGTGA-ACGTGCTAA — 
GCGTACCCTC 785 

Qy 1010 TCCTCCCTGGCACTGGCAGGCCTTACCTCACATTGCTAAATTAAAGCAATGCAATTC- 

I I I I I I I I I I I I I I I III I I I I I I I I 

I I 

Db 786 TCTTCC— AGCACC — CAGGCC AGTATTGAGAT- 

CGATTCTCT 823 

Qy 1069 

CTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAGCAAGGAAATAAAATGT 112 8 

II I I I I I I I I I I I I I I I I I III II 

Db 82 4 CTATG AAGGAAT — 

CGACTTCTATACCTCCATTACCCGTGCCC 86 4 

Qy 1129 

TAGGCTTTAAAAATCCCTACTTTGTCATATCAGACTATATTCTAAAACTATATTTGAGCG 118 8 
I I I I I I I I I I I I I I 

Db 865 GATTTGAAGA 

ACTGAATGCT 884 

Qy 1189 AAACCTGT — 

CATTGCGTCTAATTTCAAATATACAGAATCTCCTTAAGAGCTGTTGCCTT 12 46 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 85 -GACCTGTTCCGTGGCACCCTGGACCCAGTAG — AGAAAGCCCTTCG 

AGATGCCAA 93 7 

Qy 1247 

ATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGAAATGGTGGCACCTCCAGACACC 13 06 



Db 93 8 A 

CTAGACAAGTCACAGATTCATGATATTGTCCTGGTTGGTGGTTCTACTCGTATCC 993 
Qy 1307 

CTGAAACTACACACCATTTCTTCCCTGCTCAGCTTCTGCTCAGGAGTTCTGTGAGCTATG 1366 
I II III II I I I I I I I I I I I I I I I 

I I I 

Db 99 4 C CAAGATT CAG — AAGCTTCT — CCAAGACTTCT — 



TCAATG 1029 



Qy 1367 

GGAAGGCCATTGGTTGTATTTGCTACTTTTACTTTCATCTTCCTCTGCTGTAGAGCCATT 142 6 

I I I I I I I I I I I I I I I I I I I I I I 

Db 103 0 GAAAAGAAC TGAATAAG AGCATCAACC- 

CTGATGAAGC 1066 

Qy 142 7 TAATGTTATTGTCATATGCTGCTGGTGAGGTAAAGGTGGGTCCGGG-TGCCTTC- 

CCA 1482 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 106 7 TGTTG-CTTATGGTGCAGCT 

GTCCAGGCAGCCATCTTGTCT 1106 

Qy 1483 GG 

GGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTTCAGAGGGCTTATGA 153 7 

II II I I I I I I I I I I I I I I I I III 

Db 1107 G G AG AC AAG T C T G AG AAT G T T C AA GATTT 

GCTGCTCT 1143 



Qy 1538 

TGGATGGTGAGAGATTTGACAACCACCAGAGCACATGTGCTCTGACCCTCTCCTGGGCAT 159 7 
I I I I I I III I I I I I I I I I 

I I 

Db 1144 TGGATG TCAC TCCTCTTTCCCTT — 

GGTAT 1171 



Qy 159 8 TGGTTCCTGCTGGTACCGGGCGGTTCA-GAC 

CTTCAAATAGGTTGCTTTCAAAAGAG 1653 

II I I I I I I I I III I I I I I I 

Db 1172 T-GAAACTGCTGGT GGAG — 

TCATGACTGTCCTCATCAAGCGTAATACCACCATTC 1224 



Qy 1654 

CTTTCAGGCACTTATTGAGAATTAATGTTTAAACAGACATAATAGCCTAGATGAACTCCC 1713 

II Mill I II III II I I I I I I I I I I I I 
Db 1225 CTACCAAGCAGACACAGACCTTCACTACCTATTCTGACA-ACCAGCCT- 

GGTGTGCTTAT 1282 



Qy 1714 AAGAGATCTATTAAATCTTGTGGGC — 

T G AAT AAAT AT C T C G T GC AGG AC T G T GC AAC AG 1771 

I I I I I I I I I I I I I III II II I II 
Db 12 83 TC-AGGTTTATGAAGGCGAGCGTGCCATGACAAAGGATAACAACCTG — 

CTTGGCAAGTT 1339 



Qy 1772 TAG— 

CCCAGAGCATCCTGCCTGTGGGCATCCACCTCCCAGGTGAGGGCAGTGGGAAGC 182 8 
I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 13 4 0 TGAACTCACAG-GCATACCTCCTG CACC-CCGAGGTGT— 

TCCTCAGATTGA 13 8 7 



Qy 1829 TGGCCC GACG — GCAGCCAGAACTTGT-TTCTCACC- 

TCCCACCAGCAACCCCCCA 1880 

III III I III I I I I I I I I I III II 

I 

Db 13 8 8 AGTCACTTTTGACATTGATGCC 

AATGGTATACTCAATGTCTCTGCTGTGG A 1438 



Qy 1881 CCCAACTCTGGGCCCCAGGCACACGA 

AGCACAAGTCTCAGGGGACCATTCCCACA 1935 

I II III I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1439 C AAG AG T AC G G G AAAAG AG AAC AAG AT T AC TAT C AC T AAT G AC AAG G G - 

CCGTTTGAGCA 149 7 

Qy 1936 

TTGGGGGATCCTGAGGGAGCCCATCACCGCCTCTTGCATACAACTGTCCACTAGGAGGCA 1995 

I I I I I I I I I I I III I I I I III I 

I I 

Db 1498 

AGG AAG AC AT T G AAC GTATGGTC C AGG AAGC TGAGAAGT AC AAAGC - 15 43 



Qy 1996 CGCCCAGTGTGGGAGAGATGT — 

ATGGTCTTGCCTTCCACCTGTAAAAACTGCACATATG 2 053 

II I I I I I I I I I I I II II 

Db 1544 T G AAG ATGAGAAGC AGAGGGAC AAGG TGTCATCC- 

AAGAATTCACTT 1589 



Qy 2 054 CAAGCCATTTGCACTCTGGAAC- 

TGCATGCCGTGAAAACTCCTAATGGTGTGGAACTT — 2110 

I I I I I I I II I I I I I I I I 

Db 159 0 -GAGTCCTATGCCTTC AACATGAAAGC- 

AAC T G T T G AAG AT G AG AAAC T T C A 1639 



Qy 2111 

AGTTTGAATTTGAAATCACGCCGCATGCACAAAGGGACAGGCCCAGGCCCGACCT 2165 

I I I I I III II II III I I I I I I 

I 

Db 16 40 AGGCAAGATTAACGATGAGGACAAACAG AAGATTCTGGACAAG- 

TGTAAT 16 8 8 



Qy 2166 

CAGGTCATCCGCCCGCTGGCTGCAGAGCATCCCTGGGAGCCAAGGCGAGGCCCGTGGAGC 2 2 2 5 
I I I I I I I I I II I I I I I III II I 

Db 16 89 GAAATTATCAACTGGCT T GAT AAG AAT C AGACT — 

GCTGAGAAGGAAGAAT 173 7 



Qy 2226 

CTGAGCTTTGTGTAGCTCGAGCTTTGTGTAGCTCGTGCACTTATTATGCACCACCTCCCT 2 2 85 
I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 173 8 T T G AAC AT C - AAC AGAAAGAGC TGGAGAAAG TTTGCAAC- 

CCCA 1779 



Qy 2286 

TCAGTCACCACTCCTCTTCCTCCGCCATCCTCATTTATACTGATTGCACACCCCCCGCTC 23 45 
I I I I I I I I I I I I I I I III I I I I 

Db 178 0 TCA-TCACCAAG-CTGTACCAGAG 

TGCAGGAGGCATGCC 1816 



Qy 2346 

AAACAACAATGTCCTTATTATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAA 2 4 05 

I I I I I I I I I I II I I I I I I I I I I I I I 

Db 1817 -AGGAGGAATG-CCT GGGGGATTTCCTGGTGGTGGAGC 

TCCT 1856 



Qy 2406 

GGTAAGCCCAAAGCCCACTTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGC 2 465 



I I I I I Ml 

Db 1857 

CCCTCTGGTGG 1867 

Qy 2466 

CTGCGTTCTTCTGGGTTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATG 2 52 5 

I I I I I I I I I I I I I I I I I I I I III 

I I I I 

Db 186 8 -TGC-TTCCTCAGGG CCCACCATTGAA — 

GAGGTTGAT- 19 01 

Qy 2526 

TGGGCAAGGGTGTCATTTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTG 2 5 85 

III I I I I I I I I I I I I I I I 

I I 

Db 19 02 TAAGC CAACCAAGTGTAGA 

TGTAGCATTG 193 0 

Qy 2586 

AGAAACACGATGAGTACATTCAGAATTACAATAACTCACTCTCACTGGGTAACTTCTCAT 2 6 45 

I I I I II I I I I I III II I I I I I I I I I I I 

I 

Db 1931 TTCCACACATTTAAAACATT-TGAAGGACCTAAATTCGT AGCAAATTCT- 

GT 1980 

Qy 2646 

GATAGATTTGTATGATCAATACGGGTCTATTTTTATGTCAACTGAACACTGTAGGGTACC 2 7 05 

I I I I I I I III II I I I I I I I I I I I I II I 

I 

Db 19 81 GGCAGTTTTAAAAAGTTAA-GCTG — CTA-TAGTAAGT-TACTGGGCA 

TTC- 2026 

Qy 2706 TTCCAGTCTTTTTCAAGATTGTTAAATTGAGACAAGTAATTGAA — 

TAATTTGTCCTATT 2763 

III III I I I I I I II I I I I I III I III II 

Db 2 02 7 -TCAATACTTG AATATGGAACATATGC- 

AC AGGGGAAGGAAAT AAC AT TGC AC T 2 0 78 

Qy 2764 

TTTATTTTAAAAAAAGTGAATGGACTGAAATGTTAAATGTGAATGTACATTTCTTAATTG 2 823 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 0 79 TTATAAAC ACTGTATTGTAAGTGGAAAATGCA- 

ATGTCTTAAATA 2122 

Qy 2824 

CAATTTTTCTACTGAGTGTTTGCACTATACTTTCTGGAATCTTATTTAACAAAAATAAAG 2 8 83 

II I I I I I I I I I I I I I I II I I I I I I I I 

I I I 

Db 2123 AAA CTATTTAAAATTGGCACCATA 

AAAAAAAAAAAAAAAAAAAAAAAA 2170 

Qy 2884 GGAAAAAATTGCTTGA-CT 2901 

II II I I I I I I I I 

Db 2171 GGCCACATGTGCTCGAGCT 2189 
Qy 2 TGAAATCAGGAAATCAGGCCGGGCGCAGTGGCTCATGC-CTGTAAGCC — 

CAGCACTTTG 58 



Db 12 TTACTTCTGCTCTAAAAGCT — 

GCGGAATTCCTCGAGCACTGTTGGCCTACTGGAATGCG 69 

Qy 59 GGAGGCGGAGGCGGGTGGATCCATTGAGGTCAGGAGC 

AGCCTGGTCAA 114 

I I I I I I I Mill II I I I I I I 

Db 7 0 AGCTGAGCAGACAG— 

GGCTGCAAGGAAATCTGGCGCGGTTCAATACCTCGTCTAGCC — 12 5 
Qy 115 

CATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGCG 17 4 



Db 126 TGGGTTCCAGTATCTAATTTTTTT TTTGTTTTAAC 

TGAC- 16 4 

Qy 175 

GGAGCCTGTAGTCCCAGCTACACGGGAGGCTAAGGTGGGAGAATTGCTTGAACCCGGGAG 23 4 



Db 165 — AAACTCATTTCTCTAC- 

CAGGAT 191 

Qy 235 

GCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTGGGCGACAGAGCAAG 2 9 4 



Db 192 GCTG TGCTG GCTGGAA — GTTCCATT — TCTAC- 

AGCAAG 226 



TCAAGACC- 

I I I I I I I I I I I I 



Qy 295 

AC T C T C T C AAAAAAAAAAAAAAAG AAAG AAAG AAAT C AG AAAAT C G AC C AC AG T G G T AG C 35 4 

I I I I I I I I I I I I I I I I I I I 

I 

Db 22 7 AATC-CT ATCTGGAAA 

CACAGAAGTTGT 2 53 

Qy 355 CACCTGGCCTAA — TGCT-GTGTTTTTGTACCTGACAGGGGTCACT- 

CATTTTAGGCACA 410 

I I I I I I I I I I I I I I I I III II I I I I I I I 

I I 

Db 254 CCTCTAGCCACAGCAGCTCGAACTTTTTTGATTGTC GTTGCTGCTTTCT 

CCCA 3 06 

Qy 411 

ACTCCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTACCCGTCACCAGATTCAATATG 4 7 0 

I I I I I I I I I I I I I I I I I I I I 

Db 307 TCACCCCCATCC CCTTTTGAC 

AAAGATCCAA 33 7 

Qy 471 TTCTATTAATACAC CGATAACCACAGGGGAAGGGC-ACTTGTC- 

GCTCTCCCACCTG 525 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 33 8 — CTGTAAAAAGTCTTACG-TAAC AGTTCAGGACTACT — 

TCGGTTCTTTTACTGG 388 

Qy 526 GTTACCACAGTCTCCATGGGTCTTTTGCCGTGACCACAAATAAAG 

GAAACACT 578 

I I I I I I II I I I I I I I I II III I I I I I 

Db 389 

GTAAGCACTTTCAATTTTTTTTTTTTAACTAAAAGCCATTTTAAAATTGAATCTGTTGAG 4 4 8 

Qy 579 CATCACTAGTATCTAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTG 

TGCTT 632 

I I I I I I I I I I I III I I I I II I I I I I I 

I I 

Db 4 49 GGGCTTGACTAAAATCTTTTAAGTAATTTG- 

TGTAATGGAATACTGTCAGTGGATTTTTT 50 7 

Qy 633 

CACCTCACTCTCTACTTCAAACAGCCCATGGAGGGAGGTATTATTATACTCCTTATGTTG 692 

I I I I I I I I I I I I I I I I I I 

I I I I 

Db 508 CGTCTCATT TCTGCACGTG CTCCTT- 

TGTTC 537 

Qy 693 AC AG T G AAG AAT C T GAGGCCCAGA — 

GAGGTTGGGGACTTGAGTAAAGTCAC 7 42 

III I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 53 8 TCAGAACAGAAGCTTTTTATACACATCCCATAACGCAGCTGGAGA GAGTTATG- 

AAG 593 

Qy 7 43 ACAGCCCTGAGAGGCAGGACCAGGGTTCCATTCCTGCTCTATCCAGTTCCA 

AGCC 79 7 

III I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 59 4 TCAGTTATTATAAGGAACACAAAGGTTGCTTTCC- 

ATTCTTTGCCTTTAGATAATTAATT 652 



Qy 79 8 CTTGTGTTTTCCATTATGTTTAGTGCCTCTTTGCTAACAGCA- 

ACATCTGCAAGATTTG- 855 

I I I I I I I I I I III I I I I I I I I I I I I I I I 

Db 653 TTTTTGTTTTC TTAAAATGGAG — 

TAT T T AAAGAAGGAAGAAAT TCACAAGAAATAA 707 

Qy 856 — TGTTGGTTTTGATGGAGAACTCT — AGCTCAT- 

CCACATGCTAGTGCCCAAGTGGTGG 910 

I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I 

Db 70 8 ACTGTTGGAGAATTTAGAAAAGTTTGAAGTTTTTACCACCTTTTCTATCTCTAGT- 

TTTG 766 

Qy 911 

AGGGGCCACCTCAGCAGGTGGGTTCTGAATGCAGCCAAGGCTGTCCCCGCAATGGG T 96 7 

I I I I I I III II I I I I I I II II I I I I 

Db 76 7 TGTGGCCA AACACTTG TGCCGCCTGGG 

GCGGTGGGGGTA 8 05 

Qy 96 8 GAGACTCGCTCCAACTGCCCGCCCTCAGAGCAGGTGCCTAAGTCCTCCCTGG- 

CACTGGC 1026 

I I I I II III I I I I I I I I I I I I I I I I I 

I I I 

Db 8 06 GAGGCAAGCATAGACAG AG AG G A 

ACTAAGCCAGACATGGACAAAGGC 852 

Qy 102 7 A — GGCC-TTACCTCACATTGCTAAATTAAAGCAATGCAATT — CCT-CT- 

TGGGTAAG- 10 78 

I III III I I I I I I II I I I I II I I I I I I 

I I 

Db 853 AC GAG C C AAAAC C AG AC AG T C C T 

GGCCGTTCGATAGGCCAGCAGTGGGTGAGA 905 

Qy 10 79 -AGGAATTCCTCCTTCTTTACTAACTGATCC CCAGCAAGGAAAT- 

AAAATGTT 1129 

III I I I I I I I II I I I I I I I I I I 

I I I 

Db 9 06 CAGG— 

TCAGCCAGCTGCAGGGGCGGGAGCGGGGGGGCGGGTAGGGACATGGATGTGTG 962 

Qy 113 0 AG — GCTTTAAAAATCCCTACTTTGTCATATC- 

AGACTATATTCTAAAACTATATTTGAG 1186 

II III III II III I I I I I I I I I I I I I I I 

Db 963 AGGTGCT CATGC 

GTGCGATCGTGACTAGATTCTGAAACT 1001 

Qy 1187 

CGAAACCTGTCATTGCGTCTAATTTCAAATATACAGAATCTCCTTAAGAGCTGTTGCCTT 12 46 

I I I I I I I I I I III I I I I II 

Db 10 02 GCCAGCCAT TTTC CCAG CTCC 

GC— 1024 

Qy 1247 

ATTTTTTTGTAAAGCCTCTCTGACATCAAATGGGGAGAAATGGTGGCACCTCCAGACACC 13 06 

I I I I I I I I I I I I I I I I I III I 

I 

Db 1025 TTTG TGCAATCTAAAGG AATG CATC 

CCCT 1053 



Qy 1307 



CTGAAACTACACACCATTTCTTCCCTGCTCAGCTTCTGCTCAGGAGTTCTGTGAGCTATG 1366 
I I I I I I I I I I I I I III II I 

I 

Db 1054 CTGAAGCAG TCTTGCCAG AGC CT 

AG T GAG G 10 83 



Qy 1367 GGAAGGCCATTGGTTGTATTTGCTACTTTTACTTTCATCTTCCTCTGC- 

TGTAGAGCCAT 142 5 

III I I I I I I I I I I 

I I 

Db 1084 GAGAGAAGTATGGT 

AAATACCACAACATATGGAATCA- 112 0 



Qy 1426 

TTAATGTTATTGTCATATGCTGCTGGTGAGGTAAAGGTGGGTCCGGGTGCCTTCCCAGGG 14 85 
I II I I I I I I I I I I I I I I I III 

Db 1121 GAAAATACCG GGAACTGGAGTGGGCAAGGGG 

GAATGC 1157 



Qy 1486 

GTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTTCAGAGGGCTTATGATGGATGGT 15 45 

I I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 1158 AGAGGGTGT GGAAATTTTTTA GTGATCTGG AATG — 

TGTT 1195 

Qy 1546 GAGAGATTT G AC AAC C AC C AG AGC AC AT GTGCTCTGACC — 

CTCTCCTGGGCATTGGTTC 16 03 

I I I I I II I I I I I I I I II III I I I I I I I I I I 

I 

Db 1196 GAGTGACAG G AAG T GC C C C A- AGC T T C T CCCCCACCAACTCTTCT 

CAGTCGCGC 12 48 

Qy 1604 

CTGCTGGTACCGGGCGGTTCAGACCTTCAAATAGGTTGCTTTCAAAAGAGCTTTCAGGCA 1663 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 1249 CTGCTTTTGTCTAACTCTTGTAATCTACACACT-ACTGCTTACAA 

AGCTGTCTG 1301 



Qy 166 4 C T T AT T G AG AAT T AAT G T T T AAAC AGAC AT AA — 

T AGC C TAG AT G AAC T C C C AAG AG AT C 1721 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 13 02 AGTTTA AGACAAAAAGAAACCTAAAAG-TCTCCTT 

ACT 1338 



Qy 1722 

TATTAAATCTTGTGGGCTGAATAAATATCTCGTGCAGGACTGTGCAACAGTAGCCCAGAG 17 81 
III I I I I I II I I I I I I I I I I I 

Db 1339 TCATAGTACCTGTGATATG 

GAAGGAATGTAAAAGCATGACC 13 79 



Qy 1782 

CATCCTGCCTGTGGGCATCCACCTCCCAGGTGAGGGCAGTGGGAAGCTGGCCCGACGGCA 18 41 
II I I I I I I I I I I I I I I I I III I I I I I 

Db 13 8 0 C T T T AAGCC TATGGACAT TTTTTCAGGTAT — 

ACAGGAGAAAGAT 142 2 

Qy 1842 

GCCAGAACTTGTTTCTCACCTCCCACCAGCAACCCCCCACCCAACTCTGGGCCCCAGGCA 19 01 
I I I I I I I I I I I I II 



Db 1423 
AGAAT 1451 



- — AAAATAATTT- 



-TCCACAGAAAAATG- 



Qy 19 02 CACGAAGCACAAGTCTCAGGGGACCATTCCCACATT — 

GGGGGATCCTGAGGGAGCCCAT 1959 



Db 1452 TATGAATTATATAGTTCAGGTTCCAAATCTAATTTTTAAAAGAATTCTGA- 

TT 1503 



Qy 1960 

CACCGCCTCTTGCATACAACTGTCCACTAGGAGGCACGCCCAGTGTGGGAGAGATGTATG 2 019 
I I I I I I I I I I I I I I I II I I I I I I I 

Db 150 4 CTGCTACACTT TACAAATG — CTTAGGTTGGTT — 

CCTAATTTGAAGGAG 15 49 

Qy 2020 

GTCTTGCCTTCCACCTGTAAAAACTGCACATATGCAAGCCATTTGCACTCTGGAACTGCA 2 0 79 
I I I I I I II I I I I I I I I I I I I I I I 

Db 155 0 ACTTGTTTTATTTGGTTA- 

ATGCATTGCATTTGAACT 1585 

Qy 2080 

TGCCGTGAAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGAAATCACGCCGCATGCA 2139 



Db 1586 TGT TTCTATT 

TTCTTTGCATAAATTTGG A 1614 

Qy 2140 

CAAAGGGACAGGCCCAGGCCCGACCTCAGGTCATCCGCCCGCTGGCTGCAGAGCATCCCT 2199 



Db 1615 CTTTGGG — AGAAAAATGC AAAGTAAT AAGTAGAATGCA- 

CTT 1654 

Qy 22 0 0 GGGAGCCAAGGCGAGG CCCGTGGAGCCTGAGCTTTGTGTAG 

CTCGAGCTT 2249 



Db 1655 GGGGGAAAAAAGGAGGATTTTTCCCTT CATGGG — 

TTGGAAAGTATTTTAAAGGGT 170 8 

Qy 2250 TGTGTAGCTCGTGCACTTATTATGCAC-CACCTCCCTTCAGTCACCACTCCT 

CT 2302 



Db 1709 TGTTT TCTTG-AAAAAACAAGCTCTCTCTTACTTTCTG- 

CATCTATGCTATAAAGAT 1763 

Qy 23 03 TCCTCCGCCATCCTCATTTATACTGATTGCACAC- 

CCCCCGCTCAAACAACAATGTCCTT 2361 



Db 176 4 AACTATGCTATAATAAATGTAA — 

G G T AG AAAAC T T T AAAG AG AAAAT AAC AG T G T T C T A 1821 

Qy 2362 

ATTATGATGACCATCTCGTAGTGGTACATTCCATTCCTATTTAAGGTAAGCCCAAAGCCC 2 421 

I I I I I I I I I I I I I I I II II 



I 

Db 1822 A— GTGAAAAGCTACT TAGCATTTTCCCAAACT- 

CACACATTATCA 186 4 



Qy 2 42 2 ACTTTTGGATTTTCTCGACTGTCCGAGAAAAG- 

TTGTGTAAGCGCCTGCGTTC 2473 



Db 1865 AC 

AG AAAC AC T AAAAAT T AAG AG AAAAGCGGCC AAC T AAT T AAAT AGC T AGCC T T A 192 0 
Qy 2474 TTCTGGG 

TTTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGC 2 53 0 



Db 1921 TTTTGGGGGTATGGGATAGAAAATTAAGTGT 

GAATAAAATGATACTTGGGA 19 71 

Qy 2531 

AAGGGTGTCATTTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAA 2 59 0 



Db 1972 ATG TTTCTCCCTC-GTAC CACAAAGGATGTT 

AGTGGTCAG 2010 

Qy 2591 C AC GAT GAG T AC AT T C AG AAT T AC AAT AAC T C ACTCTCA— CTGGGTA-AC 

TT 2640 



Db 2011 C-CTACGAGTTAATCCTTCCTAGC- 

ATGGCTCTGAGCCTTCATGCCGAGCAGACGTTATT 2068 

Qy 26 41 C T C AT GAT AG AT T T G T AT GAT C AAT AC GGG T C T ■ 

ATTTTTATGTCAACT 2688 



Db 2 069 CACATGA- 

CGATTCGAAAAGTCCATTCATATATCTCGCTACCTGGATTTGAATAGAAACC 212 7 
Qy 26 89 GAACA 

CTGTAGGGTACCTTCCAGTCTTTTTCAAGATTGTTAAATTGAGACAAGT 2 7 42 



Db 212 8 AGACAGCAATTCTTTA — GTTCCAGCCA- 

CCATTCGCCCCACTGGACAATAGCGATTTGT 2184 

Qy 2 7 43 AATTGAATAATTTGTCCTATTTTTATTTTAAAAAAAGTGAATGGA- 

CTGAAATGTTAAAT 2801 



Db 2185 — TAGCACAG — AGTCACA GGCTGTGGGACACAAAGCTTGGAGCTGCA — 

GAAAGAT 2 235 

Qy 2 8 02 GTG AATGTAC ATTTCT 

TAATTGCAATTTTTCTACTGAGTGTTTGCA 2 847 



Db 2 236 GGGGGATTCAGAGAACAGGAAATTACAGGCTCGATGCA 

CAGCT 22 85 



II III 
-CTCCTGG 



2 8 4 8 CTATACTTTCTGGAATCTTATTTAACAAAAATAAAGGGAAAAAATTGCTTGA-CT 

II II I I II I I I I I I I I I I II I I I I I I I I 

2 2 86 C T G AG AAT AC AAAA AAAAAAAAAAAAAAAAAGGCCACATGTGCTCGAGCT 



RESULT 15 

US-10-478-914-60/C 

; Sequence 60, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 60 

LENGTH: 2022 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-478-914-60 

Query Match 28.3%; Score 4867; DB 1; Length 2022; 

Best Local Similarity 41.7%; Pred. No. 2.5e-298; 
Matches 1211; Conservative 0; Mismatches 756; Indels 940; 
Gaps 147; 

Qy 8 CAGGAAATCAGGCCGGGCGCA- 

GTGGCTCATGCCTGTAAGCCCAGCACTTTGGGAGGCGG 6 6 



Db 2 022 CTG CAGCTCGAGCACATGTG 

GCCTTTTTTTTTTTTTTTTTAGATGGAGT 1974 

Qy 6 7 AGGCGGGTGGATCCATTGAGGTCAGGAGCTCAAGACCAGCCTGGTCAAC- 

ATGGTGAAAC 125 



Db 1973 T TTGCTCTTTT 

GCCCAGGCTGGAGTGCAATGGCGCGAT 1936 

Qy 126 

CCCGTCTCTACTAAAAATACAAAAAAAAAATTAGCTGGGCGTGTTGGCGGGAGCCTGTAG 185 



Db 1935 CTCGGCTC-ACTGCAACTTCCGCCTC- 

GTTCAAGCTATTCTCCTG 1889 



Qy 186 TCCCAGCTACACGGGAGGCTAAGGT- 

GGGAGAATTGCTTGAACCCGG 231 

I I I I I I II I I I I I I I 



I III I I I I I I I I I II 



I 

Db 1888 

CCTCAGCCTCCTGAGTAGCTGAGGTGAGAGGACTGACGCGAGAGGATTGCTTGAGCCTAG 182 9 



Qy 232 

GAGGCGGAGGTTGCAGTGAGCTGAGATTGCACCATTGCACTCCAGCCTGGGCGACAGAGC 2 91 
I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I 

I I I 

Db 1828 

GAGGTTGAGGTTGCAGTGAGCTGTGTTCACGCCACTGGACTCCAGGCTGGGTGACAAAGC 1769 



Qy 292 AAGACTC — 

T C T C AAAAAAAAAAAAAAAG AAAG AAAG AAAT C AG AAAAT C G AC C AC AG T G 3 49 

I I I I I I I I I I I I I I I I I I I I III III III III I I I I I I 

Db 1768 

AAG AC C C T G T C T C AAAAAC AAAAAC AAAC AAAC AAAC AAAC AAAAAAAT AAC C AAC AAAA 17 09 
Qy 350 

GTAGCCACCTGGCCTAATGCTGTGTTTTTGTACCTGACAGGGGTCACTCATTTTAGGCAC 4 09 



Db 170 8 AAAAC — CCTGAGTTTAAGTTTGATTTTAAGACCTG- 

TAGGGAC 166 8 



Qy 410 AACTCCTTCATTCTTTGTGAAATTAGTGAGTTTCCTTCTACCCGTCA- 

CCAGATTCAATA 46 8 

II II I I I I I I I I I I I I I I I I I I I 

Db 166 7 AGCAGATTAGTTCT 

ACTTTAATGGGTATGCATCATCTGGAAAGCAAGGGTCA 1616 



Qy 469 

TGTTCTATTAATACACCGATAACCACAGGGGAAGGGCACTTGTCGCTCTCCCACCTGGTT 52 8 

I I I I III III III I I I I I I I 

I 

Db 1615 TAAAAAGAAGAT ACA TCTT CACTGGCACC- 

ATC 1584 



Qy 529 

AC C AC AG TCTCCATGGGTCTTTTGCCGT G ACC AC AAAT AAAGG AAAC AC T C AT C AC TAG T 5 8 8 
II I I I I I II III I I I I I I III 

Db 1583 ACATCAGTCATCA G T C C T TAG AT AC AG — 

AAC 1554 



Qy 589 

ATCTAAGTCGGGCTTTACAGTAACTATGCACCTTCTGTGTGCTTCACCTCACTCTCTACT 6 4 8 

I I I I I I I I I I I I I I I I I II 

I 

Db 1553 CTTTACTG ACTCCCTC — AAAACTTC 

CATGCTTC- 1522 

Qy 649 

TCAAACAGCCCATGGAGGGAGGTATTATTATACTCCTTATGTTGACAGTGAAGAATCTGA 7 0 8 
I I I I I I I I I I I I I I I I III 

Db 1521 -CAAACTAACCAAGCAG TTCTTCT 

ACA 1496 



Qy 709 

GGCCCAGAGAGGTTGGGGACTTGAGTAAAGTCACACAGCCCTGAGAGGCAGGACCAGGGT 76 8 
I I I I I I III I I I I 

I I 



Db 1495 TTGGGATC ATCGAATTTCCCT TCA- 

TGT 1469 

Qy 769 

TCCATTCCTGCTCTATCCAGTTCCAAGCCCTTGTGTTTTCCATTATGTTTAGTGCCTCTT 82 8 

I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 146 8 TCCAACTCTCCTCTAGC T T C AAT AT AC AC AT AG AC AAC C T T A GCC- 

ATC 1421 

Qy 829 TGCTAACAGCAACATCTGCAAGATTTGTGT- 

TGGTTTTGATGGAGAACTCTAGCTCATCC 8 87 

II III I I I I I I I I I I I I I I I I II I I I I I I 

I 

Db 142 0 TGAAAAC — 

CTACTTTTAGAATATTCCCGAGTGGTTCCACGTAAGTCCTCTGCCAGATTC 1363 
Qy 888 

ACATGCTAGTGCCCAAGTGGTGGAGGGGCCACCTCAGCAGGTGGGTTCTGAATGCAGCCA 9 4 7 

III I I I I I I I I I I I I 

I I I I 

Db 1362 ACA TACCCATAAATT CTACTTCA 

TAAGCCC 1333 

Qy 9 48 AGGCTGTCCCCGCAATGGGTG- 

AGACTCGCTCCAACTGCCCGCCCTCAGAGCAGGTGCCT 10 06 

I I I I I I III III III I I I I I I 

I 

Db 1332 

ATACTGTTCT T AAAAAAG AAAT AG AAAAAAT AT ATC AAT AT AAC T T T AAAAT AAG AC AC A 12 73 

Qy 1007 

AAGTCCTCCCTGGCACTGGCAGGCCTTACCTCACATTGCTAAATTAAAGCAATGCAATTC 1066 

II I I I I I I II II I I I I I I I II II I I I I I 

I 

Db 1272 AA — CCTTCATGAAAAT-ATAG TCATCAAATT — TATTTT 

CATTGGGATGC 122 7 

Qy 1067 

CTCTTGGGTAAGAGGAATTCCTCCTTCTTTACTAACTGATCCCCAGCAAGGAAATAAAAT 112 6 
I II I I I II I I I I I I II I II I I I I I I I 

Db 1226 CATTT TTTGAAGAATTCCTAAGACTAATGTTTCT — 

TGACATGCAAG A 1181 

Qy 1127 

G T T AGGC T T T AAAAAT C C C T AC T T T G T C AT ATC AGAC T AT AT TC T AAAAC T AT AT T T GAG 1186 

I I I I I I I I I I I I I I I I I I I I I I I II II I 



Db 1180 GTTA-GCATTAATAG — CTTACGTT ACTATA- 

AATACTGCTGCTTGGAAG 1135 

Qy 1187 CGAAAC — 

CTGTCATTGCGTCTAATTTCAAATATACAGAATCTCCTTAAGAGCTGTTGCC 12 4 4 

I II I I I I III I I I I I I I I I I I I I I I I I 

I 

Db 113 4 CAGTACAACTGTTTTAGAG T T T T AAGAC T AC AG AC TTTCATTA — 

CTCAAATC 1084 



Qy 1245 TTATT TTTTTGTAAAGCCTC- 

TCTGACATCAAATGGGGAGAAATGGTGGC- 1293 



1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 III III 1 1 1 1 1 1 1 

1 1 

Db 10 83 TTATTCAGTAAATGTAAA-AATCAGAAGGTTCTG— AACAGCTGGTTAGGAA- 

GGTAGCC 102 8 

Qy 129 4 -ACCTCCAGACACCCTGAAACTAC-ACACCATTTCTT- 

CCCTGCTCAGCTTCTGCTCAGG 1350 

I I I I I I II III I I I I I I I I I I I I I I II 

I I I 

Db 102 7 AAGATGCAGGAAAGATG — TCTGCGCCTCCTTTTCAAGGGCAGC- 

CAACTCTTGAACAGT 9 71 

Qy 1351 AGTTCTGTGAGCTATGGGAAGGCCA- 

TTGGTTGTATTTGCTACTTTTACTTTCATCTTCC 1409 



Db 9 70 AG GTGCCA— AAATATCCACATGGCCTTTATAGCTTC 

CACCTGCGTCACAC 922 

Qy 1410 

TCTGCTGTAGAGCCATTTAATGTTATTGTCATATGCTGCTGGTGAGGTAAAGGTGGGTCC 1469 
I I III I I I I I I I I I I I I I I I III III 

Db 921 ACAAAT AACAAAGTAATGTAAATGACA TGCT — 

TGAAATAA 8 83 

Qy 1470 

GGGTGCCTTCCCAGGGGTTAGAGGATGTTCAAAGGGCCGATTTCAGCAGGAGTTCAGAGG 152 9 
I I I I I I II I I I I I I I I I 

Db 8 82 CTTGTCA 

GCAATACTTTGGCTCAATTATATCA 851 

Qy 1530 

GCTTATGATGGATGGTGAGAGATTTGACAACCACCAGAGCACATGTGCTCTGACCCTCTC 1589 
III I I I I I I I I I I I I I I 

I I 

Db 850 -CTT TAAACTTTGCTTGCAGCAATA — AAATGT 

ACTA 817 



Qy 1590 

CTGGGCATTGGTTCCTGCTGGTACCGGGCGGTTCAGACCTTCAAATAGGTTGCTTTCAAA 16 49 

I I I I I I I II I I I I I I I I I I I I I 



Db 816 CTGGGCAAATTATC TCCAGAACTT 

TGACTTTAAAA 782 

Qy 1650 

AGAGCTTTCAGGCACTTATTGAGAATTAATGTTTAAACAGACATAATAGCCTAGATGAAC 17 09 
I I I I I I I I I I I I I I I I I 

Db 781 AATG — TTAAATCAATTATT 

TCAATATA 756 

Qy 1710 

TCCCAAGAGATCTATTAAATCTTGTGGGCTGAATAAATATCTCGTGCAGGACTGTGCAAC 1769 
I I I I I I I I I I I III III I 

Db 755 AAT AAAT AAT - T AGGC C T C AC T AAC AG — 

CTGT 726 

Qy 1770 

AGTAGCCCAGAGCATCCTGCCTGTGGGCATCCACCTCCCAGGTGAGGGCAGTGGGAAGCT 182 9 
I I I I III I I I I I I I 



I 

Db 72 5 CCCAACACCTC TAC- 

CCACCCT T 704 



Qy 183 0 GGCCCGACGGCAGCCAG 

AACTTGTTTCTCACCTCCCACCAGCAACCCCCCACCCAAC 18 86 

I I I I I I I I I I III I I I I I I I I I I I 

I I 

Db 703 

GGCTCGGAAACAACTAGTTAGGCCTGAGTCTTACAATTTTTCAGGAACAAGCTAACAAA- 6 45 



Qy 1887 TCTGGGC — 

CCCAGGCACACGAAGCACAAGTCTCAGGGGACCATTCCCACATTGGGGGAT 19 4 4 
I I I I I I I I I I I I I I I I I I III 

Db 6 44 — AGGGCATCCCAGGCAGAGGAAATAATATGCTGA 

ATTACCA 6 05 



Qy 1945 

CCTGAGGGAGCCCATCACCGCCTCTTGCATACAACTGTCCACTAGGAGGCACGCCCAGTG 2 0 0 4 

I I I I I I I III III II II I I I I I I III 

Db 604 CCTGAGTTA AGTCAAAGGCTTCTGATTCCT — TGTTCAAAA 

TCACCTA 559 



Qy 2 0 05 TGGGAGAGATGTATGGTCTTGCCTTCCACCTGTAAAAACTGCAC- 

ATATGCAAGCCATTT 2 063 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 558 TTATGCTCTTG-TTTCC 

TAATAATTCCACAAAATGTTCAACT 518 

Qy 2064 

GCACTCTGGAACTGCATGCCGTGAAAACTCCTAATGGTGTGGAACTTAGTTTGAATTTGA 2123 
I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 517 — ACTCT TAT G AC AT T AAC AC T T C AAG AG — AAAG AG C T AAG 

CTGA 476 

Qy 2124 

AATCACGCCGCATGCACAAAGGGACAGGCCCAGGCCCGACCTCAGGTCATCCGCCCGCTG 2183 
III I I I I I I I I I I I I I I I I II 

Db 4 75 AGTCT T T C AT T C AC AT AAAGA TGACACAATTTAA — 

AAAT 438 



Qy 2184 

GCTGCAGAGCATCCCTGGGAGCCAAGGCGAGGCCCGTGGAGCCTGAGCTTTGTGTAGCTC 2 2 43 
I I I I I I I III III II I I I I I 

Db 43 7 GCTACAAGGC 

AGG T AT GG AC AAAG T AC TAT G AGGGT AAT AAGG AG G 3 92 



Qy 2244 

GAGCTTTGTGTAGCTCGTGCACTTATTATGCACCACCTCCCTTCAGTCACCACTCCTCTT 23 03 
II II I I I I I III II 

I I I I I 

Db 391 GA ATGATTAACACG CTAAAAAT 

ATACTCTT 362 



Qy 23 0 4 CCTCCGCCATCCTCATTTATACTGATTGCAC- 

ACCCCCCGCTCAAACAACAATGTCCTTA 2362 

I I I I I I I I I I III II II I I I I I 

Db 361 AACCTAA — TGAAATGAAAGTACTGCCTTCC CAAAC — 

CGAAGTTC 32 0 



Qy 2363 TTATGATGACCATCTCGTAGTGGTACA- 

TTCCATTCCTATTTAAGGTAAGCCCAAAGCCC 2 421 

II III I I I I I I II I I I I I I I III I I I I I I 

Db 319 AAAAC AGGT AT AT T T AC AT T T AAAAT TGT AT T T AAAAT AA- 

TTCAAAGCTT 270 

Qy 2422 

ACTTTTGGATTTTCTCGACTGTCCGAGAAAAGTTGTGTAAGCGCCTGCGTTCTTCTGGGT 2 4 81 

II I I I I I 

Db 269 AC 

CACCAGC 261 

Qy 2482 

TTGGCTAGATAGGGTTGTGTCCCTCTATGGAATGGAGAGTGATGTGGGCAAGGGTGTCAT 2 5 41 

II I I I I I I I I I I I I I 

Db 26 0 AG CCCTTT 

TCTGACCGTAGGTAAC— 23 7 

Qy 2542 

TTTCTCGCACAATACAACTCACTGAGGATGCTTCTGTAGAAGTGAGAAACACGATGAGTA 2 6 01 
I I I I I III II I I I I I I 

I I I 

Db 236 TTTCCCATCAAATTCATC-CACTG 

GTA 211 

Qy 2602 

CATTCAGAATTACAATAACTCACTCTCACTGGGTAACTTCTCATGATAGATTTGTATGAT 2661 

III I I I I III I I I I I I I I I I 

Db 210 CCTT TATATCCGGCTCAACCTGAGAA 

ATGGTA 179 

Qy 2662 CAATACGGGTCTATTTTTATGTC 

AACTGAACACTGTAGGGTACCTTCCAGTCT 2714 

I I I I I I I I II III II I I I I I I I I I I I I III 

I 

Db 178 CAGTTCAGGTGTTCTTCTATCT C AG AT AG T AAC T AAG T AAT G GAG AG AAAAT T T C 

CA 122 

Qy 2715 

TTTTCAAGATTGTTAAATTGAGACAAGTAATTGAATAATTTGTCCTATTTTTATTTTAAA 2 7 7 4 
I II II I I I I II I I I I I I I I I I I I I I I 

I I I I I 

Db 121 AATAAAA AT T TC AT T T AGA- AAG- AG TTG — TATTTTATCATC 

ATT AAA 7 7 

Qy 2775 

AAAAGTGAATGGACTGAAATGTTAAATGTGAATGTACATTTCTTAATTGCAATTTTTCTA 2 83 4 

I I I I I I I I I I I II III I I I I I III I I I I 

I 

Db 76 TAAAG CTGGCCAGTAG-GCCAACAGTGCTCGAGGAATTC 

CGCAGCTTTTAGA 2 6 

Qy 2 835 -CTGAGTGTTTGCACTATACTTTCTGG 2 86 0 

I II I I I I I I I I I I I 
Db 2 5 GC— AGAAGTAACACT-TCCGTACAGG 2 
RESULT 1 
US-10-478-914-4 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 4 

LENGTH: 3210 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-478-914-4 



Query Match 36.9%; Score 7203; DB 1; Length 3210; 

Best Local Similarity 46.1%; Pred. No. 0; 
Matches 1625; Conservative 0; Mismatches 1095; Indels 802; 
Gaps 225; 



Qy 1 GCGGGGCGGGCCGGC-GGCGGAGGCCGGGC- 

CGCGGAGCCAGGAGTGACTAGCAGCAGTT 5 8 

II I I I I I I I I I I I I I I I I I I I 

I I 

Db 1 GCACTGTTGGCCTACTGGCCCAGGCCTTGCACACG- 

TT 48 



Qy 59 

GGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCGTGGTGCG 118 



Db 49 AGTGC- 

GTTCA 82 



-TCCCTGGCAAGGATGG- 



-GAAGGCAT 



Qy 119 GGTTTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGGCGCCTTC-GGGAGGC 

GAGCAGG 174 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 83 

GTCCTAGGAGTAGGAAGGGGCAGAGGTGTTGATGGCCCCTACAGAGCGGCCAAGGACAAG 142 
Qy 175 

CAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGC 23 4 

I I I I I I II I III I I I I I I I I I III I I I I 

I 

Db 143 GAGCTGCTGTTCGAAAC — AGC — CTTCCTGCTCCC— 
CAACCTGCCTCCCA C 189 

Qy 235 GTGGCGGCGCTCGGCCT 

CGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCG 2 91 

I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 19 0 CCAACAG-GTTTTGCATATACTCTACTG-GGAAGAGG GACACACC 

CGAC- 236 

Qy 292 

AGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGGCGCCCCCGG 351 

II I I I I I I I I II I I I I I I I III I II I I I I I I I 

I 

Db 23 7 TGCATCACTGC-CCTCCAAGTCTCTCCCTGC- 

CCTGTCCAGCATCCAGGAGCACCCCTAG 294 

Qy 352 CGGCGTGTGAAT- 

GGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCG 39 7 

II I I III Ml I I I I I I I I I I I I I 

Db 295 

TTGGGGAAGCTTCTGTGACTCCCCCTACAACAGCCTAGGATGGAGTGGGGTTTGTGAACA 35 4 

Qy 398 — GCC-TCGGCCGCCT CTGGCAG — CCCGGGCCCG- 

GGCGAGGGCTC 43 8 

II I I I I I I I I I I I I I I I I I I I I I I II 

Db 355 

AATGCAGAAGGCAGTCTTAGGGAGGTCAGCTGACATGCCCCTGGCCTGTGGCTGGGAAGT 414 

Qy 439 CGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTCGGC 

CTCTGCCT 494 

II I I I I III II I MM III I III II I I I I 

I 

Db 415 AGC-AGAGGC-TAAGGTTCTTCCCCGC TCTGGGGTTGCC- 

AGGAGTAGCACTGGAT 46 7 

Qy 495 CAGCCGCGGCGTCG TCGCC-CGCGGGGGGCGGCGCC-GAGGC- 

GCTGGAGCTGCTG 547 

I I I I I I I I II I I I I I I I I I I I I I I I I III 

Db 46 8 CAGTC-AGGTGACAGGGCTCTCCTCTCTCTGAGCAGGTCCGGTGGCAGC 

CTTCAA 521 

Qy 548 GAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAGAGGGAGCCC — 

CGCCTGCTGC 605 

I II I I I I I I I I III I I I I I II I I I I I III II 
Db 522 GGTCGCCACGCCGTATTCCCTGTATGTCT — GTCCCGAGGGGCAGAACGTCACC — 



CTCA 577 



Qy 6 06 CCTGTTTGCACTCGG — CCTGTAGTGCCTGCTTAGGGCCCGC GGCCCCCGCC- 

GCC 658 

I I I I I I I I I I I I I I I I I I I I I I I I III I I 

I 

Db 578 CCTGCAGGCTCTTGGGCCCTGTGG 

AC AAAGGGC AC GAT G T G AC C T T C T AC AAGAC 632 

Qy 659 G 

CCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACTGTCC 714 

I I I I I I I I I I I I I I I I I I I I I I I I I II I 

I I 

Db 633 GTGGTACCGCAGCTCGAGG GGCGAGGT-GCAGAC — CTGCTC 

AGAGCGCCGGCC 6 83 

Qy 715 

CGTGTGCAAGCAACAGTGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCGTGATAG 7 7 4 

I I I I I I I II I I I I III I I I I 

I 

Db 684 CATCCGCAACCTCACGT TCCAGGACCTTC 

ACCTGCACCATGG 72 5 

Qy 775 

TGGCAGCAAGGCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCACTAGCTGTGAGGA 83 4 

III I I I I I I I I I I I I I I I I I I I I I I I III III 

I I I 

Db 726 AGGCCACCAGGCTGCCAACACCAGCCACGACCTGGCTCAGCGC — CACGGGCT 

GGA 779 

Qy 835 TAATGCC-CCAGCCACCA — 

GCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACCTG 891 

I I I I I I I I I I I I I I I I I I I II III 

I I I I I 

Db 78 0 GTCGGCCTCCGACCACCATGGCAAC TTCTC — 

CATCACCATGCGCAACCTG 82 8 

Qy 892 TG — TAGAGGCGCAC C AG - C GGG T G AAG T AC AC C AAGG AC C A- 

TACTG 935 

II I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 829 ACCCTGCTGGATAGCGGC- 

CTCTACTGCTGCCTGGTGGTGGAGATCAGGCACCACCACTC 8 8 7 

Qy 936 TGCGCTCTACTGGG-CCA GCCAAGTCTCGGGATGGTGAACGTACTGTCTA — 

TTGC 988 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 888 GGAGC ACAGGGTCCATGGTGCCATGGAGCTGCA-GGTG — 

CAGACAGGCAAAGATGC 9 41 

Qy 9 89 AACGTACACAAGC ATGAACCCCTTGTGCT 

GTTTTGTGAGAGCTGTGATACTCT 10 41 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 942 

ACCATCCAACTGTGTGGTGTACCCATCCTCCTCCCAGGAGAGTGAAAGTAAGGGACCAAC 10 01 

Qy 10 42 CACCTGCC GAG ACTGCCAGCTCAATGCCCACAAGGACC 

ACCAGTACC 1088 

I I I I I I II III II I I I I I I II I I I I 



Db 1002 CTCTTGCCCCTTTTGGGTTCTCTGTTTTCT-TCTGTCCTCA 

TCCTGCACCCAGACCC 1057 

Qy 10 89 AGTTCTTAGAGGATGCAGTGAGGAACCAGCGCAAGCTC- 

CTGGCCTCACTGGTGAAGCGC 1147 



Db 1058 TGTTT GGAACTCTG- 

GCCTCATCACCCCAAGCCCTCAGAACCCCCCGGTCCTCCTC 1112 

Qy 1148 

CTTGGGGACAAACATGCAACATTGCAGAAGAGCACCAAGGAGGTTCGCAGCTCAATCCGC 12 0 7 



Db 1113 CTTTTCTGCTGCTGCACATCCCTTCTGCTTC-CTCCTTGGTG 

CAATCC-C 116 0 

Qy 12 0 8 CAGGTG 

TCTGACGTACAGAAGCGTGTGCAAGTGGATGTCAAGATGGCCATCCTGCAG 12 6 4 

III I II III I I I I I I I I I I I I II II 

Db 1161 CAGAAGCCCACTCTCCTTC CATCTGCTC-TGGA-GTCT — 

CTGCTCCTCTTGACT 1211 

Qy 1265 ATCATGAAGGAGC— TGAATAAGCGGGGCCGTGTG— CTGGTCAATGATGCC- 

CAGAAGG 1319 

I I I I I I II II I I I I I I I I I I I I I I I I I I I 
Db 1212 CTC-TGGAGTGGCTGTGCCTTGGCAGTGACCTTTGGCCAGGGCAA — 
GTGCCTCA 1263 

Qy 132 0 TGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGC — 

AGCACTGGACCATGACCAAGATCCA 1377 



Db 126 4 TGAC — AGGTACTG — GGTGCCCCAG — GCAGCTAAGTGCCG — CCCTGCCCA 

CCA 1311 

Qy 13 78 GAAGCACCAGGAGCACATTCTGCGCTTTG 

CCTCTTGGGCTCTGGAGAGTGACAACAA 143 4 



Db 1312 GC— CCCCTATGGCTTGGGAAG- 

GCTGGGGGTCCTCTTGGCCAACAGGGTGAAACCCCAT 136 8 

Qy 1435 CAC AGCCC TTTTGCTTTCTAAGAAGTTGATCTAC- 

TTCCAGCTGC 1478 



Db 1369 

CTCTACTAAAAACACAAAAATTAGCCAGGCATGGTGGTGCGCGCCTGTAGTCCCAGCTAC 142 8 

Qy 14 79 ACCGGGCCCT CAAGATGATTG- 

TGGATCCCGTGGAGCCACATGGCGAGATGAAGTT 1533 



Db 142 9 TCAAGAGGCTGAGGCAAGAGAATCGCTTGAACCCG-GGAGTCA- 

GAGGTT 14 76 



Qy 153 4 TCAGTGGGACCTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCAAGATTG- 

TGGCAG 1592 



Db 1477 GCAGTG 

CCAGCCTGCTGATAG 152 0 

Qy 1593 AGCGTCCTGGCACTAACTCAACAGGCCCTGCACC CATGGCC- 

CC 1635 



Db 1521 AG— 

CAAGACTCCATCTCAAAAAAAACAACCCAAAATTTGCCTGGCATGGTGGCAGGCA 15 7 7 

Qy 1636 

TCCAAGAGCCCCAGGGCCCCTGAGCAAGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGT 1695 



Db 1578 TCTGTAATCCC 

AGCTACTCGGGAGGCTGAGACATGAGAGCTGCTTGAACCTGGGAG- 1633 
Qy 1696 

GCAGGAAGGCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCCCCATGT 1755 



Db 163 4 GCAGAA GTTGCAGTGAGCC — GAG AT C A- C AC C AC TGCACTCCAG 

CCTGGGT 1682 

Qy 1756 GTCAG-GTGTGAAACGGTCCC 

GCTCAGGTGAGGGCGAGGTGAGCGGC 1801 

I I I I I I I I I I I I I I III I II I I I I 

Db 1683 

GACAGAGCGAGACTCTGTCCCAAAAAATCAAAAAAATCACTTTTGGTAGAGATGCAC— T 17 4 0 

Qy 18 02 CTTATGCGC-AAGGTGCCACGAGTGAGCCTTGAACGCCTGG 

AC 18 43 



Db 1741 CT CGCTATGTTGCCCAGGCTG- 

GTCTTGAACTCCTGGGCTCAATTGATCTTCCCAC 1795 

Qy 18 44 CTGGACCTC ACAGCTGACAGCCAGCCA — 

CCCGTCTTCAAGGTCT 1886 



Db 1796 CTTGACCTCCAAAGTGCTGGGATTACAGGTGTGAGCCA- 

CCATGCCTAGCCTCAGGGAAT 1854 

Qy 1887 T C C C AGGC AG T AC - C AC TGAGGAC T AC A — AC CTTATTGTTATT G AAC G T G 

GCGCT 1939 

II I I I I I I I I I I II I I I I I I I I III 
Db 1855 TCTTA- 

TAAGAACTCTATGAAGTAGGCATCACCATCTTCTCTGTATCCATGGAAAGAGAG 1913 
Qy 19 40 GCC GCTGCA- 

GCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCC 199 4 

III I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1914 GCCTAGAGATGTATGCTAACTTGC — CCAAG— CT-CACAT 

AGCCCAGGGT 1959 



III II I I I I I II I III I I I I 
-AGCCGAGATC GCGCCACTGCACT 



Qy 1995 CACCCCTGGCTGGCATG GCCATTGTCAAGGAGGAGGAGACGGAG — 

GCTGCCATTG 2048 



I I 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 

Db 196 0 — AGCATAGCTGGGATGTTGAGCTGAGGCC — 
GGAGGAGAAGTAGCAGTCGCTGGCA 2 012 

Qy 2 0 49 GAGCCCCTCCTACTGCCACTGAGGGCC CTG 

AGACCAAACCTGTGCTT 2 095 

I I I I I I I I I I I I I I I I III I I I I I I I I 

Db 2 013 GAG-CACACAGGCTG- 

CTCTGGGGGATGAGCTGGTGCGTTTAAGGAACAGGCCAGCAC — 2 06 8 

Qy 2 096 ATGGCTCTTGC GGAGGGTCCTGGTGCTGAG — GGTC 

CCCGCCT— 2136 

I I I I III I I I I I I I I I I I I I I I I I I I I I 

Db 2069 - 

TGGCATTCGCAAGCAGTGGGGAAGGGGAGAGATGCCGAGGTGGTCAGTATCCTGACTTT 212 7 
Qy 213 7 GGCCTCACCTAG — 

TGGCAGCACCAGCTCAGGGCTGGAGGTGGTGGCTCCTGAGG- 2189 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2128 

CAGAGGCCTTTTTTTGTTTGTTTTAATTTTTGCTAGATTGATATTAAAAACTCATGTGGA 218 7 

Qy 219 0 GTACCTCAGCCCCAGG TGGTGGCCCGGGAACCCTGGATGACAG 

TGCC 2236 

I I I I I I III II III I II I I I I I I I 

I I 

Db 2188 GGAACTCA 

AG G AAT G T T T AG AAG AC C AAAAG T C C C C AAT G AC AGG AAC AAAAGC A 2242 
Qy 2237 

ACCATTTGCCGTGTCTGCCAGAAGCCAGGCGATCTGGTTATGTGCAACCAGTGTGAGTTT 2296 

I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2243 ACCAATT — TTTAACTTTC TCTTCTCAT 

TCCTGTTTTCATTG 2282 

Qy 229 7 TGTTTCCACCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGGAGG- 

AGTGGAG 2355 

I I I I I I I I I I III I I I I I I I I I I I I I I I 

I I I 

Db 2283 ATTTCCCACATGTAGT CCTTTTG CT- 

CAGGAAGTCTTTGGGGAAATTAAGGAT 233 4 

Qy 2356 CTGCTCA-CTCTGCCAT--GTGCTCC CTGAC — 

CTGAAGGAGGAGGAT 239 8 

II I I I I I I II I I I I I I I I I III I I I I I 

I 

Db 2335 CTTTGAAGCTCTGAAATAGGTGATCAGGTTAGTGGTGTCTGTCAGCTGTC- 

TAAGAGGTT 2393 

Qy 2399 GGCAGC — CTCAGC- 

CTGGATGGTGCAGACAGCACTGGCGTGGTGGCCAAGCTCTCACCA 2 455 

III I I I I I I I I I I I I I I I II I I I I 

Db 2394 

GGAAAATGAACTACTCAAGATAGTCACGAAAATACTGAAAGTTTGATTTTTCTTTC 2 4 49 

Qy 2 456 GCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCC 

CTATTCTGTCACGAACCCTG 2511 

II I I I I I I I I I I I I I I I I I I I I I I I 



Db 2450 -CATATTTGAATTAATTTTTTC- 

TGTTTGACTGGAAGGGGTTTTTGTATAACTAAAACCT 2507 

Qy 2512 CCGCCCCCTGCATCAGCTGGCTACCGAC— TCCACCTTCT 

CCCTGGAC 2557 

III I I I I I II I I I I I I I 

I 

Db 250 8 CAGC 

GCATAAAGGAGATTTAAAAGGAGCACATGATTTAGTGGGTGGGCCATGAAA 2 562 

Qy 2558 C AGCCCGG TGGCA — CCCTGGA- 

TCTGACCCTGATCCGTGCC 2596 

I II II I I I I I I I I I I I I I I I I 

I 

Db 2 563 CTAGAGATGGGATTTGGGGGTGAATTTGTCAATATCTGGATTTTAATCCAGA- 

CATCTC 262 0 



Qy 2 59 7 CGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCC CCAC 

AGGA 2640 

II I III III I II I I I I I I I III 
Db 2621 TGCTAAC — 

GAGCCTTTGGTAAGTCACTTCAGATACTTTTCCTCCTTTTTACAAAGAGAG 2 6 7 8 

Qy 26 41 GTTTGCCCAGGATGTGGGCC GCATGTTCAAG — C — AATTCAACAAG 

TTAAC 2688 

I III III III II I I I I I I I I I I I I I I I I 

I I 

Db 2679 

GGCTGGCTTAGTTATTTGCCAAAGCCCCTTCCAGGCCTGAATTCCACAAGTACGATTTAC 2 73 8 

Qy 26 89 TG-AG-GACAAGGCAGACGTGCA- 

GTCCATCATCGGCCTGCAGCGCTTCTTCGAGACGCG 2745 

I I I I I I II I I I I I I I II I I I I I I I I I I I I 

I 

Db 2 739 TGTAGTGTCTTATCACTCTTTCATGTC — ACAATAGCGTGGAGC 

AT T AG AG AAAAG 2 792 

Qy 2 7 46 CATGAAC GAGGCC TTCGGTGACACCA 

AGTTCTCTGCTG 2783 

I I I I III I I I I I I I I I I I I 

Db 2793 

C C TAG AC TTTTAGTTGATAGCCAGTT G AAAT ATC AT TGAT AG AAT T T T AG T T T T AGG AAA 2 852 

Qy 2784 TGCTGG-TGGAGCCCCCGC CGATGAGCCTGCCTG-GTGCT — 

GGCCTGAGTTC 2832 

I I I I I I I II I I I I I I I I I I I I III 

I I 

Db 2 853 AATTGGTTTGATTTCTAGCTTTATTACTATTAG GTATGTGAGCTTGGGC 

AAATC 29 06 

Qy 2 833 CCAGGAGCT GTCTGG-TGGCCCTGGTGATG-GCCCCTGAGGC 

TGGAGCCCCC 2 8 82 

I III I I I I I I III I I I I I I I I I I II 

Db 2907 

GCTTAATCTTTGAGTCTAGTTTTCTCTCAAAATGAGAACATTAGGCTAAATGATTTCCGA 2 966 

Qy 2 8 83 ATGGCCAGCCCAGCCTGGC — TCTGT- 

TCTCTGTCCTGTCACCCCATCCCCACTCCC 2936 

I I I I I I I I I I I I I I I I I I II II I 

Db 2967 



GTTTCCAGCTAGTCCTAGAGTTCTATATTTCTACATAGTTGAATTATTTTATCATGCTGT 3 02 6 



Qy 2937 — CTGGTGGC-CTGACTCCCACTCCCTGGTGGCCCC 

ATC 2972 

I I I I I I I I I I III II II I 

Db 3 02 7 TGCTGGGGAATATGACTAACCCT — 

TTTGAAGCTACTAATTTTATGTCGAGCTTTAAAGT 3084 

Qy 2 9 73 CCCCAGT TCCTC — ACGATATGGTTTTTACTTCTGTG- 

GATTTAAT 3 015 



Db 3085 

CCATAATTGTTATCTTCAGAAAATATTATTTGACCTACAGTATGTCCAAATCAATTTAAT 314 4 

Qy 3016 AAAAAAAACTTC-ACCAGTTCAAAAAAAAAAAAAAAAAAAAA 3056 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3145 AAAATCGCTTTATAACAGGAAAAAAAAAAAAAAAAAAAAAAA 3186 
Qy 11 CCGGCGGCGGAGGCC GGGCC

GCGGAGCC 3 8 



Db 3209 

CGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCTGTTATAAAGC- 3151 

Qy 39 AGGAGTGACTAGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGC 

GCGGCGGGC 93 

I I I I I I I I I I I I I I I I I III 

Db 3150 - GAT T T T AT T AAAT T GAT T T GG AC AT AC T G T AG 



GTCAAATAATATTTTCTGAAGAT 3 096 



Qy 9 4 AGC GGCCCAGGAGGCGCGTGGTGCGGGTTTCGGCGGC 

GGCTGAGGAAGAA 143 

II III I I I I I I I III I I I I I I 

I 

Db 3095 

AACAATTATGGACTTTAAAGCTCGACATAAAATTAGTAGCTTCAAAAGGGTTAGTCATAT 3 036 
Qy 144 

GCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAGTTGGCCGTGCCGTAGCAGCGTCC 2 03 

II I I I I I I I I I I I I I I I I I I I 

Db 3 035 TCCC CAGCAACAGCATGATAAAATAATTCAACTAT 

GTAGAAATATAG 29 89 

Qy 204 CGCGCGCGG-CGGGC-AGCGGCCCAGGAGGCGCGTGGCGGCGCTCGGCCTCGC 

GGC 257 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 29 8 8 AAC TC T AGGAC T AGCTGGAAACTCGGAAATC AT T T AGC — 

CTAATGTTCTCATTTTGAGA 2931 

Qy 258 GGCGGC — 

GGCGGCAGCGGCCCAGCAGTTGGCGGCGAGCGCGTCTGCGCCTGCGCG 311 

I I I I I I I I I I I I II I I I I I I III I 

Db 293 0 GAAAACTAGAC-TCAAAGATTAAGCGATTTGC-CCAAGCTCACAT — 

ACCTAATAGTAAT 2 8 75 

Qy 312 — GC-GGGCCCCGCGCCCCTCCTCCC 

CCCCTGGGCGCCCCCGGCGGCGTGTGAA-T 363 

III I I I I I I I I I I I I I I I 

Db 2874 

AAAGCTAGAAATCAAACCAATTTTTCCTAAAACTAAAATTCTATCAATGATATTTCAACT 2 815 



Qy 36 4 GGCGGCCTCCGCGGCGGC — AGC 

CTCGGCAGCAGCGGCCTCGGCCGCCTCTGGCAG 417 

III II II II III II I I I I 

Db 2814 GGCTATCAACTAAAAGTCTAGGCTTTTCTCTAATGCT CCACG- 

CTATTGTGA 2 76 4 



Qy 418 

CCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCC — 4 75 
II I I I I I I I I I I I I I I I 

Db 2 763 CAT G AAAG AG T G AT AAGAC AC T AC 

AGTAAATCGTACTTGTGGAATTCAGGCCTG 2 710 



Qy 476 — 

GCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAG 532 
I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2 709 GAAGGGGCTTTGGCAAATAACTAAGCC 

AGCCCTCTCTTTGTAAAAAGGAG 266 0 



Qy 533 GCGCTGGAGCTGCTGGAGC ACTGCGG — 

CGTGTGCAGAGAGCGCCTGCGACCCGAGA 58 7 

I I I I I I I I I II III I I I I I I I I I I I 

I I 

Db 2 659 GAAAAGTATCTGAAGTGACTTACCAAAGGCTCGTTAGCAGAGA-TGTCTG- 

GA 2609 



Qy 588 

GGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGG 6 4 7 

I II I I I I I I I I I I I I I I I I I I I I 

I I 

Db 26 0 8 TTAAAATCCAGATATTGACAAATT — CACCC— CC — 

AAATCCCATCTCTAGTTTCATGG 2555 

Qy 6 48 CCCCCGCCGC— CGCCAACAGCTCGGGGGACGGC 

GGGGCGGCGG 6 89 

I I I I I I I II I I I I I I I I I I I I 
Db 2 55 4 CCCAC- 

CCACTAAATCATGTGCTCCTTTTAAATCTCCTTTATGCGCTGAGGTTTTAGTTA 2 496 

Qy 69 0 -GCGACGGCACCGTGGTGGACTGTCCCGTGTGCAAGCAACAGTGCTT- 

C T C C AAAG AC AT 747 

II III I I I I I I I I I I I I I I I I I I I 

Db 2 495 T AC AAAAAC C C C TTCCAGT 

CAAACAGAAAAAATTAATTCAAATATGG 2 4 49 

Qy 748 CGTGGAGAATTA TTTCA TGCGTGAT AGTGG 

CAGC— 781 

I I I I I I I I I I I I I I I I I I I I I II 

Db 2448 

AAAGAAAAATCAAACTTTCAGTATTTTCGTGACTATCTTGAGTAGTTCATTTTCCAACCT 23 89 

Qy 782 — AAGGCTGCC-ACCGACGCCCAGGATGCGAACCAGTGCTGCACT 

AGCTGTGAGG 833 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 23 8 8 C T T AGAC AGC TGACAGACACC 

AC T AAC CTGATCACCTATTT C AG AGC T T C AAAG 2335 

Qy 83 4 AT AATGCCCCAGCCACCAGCTACTGTGTGGAGTGCTCGGAGCCT — 

CTGTGTGAG 8 86 

II I I I I I I I II II III III I I I I 

I I 

Db 233 4 ATCCTTAATTTCCCCAAAGACTTCCT 

GAGCAAAAGGACTACATGTGGGAA 22 85 

Qy 887 ACCTGTGTAGAGGCGCACCAGCGGGTGAAGTACACCA 

AGGACCATACTG 935 

I I I I I I I I I I I I I I I I I III 

Db 2284 

AT C AAT G AAAAC AGG AAT G AG AAG AG AAAGT T AAAAAT TGG TTGCTTTTGTTCCTGTCAT 2 2 2 5 

Qy 936 TG CGCTCTACTGGGCCAGCCAAGT 

CTCGGGATGGTGAACGTACTGTC- 9 82 



Db 222 4 TGGGGACTTTTGGTCTTCTAAACATTCCTTGAGTTCCTCCACAT- 

GAGTTTTTAATATCA 2166 

Qy 9 83 -TATTGCAACG TACACAAGCATGAA CCCCT TGTGCTGTTTTGTGA- 

GAGC 103 0 

I I I I I I I I I I I I I II I I I I I I I I I III 

I 

Db 2165 ATCTAGCAAAAATTAAAACAAACAAAAAAAGGCCTCTGAAAGT- 

C AGG AT AC T G AC C AC C 2107 



Qy 1031 TGTGATACTCTCACCTGCCGAGACTGCCAGC TCAATGCCCACAAGGACC — 

ACCA 10 83 

I I I I I I I I I I I I I I I I I II I I I I I I I II 

Db 2106 TCGGCATCTCTCCCCTTCC- 

CCACTGCTTGCGAATGCCAGTGCTGGCCTGTTCCTTAAAC 2 0 4 8 

Qy 1084 GTACCAGTTCTT AGAGGATGCAGTGAG — 

GAACCAGCGCAAGCTCCT 112 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2047 

GCACCAGCTCATCCCCCAGAGCAGCCTGTGTGCTCTGCCAGCGACTGCTACTTCTCCTCC 19 8 8 

Qy 1129 GGCCTCACTGGTGAAGCGCCTTGGGGACAAACATGCAACATT — 

GCAGAAGAGCACCAAG 1186 



Db 19 8 7 GGCCTCA — GCTCAACATCC CAGCTATGCTACCCTGGGCT- 

ATGTGAGCTTGG 1938 

Qy 1187 G-AGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTG — TGCA- 

AGTGGA 12 42 



Db 193 7 GCAAGTTAGCA— TACATCT-CTAGGCCTCT— CTTTC 

CATGGATACAGAGAAGA 1888 

Qy 12 43 TGTCAAGATGGC— CATCCTGCAGATCATGAAGGAG 

CTGAATAAGCGGGGCCGTGT 1296 



Db 1887 TG — GTGATGCCTACTTCATAGAGTTCTTATAAGAATTCCCTG 

AGGCTAGGCATGGT 1833 

Qy 129 7 GCTGGTCAATGATG CCCAGAA GGTGACTGAG — GGGCAG — CAG — 

GAGCGC 13 42 



Db 1832 G— 

GCTCACACCTGTAATCCCAGCACTTTGGAGGTCAAGGTGGGAAGATCAATTGAGC-C 17 76 

Qy 13 43 CTGGAG — CGG CAGCACTGGAC — 

CAT G AC C AAG AT C C AG AAGC AC C A 13 86 

I I I I I I I I I I I I I I I III II III I 

Db 17 75 CAGGAGTTCAAGACCAGC- 

CTGGGCAACATAGCGAGAGTGCATCTCTACCAAAAGTGATT 1717 

Qy 13 8 7 GGAGCACATTCTGCGCTTTGCCTCTTGGGCT 

CTGGAGAGTG 142 7 



Db 1716 TTTTTGATTTTTTGGGACAGAGTCT- 

CGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGTG 1658 

Qy 142 8 ACAAC — AACACAGC- 

CCTTTTGCTTTCTAAGAAGTTGATCTACTTCCAGCTGCACCGGG 14 8 4 
I I I I I I I I I I I I I I I I I 

Db 1657 ATCTCGGCTCACTGCAACTTCTGCCTCCCAGG 

TTCAAGCAGCT 1615 



Qy 1485 



CCCTCAAGATGATTGTGGATCCCGTGGAGCCACATGGCGAGATGAAGTTTCAGTGGGACC 15 4 4 
III I I I I I I I I I I I I I I II III 

Db 1614 — CTC ATGTCTCAGCCTCCCGAGTAGC 

TGGGATTACAG 15 79 

Qy 15 45 TCAATGCCTGGACCAAGAGTGCCGAGGCC 

ATTGTGGCAGAGCGTC 159 8 



Db 1578 ATGCCTG — CCACCA-TGCC- 

AGGCAAATTTTGGGTTGTTTTTTTTGAGATGGAGTC 1526 

Qy 1599 CTGGCACTAACTCAACAGGC CCTGCACCCATGGCCC- 

CTCCAAGAGCCCC 16 47 



Db 152 5 TT-GCTCTA — TCAGCAGGCTGGAGTGCA 

GTGGCGCGATCTCGGCTCACTGCAACCT 1472 

Qy 16 48 AGGGCCCCTGAG — CAAGCAGGGCTCTGGCAGCAGCC- 

AGCCCATGGAGGTGCAGG 170 0 



Db 1471 

CTGACTCCCGGGTTCAAGCGATTCTCTTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGG 1412 

Qy 1701 AAGGCT ATGGC TTTGGGT 

C AGG AG AT GAT C C C T AC 1735 

II I I I I I I I I I I I I I I 

Db 1411 

CGCGCACCACCATGCCTGGCTAATTTTTGTGTTTTTAGTAGAGATGGGGTTTCACCCTGT 1352 
Qy 1736 T 

CAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCAGGTGAGGGC 17 89 

I I I I I I I I I I I III I I I I I I 

Db 1351 TGGCCAAG AGGACCCC CAG 

CCTTCCCAAGCCATAGGGGGCTGG 13 09 

Qy 179 0 -GAGGTGAGCGGCCTTATGC-GCAAGGTGCCACGAGTG — 

AGCCTTGAACGCCTGGACCT 18 45 



Db 1308 

TGGGCAGGGCGGCACTTAGCTGCCTGGGGCACCCAGTACCTGTCATGAGGCACTTGCCCT 12 49 

Qy 18 46 GGACC TCACAGCTGACAGCCAGCCAC-CC GTCTT 

CAAGGTCTTCC 1889 



Db 12 48 GGCCAAAGGTCACTGCCAAGGCACAGCCACTCCAGAGAGTCAAGAGGAGCAGAGAC- 

TCC 1190 

Qy 189 0 CAGGCAGT 

ACCACTGAGGAC T AC AAC CTTATTGTTATT G AAC G T GG 1935 



Db 1189 AGAGCAGATGGAAGGAGAGTGGGCTTCTGGGGATTGC- 

ACCAAGGAGGAAGCAGAAGGGA 1131 



TTTGGCAAG 

I I I I I I I I I I I I 



Qy 1936 CGCTGCCGCTGCAGCT- 



ACCGG 1956 

I I I I I I I I I I I I I I I 

Db 1130 TG- 

TGCAGCAGCAGAAAAGGAGGAGGACCGGGGGGTTCTGAGGGCTTGGGGTGATGAGGC 10 72 

Qy 1957 CCAGCCAGGGACTGCGCCTGCAG 

GAACCCCTGGTGCCCCA 1996 

III I I I I I I I I I I I I I I III I I III 

I 

Db 10 71 CAGAGTTCCAAACAGGGTCTGGG — 

TGCAGGATGAG G AC AG AAG AAAAC AG AG AAC C C AA 1014 

Qy 199 7 CCCCTGGCTGG — CATGGC 

C AT T G T C AAG GAGGAGGAGACGGAGGCTGCCA 2 045 

I I I I I I II I I I I I I I I I I I I I I II 

I I I 

Db 1013 AAGGGGCAAGAGGTTGGTCCCTTACTTTCACTCTCCTG — 

GGAGGAGGATGGGTACACCA 956 

Qy 2 0 46 TTGGAGCCCCT CCT-ACTGCCACTG-AGGGCCCTGAGACCA 

AACC 2088 

I I I I I II III I I I I I I I I I I I I I I I I I I 

I I 

Db 955 

CACAGTTGGATGGTGCATCTTTGCCTGTCTGCACCTGCAGCTCCATGGCACCATGGACCC 896 

Qy 2089 

TGTGCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAG 214 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 895 TGTGCT CCGAGTGGTGGTGCCTGAT-CT CCACCACCAGGCAGCA- 

GTAG 8 49 

Qy 2149 TGGCAGC — ACCAGC TCAGG GC-TGGAGGTGGTG 

GCTCCTGAGGGT- 2191 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 8 48 AGGCCGCTATCCAGCAGGGTCAGGTTGCGCATGGTGATGGAGAAGTTGC- 

CATGGTGGTC 79 0 

Qy 2192 ACCTCAGCC CCAGGTGGTGGCCCG 

GGAACCCTGGAT 222 7 

II I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 789 

GGAGGCCGACTCCAGCCCGTGGCGCTGAGCCAGGTCGTGGCTGGTGTTGGCAGCCTGG-T 731 

Qy 222 8 GACAGTGCCACCAT-TTGCC-GTGTCTG- 

CCAGAAGCCAGGCGATCTGGTTATGTGCAAC 22 84 

I I I I I I I I III III I I I I I I I II II I I I I I 

Db 73 0 G GCCTCCATGGTGCAGGTGAAGGTCCTGGAACGTGAGGTTGCGG — 

ATGGGCCGG 6 78 

Qy 22 85 CAGTGTGAGTTTTGTTTCCACCTGGACTGTCACCTGCCGGCCCTGCAGGAT 

GTACCA 23 41 

I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I I I I 

Db 6 77 CGCTCTGAG CAGGTCTG-CACCT — CGCCCCT- 

CGAGCTGCGGTACCA 63 4 



Qy 23 42 GGGGAGGAGTGGAGCTGCTCAC-TC-TGCCATGTGCTCCCTG 

ACCTGAAGG 239 0 

I I I I I I I I I I I I I I I I I I I III I I I I 

I I I 

Db 633 

CGTCTTGTAGAAGGTCACATCGTGCCCTTTGTCCACAGGGCCCAAGAGCCTGCAGG 5 7 8 

Qy 2391 AGGAGGATGGCAGCCTCAGCCT — GGATGG 

TGCAGACAGCACTGGCGTGGTGGCC — 2443 

I I I I I I I II III III I I I I I I I I I I I I I I I I I I 
Db 577 T-GAGGGTGACGTTCTGCCCCTCGGGACAGACATACAGGGAATAC- 

GGCGTGGCGACCTT 52 0 

Qy 2444 — 

AAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTC 2 5 01 
I I I I I I I I I I I I I I I I I I I I I III 

I I I I I 

Db 519 GAAGGCTGCCACCGG ACCTGC TC AG AG AG AG GAGAGCC 

CTGTC 477 

Qy 2502 ACGAACCCTGCC— 

GCCCCCTGCATCAGCTGGCTACCGACTCCACCTTCTCCCTGGACCA 2 559 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 4 76 A CCTGACTGATCCAGTGCTACTCCTGGCAAC 

CCCAG A 440 

Qy 256 0 GCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCT 

CCAGGAGAAGTTGT 2615 

II II I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 439 GCGGGGAAGAACC TTAGCCTCTGCTACTTCCCAGCCACAGGCCAGGGGCA — 

TGT 387 

Qy 2616 C 

ACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCAA 2 6 7 0 

I I I I I I I I I I I I I I I I I I I I I II II I 

Db 3 86 CAGCTGACCTCCCTAAGACTGCCTTCTGCA-TTTGTTCA 

CAAACCC— 3 42 

Qy 26 71 GC AAT T C AAC AAG T T AAC T GAGGAC AAGGC AG AC G T GC AG T C - 

CATCATCGGCCTGCAGC 2 72 9 

I I I I I I I III I I I I I I III III I I I 

I 

Db 3 41 -CACTCCATCC — TAGGCTGTTGTAGGGGGAGTC — 

ACAGAAGCTTCCCCAACTAGGGGT 287 

Qy 2 73 0 GCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTCT 

CTGCTGTGC 2786 

I I I I I I I I I I I I I III I I I I II I I I I I I 

I 

Db 286 GCTCCT GGATGC 

TGGACAGGGCAGGGAGAGACTTGGAGGGCAGTGATGCAGT-C 23 4 

Qy 2787 

TGGTGGAGCCCCCGCCGATGAGCCTGCCTGGTGCTGGCCTGAGTTCCCAGGAGCTGTCTG 2 8 46 

I I I I I I I I I I I I I II I I I I I I I 

I 

Db 233 GGGTGTGTCCCTCTTC CCAGTAG 

AGTATATGCAAAACCTGTTGG 19 0 



Qy 2 8 47 GTGGCC — CTGGTGATGGCCCCTG- 

AGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTC 2903 



Db 189 GTGGGAGGCAGGTTGGGGAGCAGGAAGGCTGTTTCGAACAG- 

CTTGTCCT 135 




Qy 29 0 4 TG — TTCTCTGTCCTGTCACCCCATCCCCACT- 

CCCCTGGTGGCCTGACTCCCACTCCCT 2960 



III I I I I I 



Db 13 4 TGGCCGCTCTGTAGGGGC-CATCAACACCTCTGCCCCT- 

TCCTACTCCTA 8 7 



Qy 2961 GG — TGGCC CCATCCCC-CAGTTCCT- 

CACGATATGGTTTTTACTTCTGTGGA 3 0 09 



III I 



Db 86 GGACTGAACATGCCTTCCCATCCTTGCCAGGGAGCACTAAGAG- 

AGCACCTGCGTG 32 



Qy 




Db 
Qy 2 CGGGGCGGGCCG GCGGCGGAGG CCGGGC

CGCGGAGCCAGGAGT 4 4 



Db 6 CTGGATTCGCCATCATTTTGCACACTGCGAGGGGCTCCGTGTGTGCGC 

CCTGTCTT 61 



Qy 45 

GACTAGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGG 10 4 



Db 62 GTCTGGCCGTCCTCATCCCTTCCCCCACCCCCTGCCGC- 

CACTCCGACCCGCTCCCAAAG 12 0 

Qy 105 AGG CGC- 

GTGGTGCGGGTTTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGGCGCCTTC 16 0 

II II I I I I I I I I I I I I II I I I I I I I 

Db 121 TGGCTTCACAATAGT-CGGTCCTCGGCG 

GTGTAGGCTGCGC 16 0 

Qy 161 GGGAGGCGAGCAGG- 

CAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAG 219 

I I I I I I I I II II I I I I III I I 

Db 161 

ACCAGGTCCACACTTAAGCGAAATCAAGGAGACCCCCTTCTACTTCTACCTT 212 

Qy 22 0 CGGCCCAGGAGGCGCGTGGCGGCGCT- 

CGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCA 278 

II II III III I I I I I I III III I I I I 

Db 213 TGGGTTTGGT-GCTCAATGCGAAGCTGCTGCAACTCAGACACGCCTAAGTCAAC — 

TCAT 269 

Qy 279 

GCAGTTGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCC 33 8 



Db 2 70 G C AG AAAAAG G AG A- AAAG T T T T GG T AT AC 

AAATGCTCTCAGTCCAGCCAGACACCA 325 

Qy 339 TGGGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCA — 

GCCTCGGCAGCAGC 396 



Db 326 AG CCGAAAGGTTGTG-CTGGCTGCAACCGAAAGATCAAGGAC- 

CGGTATCTTC 3 76 

Qy 397 GGC-CTCGGCCGCCTCTGGC 

AGCCCGGGCCCGGGCGAGGGCTCCGCTG 443 

I I I I I I I I I I I I I I I I 

Db 3 77 TAAAGGCACTGGACAAATACTGGCATGAAGACTG 

CTGCTGTGAC 432 

Qy 444 GC GGCGAAAAGCGCTCCACC 

GCCCCTTCGGCCGCAGCCTCGGCCTCTGCC 493 



Db 433 TGTCGCTTGGGGGAGGTGGGCTCCACCCTGTACACTAAAGC — 

TAATCTTATCCTTTGTC 49 0 

Qy 49 4 TCAG CCGCGGCGTCGTCG — CCCGCGGGGGGCGGCGCCGAGGCGCTGGAG- 

CT 543 

III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 491 GCAGAGACTATCTGAGGC-TCTTTGGTGTAACGGGAAACTGCGCTGC 

CTGTAGTAA 545 



III I I I I I I I I I 
-CCTGAAGTGTGC- 



Qy 544 GCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAG 

AGGGAGCCCCGCCT 600 



Db 546 GCT CATCCCTGCCTTTG-AGATGGTG- 

ATGCGTGCCAAGGACAATGTTTACCACCT 599 

Qy 601 G — CTGCCCTGTTTGCAC-TCGGC-CTGTAGTGCCTGC 

TTAGGGCCCGCGGC 6 48 

I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 600 GGACTGC TTTGCATGTCAGCTTTGTAAT — 

C AG AG AT T T T G T G T T GG AG AC AAAT T 653 

Qy 6 49 — CC CCG CCGCCGCCA-ACAG-CTCGGGGGACGG 

CGGGGCGGC 687 



Db 654 

T T T C C T AAAG AAT AAC AT G AT C C T T T GCC AGACGGAC T ACGAGGAAGG T TTAAT GAAAGA 713 

Qy 688 GGGCGACGGCACCGTGGTGGACTGTCCCGTGTGCA-AGCAACAGTGCTTCTCCAAAG- 

AC 745 



Db 714 

AGGTTATGCACCCCAGGTTCGCTGATCTATCAACATCACCCCATTAAGAATACAGAGCAC 7 73 

Qy 7 46 ATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGCC — 

CAGG 8 03 



Db 774 

TACATTCTTTTATCTTTTTTGCTCCACATGTACATAAGAATTGACACAGGAACCTACTGA 833 

Qy 804 AT-GC GAACCAGTGCTGCA — 

CTAGCTGTGAGGATAATGCCCCAGCCACCAGCTACT 857 



Db 83 4 ATAGCGTAGATATAGGAAGGCAGGATGGTTATATGGAATA 

AAAGGCGGACTGCATCT 89 0 

Qy 858 GTGTGGAGTGCTCGGAGCCTCTGT 

GTGAGACCTGTGTAGAGGCGCACCAGCGGGTGA 914 



Db 891 GTATGTAGTG- 

AAATTGCCCCAGTTCAGAGTTGAATGTTTATTATTAAAGAAAAAAGTAA 9 49 

Qy 915 AGTACACCAAGGAC CATACTGTGCGCTCT- 

ACTGGGCCAGCCA 956 



Db 950 TGTACA — 

TATGGCTGGATTTTTTTGCTTGCTATTCGTTTTTGTGTCACTTGGCATGAGA 10 0 7 

Qy 957 AGTCTC GGGATGGTGAACGTACTGTCTATTGCAACGT 

ACACAAGCATGA — 10 05 

III II I III I I I I I I I I I I I I 

Db 1008 TGTTTATTTTGGACTATTGTATATAATG — 

TATTGTAATATTTGAAGCACAAATGTAATA 1065 



Qy 1006 



-ACC- 



-CCTTGTGCTGTTTTGTGAGAGCTG 



TGATA 103 7 

III I I I I I I I I I I I I I III 

I I 

Db 1066 CAGTTTTATTGTGTTACCATTTGTGTTCCATTTGCTTCTTTGT- 

ATTGTTGCATTTAGTA 112 4 

Qy 103 8 CTCTCACCTGCCGAGACTGCCAG 

C T C AAT GC C C AC AAGG AC C AC C AG T AC C AG T TC T 1094 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1125 CAATCA-GTGTTTAAACTTACTGTATATTTATGCTTTCTGTATTTACCAG 

CTATT-T 1179 

Qy 1095 TAGAGGATGCAGTGAGGAACCAGCGCAAGCTC 

CTGGCCTCACTGGTGAAGCGCC 1148 

I I I I I I I I I I III I I I I I I I I I III 

I 

Db 1180 TAAATGA- 

GCTGTAACTTTCTAGTAAAGAATTGAAAAGCAAATCTCACTAATGATACACA 123 8 



Qy 1149 TTGGGGACAAACATG CAACAT 

TGCAGAAGAGCACCAAG- 1186 

I I I I I I I I I I I I I I I I III 

I I 

Db 1239 

G AT AG AT AAAGC AAG T C TAT CAACAT T AAAAAT AC T AAAAAAT AAAGAC AC AC AC AG AGC 12 9 8 

Qy 1187 GAGGTTCGCAGCTCAAT-CCGCCA GGTGTCTGAC 

GTAC 1223 

II III I I I I I I I I I I I I I I I III 

Db 1299 

ATTTTAGTGACATCCACTACTTATTGCCGCTATGAGTTAGAGTCTATCAGTGTTCTTGTT 1358 
Qy 122 4 AG AAG C 

GTGTGCAAGTGGATGTCAAGATGGCCATCCTGCAGATCATGAAGG A 12 75 

I I I I I I I I I I I I II III I I I I I I 

I 

Db 1359 ATAACCCCCTATTTTCAGG- 

GGGTTAAAAATCAGCTTTAAAAAAATACATAAAAATTTCA 1417 

Qy 1276 GCTGAATAAGCGGGGCCGTGTGCTG GTCAATGATGCCC 

AG AAG G T G AC T G 132 5 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1418 TCTTA — AAGCACTTTCATTTTATACCAACGTGAAAAGTGCCATTTTTAGAA — 

TAACTT 1473 

Qy 132 6 AGGGGC — AGCAGG AGCGCCT GGAGCGGCAGCACTGG- 

ACCAT 1365 

I I I I I I I I III I I I I III 

I I 

Db 1474 TAAAGCTTAACAGGTTTCCTTTTAATATCCTTTTTTTGTGTGC- 

TCTTTACTTACACAAT 1532 

Qy 1366 GACCAAG ATCCAGAAGCACCAGGAGCACATTCTGCG — CTT- 

TGCCTCTTGG 1414 

II I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1533 GGCTTTGTTTTGCTTTTTCAGCCACACC CCTTATGTGAACTAGTGCCT 

TTGG 1584 



Qy 1415 G CTCTGGAGAGTGAC AACAAC 

ACAGCCCTT T 1445 

I I I I I I I I I I I I I III 

Db 1585 

GTATCACGTAAAATTTTTTCCAAAGGGTTACTTTAAAAATCTGTTACCACAATTATGAGA 16 4 4 

Qy 14 46 TGCTTTCTAAG AAGTTGATCTACTTCCAG 

CTGCACCGGGCCCTCAAGA 1493 



Db 1645 

TGATTTTTAAGTGATAAATTAAACTTCTTCTTGTATAAATTCTGCCCAGATCTCTCCACA 17 0 4 

Qy 149 4 TGA-TTGTGGATCCCGTGGAGCCACATGGCGAGATGAA 

GTTTC 1535 



Db 1705 AGAGCTGAGGGTTTCAT — 

AACTTTATGGCTTAATAAATGTATGACACTGAAAAGATTTG 1762 

Qy 1536 AGTGGGACCTCAATGCCTGGA CCAAGAGTGCCGAGGCCTTTG— GC- 

AAGATTGTGG 1589 

I I I I I I III I I I I I I I I I I I I I I II 

Db 1763 AGTGTGA-ATCTA CTGAAATCACTATAATGC 

ACATTGAAGCTATGATGGTAT 1813 

Qy 159 0 CAGAG-CGTCCTGGCACTAACTCAACAGGCCCTGCA 

CCCATGGCCCCTCC 1638 

III II I III I I II I II I I I I I 

Db 1814 TTGAGTAGTGAGGTTACTT — 

T T G AT C G G AG C AAC AT AAT GC T C AT AG AAT C T T C T AG AA 1871 

Qy 1639 -AAGAGCCCCAGGGCCCCTGAGCAA — GCAGGGCTCTGG 

CAGCA- 16 79 



Db 1872 

GAAGAGAAACAAAGGGATTGATAAAATGCTGAGAACTAGTGATTATATATTTTTCTGTAT 1931 

Qy 16 8 0 — GCCAGCCC ATGGAGGTGCAGGAAG 

GCT 1706 



Db 1932 

TTACCTGACATTTATTTTAATGTTCAAAAAGTAAACACTTTAAGTTTGATGTGTTTTACT 1991 

Qy 170 7 ATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGT 

GCAGA G 1747 



Db 1992 CTCTCATTGTTTTAAGTAAT — 

TGCCAACTCAGAATACATCATTCTTAGGCTGAAATTTG 2 049 

Qy 1748 CC CCATGTGTCAGGTGTGAAACGGTCCCG-CTCAGGTGAGGGC- 

GAGGTGA 1796 



Db 2050 TCTTTCCATTTTTTAAG- 

G T G AAAT AG T AC T AC C T T AC G TG AT AGC AT AC AAAGAAGAAA 210 8 



Qy 179 7 GC GGCCTTAT-GCGCAAGGTGCCACGAGTGAGCCTT — 

GAACGCCTGGAC 18 43 



Db 2109 

GC T C T AG AAAG AG AAAT T AT GG AGAATGAT TAT T T AAAT T AC AAT T AAGGAAATGAG AAT 216 8 

Qy 18 44 CTGGACCTCACAGCTGA-CAGCCAGCCACCCGTCTTCAAGGTCT TCCCAG— 

GCAG 1896 



Db 2169 

ATGATCCCCTCTTCCGAGTTGCCCACAAACTTGCTTCTTTGCTTTTGCTCCCTGTAATAG 2 2 2 8 

Qy 189 7 TACCACTGAGGACTACAACCTTATTGTTATTGAACGTGGCGCTG 

CCGC 1944 



Db 2229 AACTACT TTTCAACAAATCTAATTTTGCAC — 

GGCACCGTTAACCATATTTTCAC 22 81 

Qy 1945 TGCAGCTACCGGCCAGCCA-GGGACTGCGCCTGC 

AGGAACCCCTGGTGCCCCACCC 1999 



Db 22 82 TACAGCAAACTTAGTGCTATGGGTTTTCTTTTTCTTTGTTTTTTTCTTGAT 

CAC 2335 

Qy 2000 CTGGCT-GGCATGGCCATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGA 

GCCC 2054 



Db 2336 TTGTATAGGAAACAACATTTTCCAG 

TGTTATTTGCATATATATTTTGTCC 23 85 

Qy 2055 CTCCTA C T GC C — AC T G AGGGCC C T GAGA — 

CCAAACCTGTGCTTATGGCTCTTGCG 2107 



Db 23 86 TTCCAATATATGCATTACAGATGAAAATTAAATGTTATACCTG 

AATTCTTGGG 2438 

Qy 2108 GAGGGTCC TGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAGTGGCAGCAC- 

CAG 2160 



Db 2 439 T T G G G G C C AAAAT AT T AAGC T G AAAAT AAT — 

GCTGGTGTGGATTTGTTTTAAAACAAAG 2496 

Qy 2161 CTCAGGGCTGGAGGTG GTGGCTCCTGAGGGTACCTCAGC CCCAGGTGGTGG- 

CC 2213 



Db 2497 

CTTTATTATGAACATGCATGTGAATCTGGATATTGCCTCTTATTTTTAAGAAAATGGTTC 2 556 

Qy 2214 CGGGAACCCTGGATGACA GTGCCACCATTTGC — C 

GTGTCTGCCAGAAG 226 0 

I I I I I I I I I I I I III III I I I I I I I I I 



Db 2557 

TGTGAAAAGTGAATGATATGTATTTTTCCAAATGCTTCATGGTTAGGAGTCTTCAAGTT- 2 615 



Qy 2261 CCAGGCGATCTGGTTATGTG — CAACCAGTGTGAG 

TTTTGTTT 2301 

I I I I I I I I I I III I II 

Db 2616 

CCATGTTCCCCAGATTTGAGATATACTAAAGAAAGAAATTCAAAAGTAGCTATTTGGGGC 2 6 75 

Qy 2302 CCACCTG GACTGTCA 

CCTGCCGGCCCTGCAGGATGTACCAGGGGAGGAGTG 2352 



Db 2 6 76 C C AC AAAAAT AAC TAT TAT T T T AGCC T T AGAGCC T T AC A- 

CTTGTTTCAT G AAG AG AAAG 2 73 4 

Qy 2353 GAGCTGC-TCACTCTGCCATGTGCTC CCTGACC 

TG AAG GAG — 2392 

I I I I I I I I III I I I I I I 

Db 2 735 GACTTGCATAAC 

C AAAAT AAAC AAAGC AAGAC AAAT T AAAAAT ATGTGGGGGAGAG 2 79 0 

Qy 2393 GAGGA TGGCAGCCT CAGCCTGGATGGTGCAGA- 

CAGCACTGGCGTG — 2 43 7 

III III II I I I I I I I I I I I I III 

Db 2791 

ATCAGTGAAAAGTGGTTTTCTTAATGCAGCCCTGCTGGTCCCCATTAACAATTGCTTGAA 2 85 0 

Qy 243 8 GTGGCCAAGCTCTCA- 

CCAGCCAACCAGCGGAAATGT 2 4 73 



Db 2851 

ATTCACATGGATGTAAAATTATAATTGTCAGGATCTTATTCAGATGATCTTTTAAGGT-T 2 9 09 

Qy 2 474 GAGCGTGTACTGCTGGCCCTATTC TGTCACGAACCCTG 

CCG 2514 

II II I I I I I II I I I I I I III 

Db 2910 

TAACTGGTTTTGCTTTTGTTTATCTATATGTCAAAATACTTGTAAATTGGGAACAAACTT 2 969 

Qy 2515 CCCCCTGCATC-AGCTGGCTACCGACTCCACCTTCTCCCTG 

GACCAGCCCGGTGGCA 2570 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 29 70 CTCTCAGCTTCTTGAAGTTGTTCAACT- 

ATCCTTGCCACTGGAAGACCAAACAAGGTTTT 302 8 

Qy 2571 CCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGAAGT- 

TGTCACCTCCCTACAGC 2629 



Db 3 029 CACTG CTTTTTCTTTTACATAATATGCT 

GAGAATTATTTCTTATGCTTTTTAC 3081 

Qy 2 63 0 TCCCCACAGGAGTTTGC-CCAG GATGTGGGCCGCA 

TGTTCAAGCAAT 2675 

II III II I I I I III I I I I I I I I I I 

Db 3082 

TACAAACAAAATTACTCACCTGGATTAAAGATTAAGGCCTTAATCTGTTTAGATTATCTT 3141 



Qy 2676 — 

TCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTGCAGCGCT 2 732 



Db 3142 T AAT C T C C AT G AAAT C G T G AAAT AAG AC — 

AAGAATAGTGTTTCAGCTGTAGGCCATTTT 3199 

Qy 2 733 TCTTCGAGACGCGCATGAA- 

CGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGGTG 2 791 



Db 32 0 0 ACAGCTAATTGCCCATAAATTGTAGCATTTATTGACCTGAAGTACTAAGC — 

TAATTGTC 3257 

Qy 2792 GAGCCCCCGCCGATGAGCCTGCCTGG — TGCTGGC — 

CTGAGTTCCCAGGAGCTGTCTGG 2 8 47 



Db 3258 TTGACTACTC AAAGCC — CCTGAATTGTTGTCAACT 

TGTTGTG 3308 

Qy 2 8 4 8 TGGCCCTG — GTGATGGCCCCTG-AGGCTGGAGCCCCCA — TGGCCAGCCCAGCCTG- 

GC 2901 



Db 3309 

TAGCCCTAACGTCATTTAGCTTGTTGTCTGATGCCTCCAGTAGGACACCTCCGATGGAGC 336 8 

Qy 2902 TCTGTTCTCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGCCTGACTCCCA 

CTC 2957 

I I I I I I I I II I I I I I I I I I III I I I I I 

I I 

Db 3369 TTTGATTTCTGAGCAG CGAAAGCTCCCTTCCTAAGATGCAT — 

CTCGCATAGGCTG 3422 

Qy 2958 CC TGGTGGCCC CATCCCCCAGTTCCTCA 

CGATATGGTTTTTACTT C 3003 



Db 3 42 3 C C T AT GAT G AAGG AC C G T GC A- 

CCTCCACTCCAACAGAGTGCTGAGTTTAAAAGTTGACC 3 481 

Qy 3004 TGTGGATTTAAT AAAAAAAAC T T C AC C AG 

TTCAAAAAA 3041 

I I I I I I I I I I I I I I I I I I 

I I I I I I I I I 

Db 3482 

TGTGTTTGTAATTTCACTTTCATCTTGCTTAATAAATATCTGCTGGATTCTTTCAAAAAA 35 41 

Qy 3042 AAAAAAAAAAAAAAA 3056 

I I I I I I I I I I I I I I I 
Db 3542 AAAAAAAAAAAAAAA 3556 



I I I I I I I I I 
TTCCCCTTTG- 
Qy 1 GCGGGGCGGGCC GGC

GGCGGAGG 23 

I I I I I I I III I 

Db 3587 

GCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAAAGAAT 352 8 

Qy 2 4 CCGGGCCGC GGAGCCAG GAGTG 

ACTAGCAGCAGTTGGCCGTGCC 6 7 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 352 7 CC-AGCAGATATTTATTAAGCAAGATGAAAGTGAAATTACAAACA- 

CAGGTCAACTTTTA 3 470 

Qy 6 8 GTAGCAGCGTCCCGC — GCGCGGCGG-GCAGCGGCCC AGGAGGC 

GCGT 112 

I I I I II I I I I I I I I I I I I I I III II 

I I I 

Db 3 469 AACTCAGCACTCTGTTGGAGTGGAGGTGCA- 

CGGTCCTTCATCATAGGCAGCCTATGCGA 3 411 



Qy 113 GGTGC GGG TTTCGGCGGCGGCTGAGGAA — 

GAAGCGCGGGCGGCG — 155 

I I I I III I I I I I I I I I I I I I 

Db 3 410 GATGCATCTTAGGAAGGGAGCTTTC 

GCTGCTCAGAAATCAAAGCTCCATCGGAGGT 3355 



Qy 156 — CCTTCGGGAGGCGAGCAGGCAGCAGTTGGC — 

CGTGCCGTAGCAGCGTCCCGCGCGCG 211 

I I I I I I I I I I I I I I I I I I II I I I I I II II I 

I 

Db 335 4 GTCCTACTGGAGGC-AT C AG AC AAC A AGCTAAATGACGTTAGGGCTACAC- 

AACACA 33 0 0 

Qy 212 GCGGG CAGCGGCCCAGGAGGC GCGTGG CGGC 

GCTCGG — C 249 

III III I I I I I I I I I I I II I I I I 

I 

Db 3299 AAG G G G AAAG T T G AC AAC AAT T C AGG - 



GGC T T T GAG T AG T C AAG AC AAT TAGC T T AGT AC 32 41 



Qy 250 CTCGCGGCGGCGGCGGCGGCAGC GGCCC — AGCAGT TGG CGGC— 

GA 292 



Db 3240 

TTCAGGTCAATAAATGCTACAATTTATGGGCAATTAGCTGTAAAATGGCCTACAGCTGAA 3181 

Qy 293 GCGC GTCT 

GCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCC 335 

II I I I I I I I I I I I I II I 

Db 3180 

AC AC TATTCTTGTCTTATTT C AC GAT T TC ATGGAGAT T AAAGAT AAT C T AAAC AG AT T AA 3121 

Qy 336 — CCCTGGGC GCCCCCGGCG 

GCGTGTGAATGGCGGCCTCCGCGGCGGCAG 383 

III I I I I I I I I I I I I II II 

Db 3120 

GGCCTTAATCTTTAATCCAGGTGAGTAATTTTGTTTGTAGTAAAAAGCATAAGAAATAAT 3 061 

Qy 384 CCTCGGC AGCAGCGGCCTCGGCCGCC TCTGG-CAGCCCG- 

GGCCC 426 

I I I I I III II I II I I I I I I I I 

Db 3060 

TCTCAGCATATTATGTAAAAGAAAAAGCAGTGAAAACCTTGTTTGGTCTTCCAGTGGCAA 3 0 01 

Qy 42 7 GGGCGAGGGCTCCGCT GGCGGCGAAAAG — CGCTCC 

ACCGCCCCTTCG 472 



Db 3000 

GGATAGTTGAACAACTTCAAGAAGCTGAGAGAAGTTTGTTCCCAATTTACAAGTATTTTG 2941 

Qy 473 GC CGCAGCC— TCGGCC TCTGCCTCAG — 

CC 499 



Db 2940 

ACATATAGATAAACAAAAGCAAAACCAGTTAAACCTTAAAAGATCATCTGAATAAGATCC 2 8 81 
Qy 50 0 GCGGCG 

TCGTCGCCCGCGGGGGGCGGCGCCGAGGCGCTGGAGCTGCTGGAGC 551 



Db 2880 -TGACAATTATAATTTTACATCCATGTG AATTTCAA 

GCAATTGTTAATGGGGA 2 829 

Qy 552 ACTGC — GGC- 

GTGTGCAGAGAGCGCCTGCGACCCGAGAGGGAGCCCCGCCTG 6 01 

III I I I I I I I I I I II III 
Db 2 82 8 CCAGCAGGGCTGCATTAAGAAAACCACTTTTCACTGATCTCTC- 

CCCCACATATTTTTAA 2 770 

Qy 602 CTGCCCTGTT TGCA 

CTCGGCCTGTAGTGCCTGCTT 636 



Db 2769 TTTGTCTTGCTTTGTTTATTTTGGTTATGCAAGTCCTTTCTCTTCATGAA — 

ACAAGTGT 2 712 



Qy 63 7 AGGGCCC GC GGCCCCCGC- 

CGCCGC 660 

I I I I I II I I I I I I III 

Db 2711 

AAGGCTCTAAGGCTAAAATAATAGTTATTTTTGTGGGCCCCAAATAGCTACTTTTGAATT 2 652 

Qy 661 CAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGC — 

ACCGTG 703 

II I I I I I II II I I III I III 

I I 

Db 2651 TCTTTCTT TAG T AT AT C T C AAAT C T GGGG AAC AT G G AAC — 

T T G AAG AC T C C T AAC C AT G 2 59 4 

Qy 704 G TGG ACTGTCCCGTGTGC 

AAGC 725 

III I I I I I I I I 

I I I 

Db 2593 

AAGCATTTGGAAAAATACATATCATTCACTTTTCACAGAACCATTTTCTTAAAAATAAGA 2 53 4 

Qy 726 AAC AG T GCTTCTC 

C AAAG AC AT C G T GG AG AAT T AT T T 762 

III I I I I I II III I II I I I I I 

Db 2533 

GGCAATATCCAGATTCACATGCATGTTCATAATAAAGCTTTGTTTTAAAACAAATCCACA 2 4 7 4 

Qy 763 CATGCGTGATAGTGGCAGC AAGGCTGCCACCGACGCCCAGGATGCG 

AAC— 811 

I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 24 73 CCAGCATTATTTT — CAGCTTAATATTTTGGCCCCAAC- 

C C AAG AAT T C AGG T AT AAC AT 2 417 

Qy 812 CAGTGCTGCACTAGCTGTGAGGATA 

ATGCCCCAGCCACCAG 852 

III I I I I II I I I I I I I I I III 

I 

Db 2416 

TTAATTTTCATCTGTAATGCATATATTGGAAGGACAAAATATATATGCAAATAACACTGG 235 7 

Qy 853 

CTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCACCAGCGGGT 912 

I I I I Ml I I I I I I I I III III I 

Db 2356 AAAATGT TGTTT CCTATACAAGTGATC AAGAAAAAAACAA — 

AGAA 2313 

Qy 913 GAAGTACACCAAGGACCATA CTG 

TGCGCTCTACTGGGCCAGCCAA 957 

I I I I I I I I III III I I I I I I I I I 

I I I 

Db 2312 

AAAGAAAACCCATAGCACTAAGTTTGCTGTAGTGAAAATATGGTTAACGGTGCCGTGCAA 2 2 53 
Qy 958 GTCTCGGGATGGTG- 

AACGTACTGTCTATTGCAACGTACACAAGCATGAACCCCTTGTGC 1016 

II I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I 

Db 2 2 52 AATTAGATTTGTTGAAAAGTAGT-TCTATTACAGGGAGCAAAAGCAAAGA 

AGC 2201 



Qy 1017 TGTTTTGTGAG-AGCT 

GTGATACTCTCACCTGCCGAGACTGCCAGCTCA 1064 



Db 22 0 0 AAGTTTGTGGGCAACTCGGAAGAGGGGATCATATTCTCATTT- 

CCTTAATTGTAATTTAA 2142 

Qy 1065 ATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGC 

AGTGAGGAACCAGCG 1119 



Db 2141 ATAATCATTC— TCCATAATTTCTCTTTC-TAGAGCTTTCTTCTTTGTATGCTATCA- 

CG 2086 

Qy 112 0 CAAG CTCCTGGCCTCACTGGTGAAGC 

GCCT 1149 

III I I I I I I I I I I I I I I 

Db 2 0 85 TAAGGTAGTACTATTTCACCT- 

TAAAAAATGGAAAGACAAATTTCAGCCTAAGAATGATG 2 02 7 

Qy 1150 TGGG — GACAA — ACATGCAACATTGC AGAAGAGCAC 

CAAGGAG-G 119 0 



Db 2026 

TATTCTGAGTTGGCAATTACTTAAAACAATGAGAGAGTAAAACACATCAAACTTAAAGTG 196 7 

Qy 1191 TTCGC AGCTCAATCCGCCAGGTGTCTG — AC G T AC AG AAGC G T G T 

GCAAG 1238 



Db 1966 

T T T AC TTT T TGAAC AT T AAAAT AAAT G T C AGG T AAAT AC AG AAAAAT AT AT AAT C AC TAG 1907 

Qy 1239 TGGATGTCAAGATGGCCATCC 

TGCAGATCATGAAGGAGCTGAATAAGCGGGGCC 1292 



Db 19 06 TTCTCAGCATTTTATCAATCCCTTTGTTTCTCTTCTTCTAGAAG 

ATTCTATGAGCA 1851 

Qy 1293 

GTGTGCTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGC — G 135 0 
I I I I I I I I I I I I I III II I II 

Db 1850 TTATGTTGCTCC GATCAAAAGTAACCTCA CTACTCAAATACC 

ATCATA 18 03 

Qy 1351 GCAGCACTGGACCATGACCAAGATCCAGAAGCACCAGGAGCACA 

TTCTGCGCTTTG 1406 



Db 1802 

GCTTCAATGTGCATTATAGTGATTTCAGTAGATTCACACTCAAATCTTTTCAGTGTCATA 17 43 

Qy 140 7 CC— TCTTGGGCTCTGGAG-AGTGACAACAACACAGCCCTTTT 

GCTTTCTAAGAAGT 146 0 



Db 17 42 CAT T TAT TAAGCCATAAAGTTATGAAACC — 

CTCAGCTCTTGTGGAGAGATCTGGGCAGA 16 85 



Qy 1461 TGATCTACTTCCAGCTG CACCGGGCCCTCA — 

AGATGATTGTGGATCCCG 150 8 



Db 1684 

ATTTATACAAGAAGAAGTTTAATTTATCACTTAAAAATCATCTCATAATTGTGGTAACAG 162 5 

Qy 1509 TGGAGCCACATGGCGAGATGAAGTTTCAGTGGGACCTCAATGC- 

CTGGACCA 1559 



Db 162 4 ATTTTTAAAGTAACCCTTTGG 

AAAAAAT T T T AC G T G AT AC C C AAAGGC AC TAGTTCA 1568 

Qy 1560 AGAGTG CCGAGGCCTTTG 

GC 1579 

I I I I I I I I I I I I 

Db 1567 

CATAAGGGGTGTGGCTGAAAAAGCAAAACAAAGCCATTGTGTAAGTAAAGAGCACACAAA 15 0 8 

Qy 158 0 — AAGATTGTGGCAGAGCGTCCTG GCACTAA CTCAACA- 

GGCCCTGCAC 1625 



Db 1507 

AAAAGGATATTAAAAGGAAACCTGTTAAGCTTTAAAGTTATTCTAAAAATGGCACTTTTC 14 4 8 
Qy 1626 CCATGG 

CCCCTCCAAGAGCCCCAGGG 1651 

III I I I I I I II 

Db 1447 

ACGTTGGTATAAAATGAAAGTGCTTTAAGATGAAATTTTTATGTATTTTTTTAAAGCTGA 13 8 8 

Qy 1652 CCCCTGAGCAAGCAGGG 

CTCTGGCAGCAGCCAGCCCATGG 1691 



Db 1387 

TTTTTAACCCCCTGAAAATAGGGGGTTATAACAAGAACACTGATAGACTCTAACTCATAG 132 8 

Qy 1692 AGG TG 

CAGGAAGGCTATGGCTTTGGGTC 1719 

II II I I I I I I I I I I I I I I 

Db 132 7 CGGCAATAAGTAGTGGATGTCACTAAAATGCTCTG — 

TGTGTGTCTTTATTTTTTAGTAT 1270 

Qy 172 0 AG GAG AT GAT C 

CCTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTG 1765 

I III I II III III III I I I I 
Db 1269 TTTTAATGTTGATAGACTTGCTTTATCTA — 

TCTGTGTATCATTAGTGAGATTTGCTTTT 1212 

Qy 1766 AAACGGTCCCGCTCA 

GGTGAGGGCGAGGTGAGCGG 1800 

I I I I I I I I I I I I I I I 

Db 1211 C AAT T C T T T AC T AGAAAGT T AC AGC TC AT T T AAAAT AGC TGG T AAAT AC — 

AGAAAGCAT 1154 



Qy 



18 01 CCTTATGC-GCAAGGT — GCCAC-GAGTGAGCCTTGAACGCCTGGAC- 



CTG 1846 



Db 1153 

AAAT AT AC AG T AAG T T T AAAC AC TGAT TGT AC T AAATGC AAC AAT AC AAAGAAGC AAAT G 109 4 

Qy 18 47 GACCTCACA-GCTGACAGCCAGCCACCCGTCTTCAA 

GGCAGTAC 1899 



Db 1093 GAACACAAATGGTAACA- 

C AAT AAAAC TGTATTACATTTG T GC T TC AAAT AT T AC AAT A- 1036

Qy 19 0 0 CACTGAGGACTACA-ACCTTATTGTTATTGAACGTGGCGC- 

TGCCGCTGCAGCTACCGGC 1957 



Db 1035 CATTATATACAATAGTCCAAAATAAACAT- 

C T C AT GC C AAG T G AC AC AAAAAC G AAT AGC 977 

Qy 1958 CAGCCAGGGACTGCGCCTGCAGGAACC — CCTGGTGCC- 

CCACCCC 2000 



Db 976 

AAGCAAAAAAATCCAGCCATATGTACATTACTTTTTTCTTTAATAATAAACATTCAACTC 917 
Qy 2001 TG- 

GCTGGCATGGCCATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCTCCT 2 059 



Db 916 TGAACTGG GGCAATT-TCACTAC ATACAGATGCAGTC- 

CGCCTTTT 873 



Qy 2060 

ACTGCCACTGAGGGCCCTGAGACCAAACCTGTGCTTATGGCTCTTGCGGAGGGTCCTGGT 2119 
III II I I I I I I I I I II I I I I I III 

Db 872 ATTCCATATAACCATCCTG CCTTCCTATATCTAC-

GCTATTCAGTAGGTTCCT 821 

Qy 2120 GCTGAGGGTCCCCGCCTGGCCTCACCTAGTGGCAGCACCA

GC 2161 



Db 820 GTGTCAATTCTT ATGTACATGTGG-

AGCAAAAAAGATAAAAGAATGTAGTGC 770 

Qy 2162 TCAGGG — CTGGAGGTGGTGGCTCCTGAGGGTACCTCAGC — 

CCCAGGTGGTGGCCCGGG 2217 

III II I I I I I I II III I I I I I I I I I I I I 

Db 769 TCTGTATTCTTAATGGGGTG ATGTTGATAGATCAGCGAACCTGG-GGT- 

GCAT 719 

Qy 2218 AACCCTGGATGACAGTGCCACCATTTGCCGT- 

GTCTGCCAGAAGCCAGGCGATCTGGTTA 2276 



Db 718 AACCTTCTTTCATTAAACCTTCCT CGTAGTCCGTCTG — GCAAAG- 

GATCATGTTA 666 



GGTCTTCCCA 

I I I I I I II 




-CTG- 



III I II 



I I 



Db 665 T-



TCTTTAGGAAAAATTTGTCTCCAACACAAAATCTCTGATTACAAAGCTGACAT 612 

Qy 232 9 GCAGGATGTACCAG — GGGAGGAGTGGAGCT — 

GCTCACTCTGCCATGTGCTCCCTGACC 2384 



Db 611 GCAAAGCAGTCCAGGTGGTAAACATTGTCCTTGGCACGCATCACCAT 

CTCAAAGGC- 556 

Qy 23 85 TGAAGGAGGAGGATGGCAGCCTCAGCCTG- 

GATGGTGCAGACAGCACTGGCGTGGTGGCC 2 443 



Db 555 — AGGGATGAGCTT AC T AC AGGC AGCGC AG TTTCCCGT T AC AC 

CAAAGAGCCT 505 

Qy 2 444 AAGCTCTCACCAGCCAACCAGCGGAAATG-TGAGC 

GTGTACTGCTGGCCCTATTCT 2 49 8 



Db 50 4 CAGATAGTCTCTGCGA — C AAAGGAT AAGAT T AGC T T TAG T G T AC AG — 

GG T 457 

Qy 2499 GTCACGAACCCTGCCGCCCC CTGCATCAGCTGGCTACCGACT — 

CCACCTTC- 2548 



Db 456 G GAGCCC- 

ACCTCCCCCAAGCGACAGTCACAGCAGGCACACTTCAGGCAGTCTTCA 4 02 

Qy 2549 

TCCCTGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAG 26 0 8 



Db 401 TGCCAGTATTTGTCCAGTG CCTTTAGAAGATACCGGTCCTTGAT — 

CTTTCGGTTG 348 

Qy 26 09 AAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCC 

GCATG 2665 



Db 3 47 CAGCCAGCACAACCTTTCGGCT 

TGGTGTCTGGCTGGACTGAGAGCATTTGTATA 294 

Qy 2666 TTCAAGCAATTCAAC 

AAGTTAACTGAGGACAAGGCAGACGTGCAGTCCA 2 714 



Db 293 CCAAAACTTTTCTCCTTTTTCTGCATGAGTTGACTTAGG-CGTGTCTGAGTTGCAG 

CA 237 

Qy 2715 

TCATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAG- 2 7 73 



I I I I I I 



III I 



I I I I I 



I I I 



I I 



I I I I I I 



Db 236 GCTTC-GCATTGAGCAC C- 

AAACCCAAAGGTAGAAGTAGAAGGGGGTCTCCTTGA 184 



Qy 2 774 -TTCTCTGCTGTG CTGGT — GGAGC 

CCCCGCCGATGAGCCTGCCTGGTGCTG 2822 

I I I I I III I I I I I I I I I I I I I I I I I I I 

Db 183 TTTCGCTTAAGTGTGGACCTGGTGCGCAGCCTACACCGCCGAGGA — 
CCGACTATTG 129 



Qy 2823 GCCTGAGTTCCC AGGAGC-TGTCTG-GTGGC- 

CCTGGTGATGGCCCCTGAGGCTGGA 2 8 76 

III II I I I I I I I I I I I I I I I I I I I I I I 

Db 12 8 TGAAGCCACTTTGGGAGCGGGTCGGAGTGGCGGCAGGGGGTGGGGGAAG- 

GGATGAG 73 



Qy 2 8 77 GCCCCCATGGCCAGCCCAGCCT- 

GGCTCTGTTCTCTGTCCTGTCACCCCATCCCCACTCC 2 935 

II I I I I I I I I I I I I I I I I I I III 

I I I 

Db 72 GAC GGCCAGACAAGACAGGGCGC ACACACGGAGCCC- 



Qy 2936 CCTGGTGGCCTGACTCCCACTCCCTGGTGGCCCCATCCCCCAG 29 78 

I III I I I I I I I I I I I II 

Db 32 CAGTGTG C AAAAT G AT GG - C G AAT C C AG TAG 3 
Qy 8 GGGCCGGCGGCGGAGGCCGGGCCGCGGAG —

CCAGGAGTGACTAGCAGCAGTTGGCCGTG 65 



Db 1 GTTACTTCTGC — TCTAAAAGCTGCGGAATTCC-

-TCGAGCA-CTGTTGGCC-

TA 49



Qy 66 CCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGC- 

CCAGGAGGCGCGTGGTGCGGGTTTC 12 4 

II I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 5 0 CTG GGTGTACTGTG-GAGACTGTCAAAGTCTCCCGGAGCC 

CAATTTCC 96 

Qy 125 GGCGGCGG 

CTGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGC 17 8 

II I I I I I I I I I I I I I I I I I I I I I I III I 

I I 

Db 9 7 GGAAGCGGTGAGTTCTGA — AAGAAGTTC CTGCACCGTAGTTTCCCAAGT 

CTGC 148 

Qy 179 AGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAG- 

GAGGCGCGTG 23 7 

I II III I I I I I I I I I I III I I I I I III I 

Db 149 GAAT-CCCCAACCATG— AGCGCCTCGGGCGT- 

ACTGTCCTTTACCCAGCAAGGATGGGA 204 

Qy 238 GC- 

GGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCGAGCGC 2 96 

I I I I I I I I I I I I II I I I I I I III I I I I I 

I I 

Db 2 05 GCAGGTGCT-GGCCAAAGTGAAACG GGCTGTGGTTTACCTG — 

GACGCCGCCTGC 256 

Qy 29 7 GTC- 

TGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGGCGCCCCCGGCGGC 355 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 257 GCCGAGAGCCTGCACTGGGG CTGCG GATCCACCCGT 

CTCCTGGAGGC 3 03 

Qy 356 

GTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCGGCCGCCTCTGGC 415 

I I I I I I I I I I I I I I I I I I I I 

Db 304 G-GTG GGGGGCC CTGACT 

GTCACCTGCGAG 332 

Qy 416 

AGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCC 475 

II I I I I I I III I I I I I I III I I I I I 

I I I 

Db 333 AGTTCGAGCCC GACGCAATTGGTGGTG GAGCCAAGCAGCCC — 

AAGGCA 3 79 

Qy 476 GCAGCCTCGGC — 

CTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGG 533 

I I I II I I I I I I III III I I I I I I I I I II 

I 

Db 380 G-TGTTTGTGCTGAGCTGCCT — GCTGAAAGGCCG — GACCGTGGAGATC — 

CTACGGGA 432 

Qy 53 4 CGCTGGAGCTGCTGGAG-CACTGC — 

GGCGTGTGCAGAGAGCGCCTGCGACCCGAGAGGG 590 

I I I I I I I I I I I I I I I I I I I II II I I I I I I I 



Db 433 CATC — ATCTGCCGCAGTCACTTCCAGTATTGTGTGGTGGTCAC- 

AACCGTGAG — 483 



Qy 591 

AGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGCCC 65 0 

III I I I I I I I I I I I I III I I I I I I I 

Db 48 4 CCAC — GCTGTCC AC C T C AC AGC T AAT C AT G T C 

CCAGCGGCGG 52 4 

Qy 651 

CCGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACT 710 
I I I I I I I I I I I I I I I I I I I 

Db 525 CAGCGGCC 

GAGATGGAGGGGCAGCAG 55 0 

Qy 711 

GTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCGTG 7 7 0 

I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 551 — CCGGTGTTCGAGCAGCTG GAGGAGAAG 

CTGTGTG 584 

Qy 771 ATAG TGGCAGCAAG-GCTGCCACCGACGCCCAGGATGCGAACCA- 

GTGCTGCACTAG 825 

I I I I I I I I I III I I I I I I I I I I I I I I I I I 

I 

Db 585 AATGGATGGGCAACATGAACTAC ACGGCC— 

GAGGTGTTCCATGTCCCGTTATTG 63 7 

Qy 826 

CTGTGAGGATAATGCCCCAGCCACCAGCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGA 8 85 

II I I I I I I III I I I I I I I I I I I 

Db 638 CT TGCCCC TGT TGCTC 

CCCACTTTG 662 

Qy 886 

GACCTGTGTAGAGGCGCACCAGCGGGTGAAGTACACCAAGGACCATACTGTGCGCTCTAC 9 45 

I I I I I I I I I I I III II I I I I I I I 

I 

Db 663 — CCT-TG ACTCCAGCTTTTGCA TCC 

CTTTTCCCACTGC 698 

Qy 9 46 TGGGCCAGCCAAGTCTCGGGATGGTGAACGTACTGTCTATTGC 

AACGTACAC A 99 8 

I II III I I I I I I I I I I I I II II I I I I I I 

I 

Db 699 T— ACC— CCA GGAT- 

GTGCACCTCCTTAATAGCGCCC G AC C GGAC AAGAGGA 746 

Qy 999 

AGCATGAACCCCTTGTGCTGTTTTGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCC 105 8 

III II II I I I I I I I I III III 

I I 

Db 747 AGCTGGGAAGCC TGGGTGATGTGG ACTCCACTAC 

GCT 783 

Qy 1059 

AGCTCAATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAGC 1118 

II I I I I I I I I I I I I I I I I I I I 
Db 784 AAC CCCA GAGCTGCTGCTGCAGATC

AGATGC- 814



Qy 1119 

GCAAGCTCCTGGCCTCACTGGTGAAGCGCCTTGGGGACAAACATGCAACATTGCAGAAGA 117 8 

III I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 815 -CTAGTGTCAGGCCTCAGT TCTCTGTGTG AACATT 

TAGGA 853 

Qy 1179 

GCACCAAGGAGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTGTGCAAG 123 8 

III I I I I I I I I I I I I I I I III 

Db 854 GTAC — GGGAGG AGTGTTT TGCTGTAG 

GTTCCT- 884 

Qy 1239 TGGATGTCAAGATGGCCATC- 

CTGCAGATCATGAAGGAGCTGAATAAGCGGGGCCGTGTG 12 9 7 

I I I I I I I I I I I I I I I I I I I I I II I I I I I II 

Db 8 85 — TAAGTCA GGTCATCGCTGCGGATC-TG GCCAATTATGC 

CC 923 

Qy 1298 

CTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGCAGCAC 135 7 

III I I I I I I I I I I I I I I I I I I I I I II I I I I 

I I 

Db 92 4 CTG— CAAAGA — AC AGGAAGAAGAC T GCTGCAGG CAG 

GGCATCAG 965 

Qy 1358 TGG — ACCATGACCAAGATCCAGAAGCAC- 

CAGGAGCACATTCTGCGCTTTGCCTCTTGG 1414 

III II II I I I I I I I I I I I I I I I II III 

I I I 

Db 966 TGGTTTTTGTGGACAGAACCCTGGATCTCACAGGAGCA 

GTTGGACATCATGG 1017 

Qy 1415 

GCTCTGGAGAGTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTACT 1469 

I I I I I I I I I I I I I I I I I I 

I 

Db 1018 AGACAACTTAGTAGAG-AAGATCATTTCAGCACT 

TCC- 1053 

Qy 1470 TCCAGCTGCACCGGGCC-CTCAAGATGAT-TG- 

TGGATCCCGTGGAGCCACATGGCGAGA 152 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I 

Db 1054 -CCAGCT — CCCAGGCCACACAA-ATGATGTGATGGTTAACATG 

ATAGCG 1099 

Qy 152 7 TGAAGTTTCAGTGGGACCTCAATGC- 

CTGGACCAAGAGTGCCGAGGCCTTTGGCAAGATT 1585 

I I I I I I I I I III III I I I I I II 

I 

Db 1100 CTCACTGCACT CCAT- 

ACTGAGGAGGAAAATTATAATGTG 113 8 

Qy 1586 GTGGCAGAGCGTCCTGGC — 

ACTAACTCAACAGGCCCTGCACCCATGGCCCCTCCAAGAG 16 43 

I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1139 GTTGCA CCAGGCTGTCTTTCACAA TTCAGTGACACCAC 



AG 1178 



Qy 1644 

CCCCAGGGCCCCTGAGCAAGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCAGGAAG 17 03 
III I I I I I I I I I II III I I I I I I I I I 

Db 1179 — CCAAAGCCCTATGGGAAGCTTTACT 

GAAC AC T AAGC AC AAAGAGG 12 23 

Qy 1704 

GCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTCAGGTG 1763 
I I I I I I I I I I I I I I I I 

I I I 

Db 122 4 CA-GTGATGG AAG TTCGGAGAC 

ATCTAGTG 1252 

Qy 1764 

TGAAACGGTCCCGCTCAGGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCACGAG 1823 
I I I I I I I I I III I I I I I I I II 

I 

Db 12 53 -GAAGCGG-CAAGC AGAGAAAACCTGCC — 

AATCAAGATGAGTATGGGG 129 7 

Qy 1824 

TGAGCCTTGAACGCCTGGACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCAAGG 18 83 
I I I I I I I I I I I I I I I I I I II I I I I 

I I I I I I I I 

Db 129 8 AGAGTC ACACCGGGAC AGCTCATGTCCTATATTCAGC 

TCTTCAAGA 13 43 

Qy 1884 TCTTC — CCAGGC 

AGTACCACTGAGGACTACAACCTTATTGTTATTGAACGTGGC 1936 



Db 13 4 4 ACAACCTCAAAGCTCTAATGAATCATTGTGGCCT-CCTCCAGCTTG- 

GACTGGCCACAGC 1401 

Qy 193 7 GCTGCCGCTGCAGCTAC — CGGCCAGCCA 

GGGACTGCGCCTGCAGGAACCCCTGGT- 199 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1402 TCAAACGTTGAAACACCCACAGACTGCCAAGTGGGAC — AACTTTCTGG — 

CTTTTGAAA 1457 

Qy 1991 -GCCCCACC — CCTGGC — TGGCATGGC — 

CATTGTCAAGGAGGAGGAGACGGAGGCTGC 2043

I I I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 1458 

GGCTCCTTCTTCAGAGCATTGGGGAGTCAGCAATGTCCGTTGTGTTAAATCAGCTGCTGC 1517 

Qy 2 0 44 CATTG GAGCC CCTCCTACTGCCACTGAGG GCCCTGAG — 

ACCAA 2 0 85 

I II I I I I II I III I I I I I I I I I I I I 

I 

Db 1518 

CCATGATTAAGCCTGTAACCCAGAGAACCAACGAGGACTACAGCCCTGAGGAACTGCTGA 15 7 7 

Qy 2 0 86 ACCTGTGCTTATGGCTCTTGCGGAGGGTCCTGGTGCT- 

GAGGGTCCCCGCCTGGCCTCAC 2144 

III I I I I II I I I I I I I I I I III 

Db 1578 TCCTTCTCATATATATTT 



ATTCTGTCACTGGAGAGCTCACGG T AG AC AAAG A 1629 



Qy 2145 CTAGTG-GCAGCACCAGCTCAGGGC- 

TGGAGGTGGTGGCTCCTGAGGGTACCTCAGCCCC 22 02 

I I I I I I I I I II III III I I I I I I I I I I I I I 

Db 163 0 C C T G T G T G AAGC AGAAGAAAAAGTC AAGAAAGC AT TGGC T C AGGT- 

CTTCTG 1680 

Qy 2203 

AGGTGGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCCGTGTCTGCCAGAAGCC 2 2 62 

III I I I I I I I I I I I I I I I I I I I I I I I 

Db 16 81 TGAGGAATCTGGAT TGTCACC-TTTG 

CTGCAAAAAATT 1717 

Qy 2263 AGGCGATCTGGTTATGTGCAACCAGTGTGAGTTTT GTTTCCA CC- 

TGGACT 2312 

I II I I I I I I I I I I I I I I III I I I I I I I I I I 

Db 1718 A— 

CGGACTGGGACTCTTCAATTAATCTGACATTTCACAAATCCAAAATTGCCGTGGATG 17 75 

Qy 2313 

GTCACCTGCCGGCCCTGCAGGATGTACCAGGGGAGGAGTGGAGCTGCTCACTCTGCCATG 23 72 

III I I I I I I I I I I I I I I I I I I I I I I I 

Db 1776 AACTCTTTACTTCACTTCGGGATATTGC TGGAGCT- 

CGGAGTCT 1818 

Qy 23 73 TGCTCCCTGACCTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATG- 

GTGCAGACAGCACT 2 431 

I I I I I I III I I I I I I I I III 

I I 

Db 1819 CCTGAA ACAGTTTAAGTCTGTATATGTTC 

CT 1849 

Qy 2 432 GG CGTGGTGGCCAAGC-TCTCACCAGCCAACCAGCGGAAATGT — 

GAGCGTGTACT 2 48 4 

II II I I I I I I I I I I I I I I I I I III II 

I 

Db 1850 GG AAAT CAT AC C C ACC AGGC ATC T TAT AAGC C 

ATTGTTGAAGCAAGTTGT 1899 

Qy 2 485 GCTGGCCCTATTCTGTCACGAAC — CCTGCCGCC 

CCCTGCATCAGCTGGCTACCGA 2538 

I II I I I I I I I I I III II I III I I I I I 

I 

Db 1900 

GGAGGAAATATTTCATCCCGAGAGGCCAGATTCCGTTGATATTGAACACATGTCTTCAGG 1959 
Qy 2539 CTCCACC- 

TTCTCCCTGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCC 2 59 7 

I III I I I I I I II II I I I I III I I I I I I 

I I 

Db 196 0 CCTCACTGATCTCCTTAAAACT GGATTTAGCATGTTCATGAAGGTGAGCCGG — 

CC 2013 

Qy 2598 

GCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGG 2 65 7 

I I I I I III I I I I I I I I I I I I I I I III 

I I 

Db 2014 TCATCCTAG TGACTACCCCCTCCTGATCCTC TTTGTGGTAGGTG- 

GG 2059 



Qy 2658 

GCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATCA 2 717 

I I I I II I I I I I I I I I I I I I I I I 

I I I I 

Db 2 06 0 GTCACAGTCTCTG AAGTGAA-AATGGTCAAAG — ATCTTGTGG — 

CATCG 2104 

Qy 2718 TCG — GCCTGCAGCGCT — 

TCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAG 2 7 73 

II I I I I I I I I I I I I I II I I I I III I I I I I 

Db 2105 TTGAAGCCAGGAACCCAGGTAATCGTGCTGTCCA — CACGACTCCT — 

GAAGCCACTTAA 216 0 

Qy 2 774 TTCTCTGCTGTGCTGGTG-GAGCCCCCG- 

CCGATGAGCCTGCCTGGTGCTGGCCTGAGTT 2 831 

II I I I I I I I I II I I I I I I I I I I I I I I I I I 

I 

Db 2161 CATTC — CTGAGCTGTTATTTGCAACTGACCGA CTGC 

ATCCAGACCTTGGCT 2210 

Qy 2 832 CCCAGGAGCTGTCTGGTGGCCCTGGTGAT — GGCC- 

CCTGAGGCTGGAGCCCCCATGGCC 2888 

I I I I I I II I III III I I I I I I I I I II 

I I 

Db 2211 TCT— GAGCATCC 

GC T AAGAAGAT AAGACC T AC TCAAGCTGGAAATGCCGATGCA 2263 
Qy 2889 

AGCCCAGCCTGGCTCTGTTCTCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGCCTGA 29 4 8 

I II I I I I I I I I I I I I I I I I I II 

I 

Db 226 4 A TTTTCTGCCACCACTCCAAATACTC-CTCCAC 

AACCAGC 23 02 

Qy 2949 CTCCCACTCCCTGGTGGCCCCA TCCCCCAG TTCCT- 

CAC 2986 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2303 

GTCCCTGTCACTAATTGCGAGAATGATGGAATTCTGCCTGAAGGGTCTTGATACCTACTC 2362 

Qy 29 8 7 GATATGGTT-TTTACTT 

CTGTGGATTTAATAAAAAAAACTTCACCAGTTCAAAA 3039 

I III I I I I I I I I I I I I I I I I I I I I I I I II II 

I 

Db 2363 

AGTGAGGTACTTTGCTTGGATTGCTGTGATTCTTAAAAAAAAAAAAAAAAAAGGCCACAT 2 42 2 

Qy 3040 AAAAAAAAAAAAAA 3053 

I I 

Db 2423 GTGCTCGAGCTGCA 2436 
Qy 12 CGGCGGCGGAGGCCGGGC-CGCGGAGCCAGGAGT-

GACTAGCAGCAG 56 



Db 2 43 7 CTGCAGC- 

TCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTAAGAATCACAGCAA 23 83 

Qy 57 TTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGC — 

GGCCCAGGAGGCGCGTGG 114 

I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 23 82 T CC-AAGCAAAGTACCTCACTGAGTAGGTATCAAGACCCTTCAGGC- 

AGAAT 2333 

Qy 115 TGCGGGTTTCGGCGGCGGCTGAGGAAGAAGCGC 

GGGCGGCGCCTTCGGGAGGCG 16 8 



Db 2332 TCCATCATTC-TCGCAATTAGTGACAGGGACGCTGGTTGTGGAGGAGTATT 

TGGAG 22 78 

Qy 169 AGCAGGCAGCAGTTGGCCGTGCCGTAGCAGCG — TCCCGCGCGCGGCGGGC — 

AGCGGCC 22 4 



Db 22 77 TGGTGGCAGAAAATTGC 

ATCGGCATTTCCAGCTTGAGTAGGTCTTATCTTCT 2226 

Qy 225 CAGGAGGCGCGTGGCGGCGCTCGGCCTCGCGGCGG-CGG-CGGCGGC 

AGCGGCCCA 2 78 



Db 2225 TAGCGGATGCTCAGAAGC- 

CAAGGTCTGGATGCAGTCGGTCAGTTGCAAATAACAGCTCA 2167 



DB 1; Length 2437;
879; Indels 911; 



Qy 2 79 GCAGT TGGCGGCGAGCG- 

CGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCT— C 329 



III I I I I I I I I I I I I I I I I I I I I I I III

Db 2166 GGAATGTTAAGTGGCTTCAGGAGTCGTGTG-GACAGCACGAT — 

TACCTGGGTTCCTGGC 2110 

Qy 330 

CTCCCCCCCTGGGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGG 3 89 

III I I I I I I I I I I I I I III 

I 

Db 2109 TTCAAC GATGCCACAAG ATCTTTGAC — CATTTTC 

ACTTCAG 2 0 70 

Qy 390 

CAGCAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCG 4 49 

I I I I I I I I I I II II III III I I I I 

I 

Db 2 069 AGACTGTGACC 

CCACCTACCACAAAGAGGATCAGGAGGGGGTAGTCACTAG G 2 018 

Qy 450 

AAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGT 5 09 

III I I I I I I I I I II III III II 

I 

Db 2017 AT GAG GCCGGCTCACCTTCATGAACA TGCTAAATCC 

AGTTTT 1976 

Qy 510 CGCCCGCGGGGGGCGGCGCCGAGGCGCTGGAGC 

TGCTGGAGCACTGCGGCGTGTGC 565 

III II I I I I I I I I I I III I I III I 

I I 

Db 19 75 AAG GAG AT C AG TGAGGC- 

C T G AAG AC AT G T G T T C AAT AT C AAC GG AAT C T G - 1926 

Qy 566 AGAGAGCGCCT GC G AC C C G AG AGGG AG 

CCCCGCCTGCTGCCCTGTTTGC— 614 

I I I I I I I II I I I I I I I I I I III 

Db 1925 GCCT 

CTCGGGATGAAATATTTCCTCCACAACTTGCTTCAACAATGGCTT 18 7 7 

Qy 615 — ACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGCCCCCGCCGCCGCCAACAG- 

CTCGG 671 

I I I I I I II I I I I III I III I I I I II 

I 

Db 18 76 ATAAGATGCCTGGTGGGTATGATTT 

C CAGG AAC AT AT AC AG AC T T AAAC T G T T TC AG 182 0 

Qy 6 72 GGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACTGTCCCG- 

TGTGCAAGCAACAG 73 0 



Db 1819 GAGAC TC — CGAGCTCCAGC AATATCCCGAAGTG-AAGTAA- 

AG 1780 

Qy 731 

TGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCC 79 0 

II I I I I I II III I I I I I I I I I I I I I I II 

Db 17 79 AGTTCATCCACGGCAATTTTGG ATTT GTGA- 

AATGTCAGATTAATTG— 173 4 



Qy 791

ACCGACGCCCAGGATGCGAACCAGTGCTGCACTAGCTGTGAGGATAATGCCCCAGCCACC 850

I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 1733 -AAGAGTCCCAGTCCGTAATTTTTTGCAGCA 

AAGG T G AC AAT C C AG AT T C C 16 8 4 

Qy 851 AGCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGC 

GCACCA 9 06 

III I I I I I I I I I I I I I I I I I 

I 

Db 16 83 T — CACAG AAGACCTGAGCCAATG CTTTCTTGA — 

CTTTTTCTTCT 1642 

Qy 907 GCGGGTGAAGTACACCAAGGAC CATACTGTGCGCTCTACTGGGCCAGCC- 

AAGTCT 961 

II I I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 1641 GC 

TTCACACAGGTCTTTGTCTACCGTGAGCTCTCCAGTGACAGAATAAATAT 159 0 

Qy 962 

CGGGATGGTGAACGTACTGTCTATTGCAACGTACACAAGCATGAACCCCTTGTGCTGTTT 1021 

I I I I I III I I I I I I III I I I I I II 

I I 

Db 1589 ATATGAGAAGGATCAG-CAGT TCCTCAGGGCTGTAGTCCTCGT — 

TGGTT 1543 

Qy 1022 TGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCCAGCTCAATGCCCACAA- 

GGAC — 10 78 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1542 CTCTGGGTTACAGGCTTAATCATGGGCAG- 

C AGC T GAT T T AAC AC AAC GG AC AT 149 0 



Qy 1079 CACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACC-

AGCGCAAGCTCC 1127

III I I I I I I I I I I I I I I I I 

Db 1489 TGCTGACTCC CCAATGCTCTGAAGA 

AGGAGCCTTTCAAAAGCCAGAAAG 1441 



Qy 112 8 TGGCCTCACTGGTGAAGC- 

GCCTTGGGGACAAACATGCAACATTGCAGAAGAGCACCAAG 1186 

I I I I I I I I I I I I I III I I I I I I I 

Db 1440 TTGTCCCACTTG GCAGTCTGTGGG TGTTTCAACGTT- 

TGAGCT 1399 



Qy 1187 

GAGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTGTGCAAGTGGATGTC 12 46 

I I I I I I I I I I III I I I I I II III 

I I 

Db 139 8 GTGGC-CAGT CCAAG CTGGAG GAGGC CACAATGAT- 

TC 1363 

Qy 1247 

AAGATGGCCATCCTGCAGATCATGAAGGAGCTGAATAAGCGGGGCCGTGTGCTGGTCAAT 13 06 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1362 ATTAGAGCTTTGAGGTTGTTCTTGAA-GAGCTGAATAT 

AGGACATGAGCTG 1313 



Qy 13 0 7 GATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGCAG- 

CACTGGA 1361 



Db 1312 TCCCGGT — GTGACTCT 

CCCCATACTCATCTTGATTGGCAGGTTTTCTCTG — 1264 

Qy 1362 CCATGAC-CAAGATCCAGAAG-CACCAGGAGCACATTCTGCGCTTTGCCTC- 

TTGGGCTC 1418 



Db 12 63 -CTTGCCGCTTCCACTAGATGTCTCC GAAC T T C C AT C AC — 

TGCCTCTTTGTGCTT 1211 

Qy 1419 TGGAGAGTGACAACAACACAGCCCTTT-TGCTTT — CTAAGAAGT 

TGATC 1465 

I I I I II II I I I I I I I I I I II I II 

Db 1210 AGTGT- 

TCAGTAAAGCTTCCCATAGGGCTTTGGCTGTGGTGTCACTGAATTGTGAAA 1155 
Qy 1466 

TACTTCCAGCTGCACCGGGCCCTCAAGATGATTGTGGATCCCGTGGAGCCACATGGCGAG 152 5 
II I I I I I I I I I I I I I I I I I II 

Db 1154 GACAGCCTGGTGCA ACCACATTATAATTTT CCTC 

CAGT 1115 

Qy 152 6 ATGAAGTTTCAGTGGGACCTCAATGCCTG — 

G AC C AAG AG TGCCGAGGCCTTTG G C AAG A 1583 

I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1114 ATGGAG-TGCAGTGAG — CGCTAT-CATGTTAACCAT 

CACATCATTTG 10 71 

Qy 1584 TTGTGGCAGAGCGTCCTGGCACTAACTCAACAGGCC — 

CTGCACCCATGGCCCCTCCAAG 16 41 



Db 10 70 -TGTGGCCTGGGAGCTGGGGAAGTGCTGAAATGATCTTCT- 

CTACTAAGTTGTCTCCATG 1013 



Qy 16 42 A-GCCCCAGGGCCCCTGAGCAAG- 

CAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCAG 1699 

I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1012 ATGTCCAACTGCTCCTGTGAGATCCAGGGTTCTGTCCACA — 
AAAACCA 966 



Qy 1700 

GAAGGCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTCA 1759 

II I I I I I I I I I I I I I I I II 

I I I 

Db 965 CT GATG — CCCTGC-C TGCAG CA 

GTCT 942 

Qy 1760 

GGTGTGAAACGGTCCCGCTCAGGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCA 1819 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 9 41 TCT TCCTGTTCT-TTGCAGG GGCATAATTGGCCAGAT- 

CCG 903 

Qy 182 0 CGAGTGAGCCTTGAACGCCTGGACCT — 

GGACCTCACAGCTGACAGCCAGCCACCCGT-C 1876 

I III III I I I I I I I I I I I I I I I I III I I I I I I I 



Db 9 02 C AG C GAT G A C C T - G AC T T AAGG AAC CTACAGC — 

AAAACACTCCTCCCGTAC 854 



Qy 18 77 TTCAAGGTCTTCCCAGGCAGTACCACTGAGGACTACAACCTTATTGTTAT- 

TGAACGTGG 1935 

III I I I I I I I I I I I I I I I I I I I II II II 

Db 853 TCCTAAATGTTCACA — CAG-AGAACTGAGGCCTGACAC 

TAGGCATCTGAT 8 06 

Qy 1936 

CGCTGCCGCTGCAGCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCCC 1995 



Db 8 05 — CTGCAGCAGCAGCT CTGGGGTTAGCG — TAGTGGAGTCC- 

ACATC 76 4 

Qy 1996 ACCC CTGGCTGGCATGGCC- 

AT T G T C AAG GAGGAGGAGACGGAGGCTGCCATT 2 0 4 7 

I I I I II I III II I I I I I 
Db 763 ACCCAGGCTTCCCAGCTT — CCTCTTGTCC 

GCTATTAAGG 718 

Qy 2048 - 

GGAGCCCCTCCTACTGCCACTGAGGGCCCTGAGACCAAACCTGTGCTTATGGCTCTTGC 2106 



Db 717 AGGTGCACATCCTGGGGTAGCAGTGGGAAAAGGGA 

TGC 680 

Qy 2107 GGAGGGTCCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAGT- 

GGCAGCACCAGCTCAG 2165 



Db 6 79 AAAAG CTGGAGTCAAGG- 

CAAAGTGGGGAGCAACAG G 6 44 

Qy 2166 GGCTGGAGGTGGTGGCTCCTGAGGGTACCTCAGCCCCAGGTG 

GTGGCCCGGG 2217 

III I I II I I I I I I I I I I I I III I I I I I I 

Db 6 43 GGCAAGCAATAACGGGACATG-GAACACCTCGGCC 

GTGTAGTTCATGTTGCCC 592 

Qy 2218 AACCCTGGATGACAGTGC — 

CACCATTTGCCGTGTCTGCCAGAAGCCAGGCGATCTGGTT 2 2 75 

I I I I I I I I II III III I I I I III I I I I 

Db 591 ATCCAT TCACACAGCTTCTCC TCCAG 

CTGCTCGAACACCGGC 550 

Qy 2276 ATGTGCAACCAGTGTGAGTTTTGTTTCCACCTGGACTGTCACCTGCCGGCCCTG 

CAG 2332 



Db 549 TGCTGCCC CTCCATCTCGGCCG 

CTGCCGCCGCTGGGACAT 510 

Qy 2333 

GATGTACCAGGGGAGGAGTGGAGCTGCTCACTCTGCCATGTGCTCCCTGACCTGAAGGAG 2392 



Db 509 GATTAGCT GTGAGGTGGA CAG-CGTG- 



III I I I I I I I I 
-GGTCG — GGC- 



GCTCAC- 



-G 478 



Qy 2393 GAGGATGGCAGCCTCAGCCTGGATGGTGCAGACAGCACTGGC — GTGGTGGCC- 

AAGC 2447 

I I I I I I I I I I III I I I I I III I I I I I I 

I 

Db 477 G T T G - T G AC C AC C AC A- C AAT AC T GG AAG T G AC T G C 

GGCAGATGATGTCCCGTAGGA 423 

Qy 2 448 TCTCACCAG- 

CCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTCACGAA 2 5 06 



Db 422 TCTCCACGGTCCGGCCT TTCAGCAGG— CAGCTCAG- 

C AC AAA 3 83 

Qy 250 7 CCCTGCCGCCCCCTGCATCAGCTGGCTACCGACTCCACCTTC-TC 

CCTGGACCAGC- 2 561 

I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 382 CACTGCCTTGGGCTGCT TGGCT- 

CCACCACCAATTGCGTCGGGCTCGAACTCTCG 329 

Qy 2 562 CCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGA- 

AGTTGTCACCT 262 0 

I I I I I I I III I I I I II I I I I I I I I I I I I I I III 



Db 32 8 CAGGTGACAGTCAGG GCCCC- 

ACCGCCTCCAGGAGACGGGTGGATCC- 281 



Qy 2621 

CCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCAAGCAATTCAAC 26 8 0 
III I I I I I I I I I I I I I I I II 

Db 280 GCAG — CCCCAGTG CAGGCTCTCGGC 

GC 255 



Qy 2681 

AAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTGCAGCGCTTCTTCGAG 2 7 4 0 
I I I I I I I I I I I II I I I I I II 

Db 254 AGGC- 

GGCGTCCAGGTAAACCACAGCCCGTTTCACTT 219 



Qy 2741 

ACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGGTGGAGCCCCCG 2 8 0 0 
I I I I I I I I I I I I 

I I I 

Db 218 TGGCC AGCACC TGCT 

CCCA 200 



Qy 2801 CCGATGAGCCTGCCTGGTGCTGGCCTGAGTTCCC- 

AGGAGCTGTCTGGTGGCCCTGGTGA 2859 

I I I I I III II I I I I I I I I I I I 

Db 199 TC CTTGCTGGGTAAAGGACAGTACGCCCGAGGCGCT- 

CATGGTTG 156 



Qy 2860 

TGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTCTGTTCTCTGTCCTGTC 2 919 

II I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 155 GGGATTCGCAGACTTGGGAAACTACGGT GCAGGAACTTCTTTCAGAAC- 



TC 106 



Qy 292 0 ACCCCATCC— CCACT CCCCTGGTGGC CTGACTCCC 

ACTCCCTGGTGG 2965 



Db 105 

ACCGCTTCCGGAAATTGGGCTCCGGGAGACTTTGACAGTCTCCACAGTACACCCAGTAGG 46 
Qy 2966 

CCCCATCCCCCAGTTCCTCACGATATGGTTTTTACTTCTGTGGATTTAATAAAAAAAACT 3 02 5 
II I I I I I I I I II I I I I I I I I I I I 

Db 45 CCAACAGT GCTCGA GG 

AATTCCGCAGCTTTTAGAGCAGAAGTA 3 

Qy 3026 TC 3027 

I 

Db 2 AC 1 
Qy 57 TTGGCC-

GTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCGTGGT 115 



Db 1 TAGGCCTGTAC GGAAGTGTT 

ACTTCTGCTCTAAAAGC T 3 8 

Qy 116 

GCGGGTTTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGC 175 
I I I I III II I I I I I I I I I I I I I I I 

I I 



Db 39 GCGGAATTCCTCG AGCACTGTTG — GCCTACTGGAG- 

TGC 75 



Qy 176 

AGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCG 235 
I I I I I I I III I I I I I I I I II 

I I I I I I 

Db 76 TGAAGTAGGC GCGG ACGTGCCCGGTG CC- 

TGGCGCG 110 

Qy 236 TGG 

CGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCG 2 91 

III I I I I I I I I I I I I I I I I I I II III I 

I 

Db 111 TGGTAGCAGGCGCCCGGTGCCCCGGC — CGGCGAAGACCATGGC 

GTTCATGGTG 162 



Qy 292 AGCGCGTCTGCGCCTGCGCGGCGGGCCCCGC- 

GCCCCTCCTCCCCCCCTGGGCGCCCCCG 350 

I I III I I I I I I I I I I III 

Db 163 AAGAC CATGGTGGGCGG — CCAGCTGAAGAACCT 

CACTGGGAGCC 2 05 



Qy 351 

GCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCGGCCGCCT 
II I I I I I I II I I I I I I 

Db 2 06 TGGGAGGCGGC 

GAGGATAAGGGAGATGGGGACAAGTC 242 



Qy 411 CTGGCAGCCCGGGC- 

CCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCT 469 
I I I I I I I I I I I I I I III III 

Db 2 43 — GGCAGCCGAAGCTCAGGGCATGAGC 

CGGGAGGAGTA 2 78 



Qy 470 

TCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCGCC 529 
II I I I I I I II I I I I I 

Db 2 79 CGAGG AG TAT C AG AAG C AAC - T C G T 

GGAA 306 

Qy 530 

GAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAGAGG 5 89 
III I I I I I I I I I I I I I II I I I I I I I II 

I I I I 

Db 307 GAGAAGATGGAGCGG — GATGCAC AGTTCACACAGAG-GAAGGC — 

AGAGC 352 



Qy 590 

GAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGCC 6 49 
I I I I I I I I I I I I I I I II III I 

I I 

Db 353 GGGCCAC-ACTGCGGAGC CACTT — CC GAGACAAAT 

ACCGGCTACC 395 



Qy 650 CCCGCCGCCGCCAACAGCTCGGGGGACGG — 

CGGGGCGGCGGGCGACGGCACCGTGGTGG 7 0 7 

I I II I I I I I I I I I I I I I I I I I 

I 

Db 396 CAAG — AACG-AGACAGAT-GAGAGCCAGATCCAGATGGCAGG 



TGGAGACGTGGAG- 447 



Qy 70 8 ACTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGACATCGTGGAGAA 

TTATTT 762 

I I I I I I I I I I III II II I I I I I I I I I I I I 

Db 448 -CTGCCCCGGGAGCTGGCCAAGATG- 

ATCGAGGAGGACACAGAGGAGGAGGAGGAGAAGG 505 

Qy 763 

CATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCAC 82 2 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 506 CCTCAGTCCTTG-GGCAGCTGGCCAGCCT TCCTGGCT-TGAACCTGGGCT- 

CAC 556 

Qy 823 TAGCTGTGAGGATAATGCCCCAGCCACCAGCTACTGTGTGGAGTGCTCGGAGC- 

CTCTGT 881 

I I I I I I I I I I I I I I I I I I I I III III III II 

I 

Db 55 7 TC AAGGACAAGGCCCAGGCCAC ACTG GGGGATCTC- 

AAGCAATCAG- 601 

Qy 8 82 GTGAGACCTGTGTAGAGGCGCACCAGCGGGTGA — AGTACACCAAGG — 

ACCATACTGTG 93 7 

I I I I I I I I I II I I I I I I I I I II III 

Db 6 02 CTGAGAAGTGTCACGTCA 

TGTGACCACTTCCCCGGGGTTACC 643 

Qy 938 

CGCTCTACTGGGCCAGCCAAGTCTCGGGATGGTGAACGTACTGTCTATTGCAACGTACAC 99 7 

III I I I I I I I I I I I III II II I I I I I I I I I I I 

I 

Db 6 44 CACTGGGCTGGGCCC-CCATG AGGGC TAAGAGTG- 

TGTCAACTTCCAGGGACCC 695 

Qy 998 

AAGCATGAACCCCTTGTGCTGTTTTGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGC 1057 
I I I I I I I I I I I I I I I I I I I I I I 

Db 696 A TACTCCATTTG GGGCTTTGTT 

TCCCTTGCC C 727 

Qy 1058 

CAGCTCAATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAG 1117 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 72 8 CATCCTAGTTCC AAGACCT TTCCCATCC ATGC 

CCCAAG 765 

Qy 1118 CGCAAGCTCCTGG 

CCTCACTGGTGAAGCGCCTTGGGGACAAACATGCAACATTG 1171 

I I I I I I I I I I I I I I III I I I I 

Db 766 C-CTATCTTCTGGTTTCTTCCTCTC CGC— 

TGGG 796 

Qy 1172 

CAGAAGAGCACCAAGGAGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGT 1231 

I I I I I III II I I I I I I I I I I II I II III 

I 

Db 79 7 -AGTAAAGTCCCCA TCTTCACTCTACCCTTCAGGACCCTCCC 



CACCAGC-T 8 45 



Qy 1232 

GTGCAAGTGGATGTCAAGATGGCCATCCTGCAGATCATGAAGGAGCTGAATAAGCGGGGC 12 91 

II I I I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 8 46 CAGCCTGTGGA GGCCTCCC — AAGAT — TGTAG 

GAATAGGC C 8 83 

Qy 1292 

CGTGTGCTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGG 1351 
II III 

Db 884 CAT 

CCCT 890 



Qy 1352 CAGCACTGGACCATG — ACCAAGATCCAGAAGCAC- 

CAGGAGCACATTCTGCGCTTTGCC 140 8 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I III 

I 

Db 891 CTCTGG-CCATGGCCCCAAGTTCC TGCACACAGGAGCACCCACAGAG- 

AGAC 9 40 



Qy 1409 

TCTTGGGCTCTGGAGAGTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTAC 146 8 
I I I I I I I I I I I I I I I I I I I I III 

Db 9 41 AC ACACAGGA CACAAAAC — 

CCCTGGCACGTTCAGAGA 9 76 



Qy 1469 

TTCCAGCTGCACCGGGCCCTCAAGATGATTGTGGATCCCGTGGAGCCACATGGCGAGATG 152 8 
I I I I I I I I I I I I I I 

Db 9 77 CAGAAGCCACAGACAC ATCCC 

GGC 1000 



Qy 1529 AAGTTTCAGTGGGAC- 

CTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCAAGATTGT 15 8 7 
III I I I I I I 

Db 1001 ACAGACAGACACACA 

CGAGGCC 1022 



Qy 1588 

GGCAGAGCGTCCTGGCACTAACTCAACAGGCCCTGCACCCATGGCCCCTCCAAGAGCCCC 16 4 7 
III I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1023 AGCTCCCTTGCG-TGTC CAGCCCCT CCA-GACACCACCA— 

CTC 1062 



Qy 1648 

AGGGCCCCTGAGCAAGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCAGGAAGGCTA 17 0 7 
II I I I I I I I I I I I I Mill II III III 

Db 1063 AGAAACTCTGAG — AG-AGAGCATGGGCAGACACC — CTCA 

GCAGACAGG 1107 



Qy 170 8 TGGCTTTGGGTCAGGAGATGATCCCTAC TCAAGTGC 

AGAGCCCCATGTGTCAGG 1761 

I I I I I I I I I I I I I I III 

Db 1108 AGGCCTGAGTTCCAG 

TCTCCACCTTTATTGTTCTTGAAAGCCCC 1151 



Qy 1762 



TGTGAAACGGTCCCGCTCAGGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCACG 1821 
II I I I I I I I I I I I I I 

Db 1152 — TG CTCTCTCTG 

AGCCTTAT 1170 

Qy 1822 AGTGAGCCTTGAACGCCTG- 

GACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCA 18 8 0 

I I I I III I III I I I I I I II II I I I I 

Db 1171 TTCATCATCTGTAAAATGGGAATGTC — 

CTGAATGACTTCTAAGGCTCTTTC 1220 

Qy 1881 

AGGTCTTCCCAGGCAGTACCACTGAGGACTACAACCTTATTGTTATTGAACGTGGCGCTG 19 4 0 

II I I I I I I II I I I I I I I I I III I I II 

Db 1221 TGGCTTGAACTGTCAGAGCC AAGCCCACATCCCTC CTTGGGCAGG — 

GCAG 1269 

Qy 1941 

CCGCTGCAGCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCCCACCCC 2 0 0 0 

I I I I I I I I I I I I I I I I I I I II I III II II I I I I I II 
Db 12 70 CAGCTGCTGCCA-CAGCCT-CCAGCGGCTGCCACTGTGGG — 

CTCTGGGAGCCGGAGCGA 1325 

Qy 2001 TGGCTG GCATGGCC- 

ATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCT 2 056 

I I I I I I III I I I I I I I I I I I I II III III 

I I 

Db 1326 T-GCTGTGTGAGAGGCAGAGTGCCAAGGATGAAG 

CTGGCACTGAACAGTAAGCGGCT 13 81 

Qy 2 057 CCT-ACTGCCACTGAGGGCCCTGAGACCAAAC 

CTGTGCTTATGGCTCTTGCGGAG 2110 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 13 82 CCAGGCCTCCTCT — GGGCCCAGGGCCCAGCCAATTTCTGTTCT 

GTTCCTGTAGAA 1435 

Qy 2111 GGTCCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAGTGGCAGCAC- 

CAGCTCAGGGCT 2169 

I II I I I I III I I I I I I I I I II I I I I I I I 

I 

Db 1436 CGCTCT CTG-GATTCCATAGCTGGAATCTCCT 

CTCTTAGCTCAG T 1479 

Qy 2170 GGAGGTGGTGGCTCCTGA GGGTACCT-CAGCCCCA GGTGGTGGC — 

CCGGGAAC 222 0 

II III I I I I I I I I I I I I I I I I I I I I I I 

Db 1480 

GAAAAATAAAAATCCCAAATGGTGTGCCTACCTTCCCACTTCTTACTGGCTTCCAGGAGT 1539 
Qy 2221 

CCTGGATGACAGTGCCACCATTTGCCGTGTCTGCCAGAAGCCAGGCGATCTGGTTATGTG 2 2 8 0 
I I I I I II I I I I I I I I I I I I I II I I I I 

I I I 

Db 1540 CTTGGAGTTCATAGCCCC CCGAGCCTGCCTTAA AGGGG 

TGT- 1580 



Qy 2281 

CAACCAGTGTGAGTTTTGTTTCCACCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACC 23 4 0 



I I I I I I I I I I I I I I I I I I I

Db 1581 CCTCCA CCCCC CACCTACAG 

CTTCA C 1606 

Qy 23 41 AGGGGAGGAGTGGAGCTGCTCACTCTGCCATGTG CTCC 

CTGACCTGA 23 8 7 

I I I I I I I I I I II I I I I I I I I I I I III II 

Db 16 0 7 AGGAGGGGAGAGG 

GCATCCAGTGCTAGGAGTAGAAGTGTCTCCAGCTCTGTTCTCT 1662 

Qy 23 8 8 AGGAGGAGGATGGCAGCCTCAGCCTGGAT 

GGTGCAGACAGCACTGGCGTGGTGGCCA 2 444 

II III IIII I I I I I I I I I I I I I I

Db 1663 TGG GGC CCTGGGTGAAGGTGGGGTCTG GGGC- 

TTATGAAAT 1702 

Qy 2445 

AGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTCACG 2 5 0 4 

I I I I I II III I I I I I I I III I I I I I I 

I 

Db 1703 AGGTCT GGGCTTTGAGGAGG — ATG-GAGC 

AGCCTCATTATGTGGGG 1746 

Qy 2505 AACCCTGCCGCCCCCTGCATCAGCTGGCTACCGACTCCACCTTCTCCCTG- 

GACCAGCCC 2 563 

II I I I I I I I III I I I I I I I I I I 

I 

Db 1747 AAGATGG — GGCCTCTG GGGCG TCACTGAGACCA — 

CA 1780 

Qy 2564 

GGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGAAGTTGTCACCTCCC 2 623 

I I I I I II II III III I I I I III III I 

I I I 

Db 1781 GGTGGGGCCGGGG CTGGACCGCAGCTGTCT- 

TGGGTGCCTGTGCCTGCACCC 1831 

Qy 2624 

TACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCAAGCAATTCAACAAG 2 6 83 

I I I I I I I I I I I II III I I I I 

I I I 

Db 1832 CTCCTCA CCCTAGAGACGG 

AAGATGTGCAAAAAG 1865 

Qy 2684 

TTAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTGCAGCGCTTCTTCGAGACG 2 7 43 

I III I I I I I I I I I I I I I I I III I III III 

Db 1866 AAAGAAGGAAGGGCA-AC-TGCATTCCA GCC— CCACACT 

GTGATG 19 0 7 

Qy 2 7 4 4 CGCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGGTGGAGCCCCCG- 

CC 2802 

I I I I I I I I I I III I I I I I I I I I I I I I I I I 

Db 1908 ACTTTGAGCCTGTCCTT TCCCTCCTTGAGC-CTGTCTTGCTTGT 

CCCCTGTAA 1959 



Qy 2 8 03 GATGAGCCTGCCTGGTGCTGGCCTGAGTTCCCAGGAGCTGT- 

CTGGTGGCCCTG 2855 



1 1 1 1 I 



I 



1 1 



I 



1 1 



Db 196 0 AATGAACAGTCC C CCT 

C T C C C C C AAAT AG T AAT AAT AC ATGT T TC AAAG 2008 



Qy 2 856 GTGATGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTCTGT- 

TCTCTGTC 2914 



I I I 



Db 2 0 09 — GGTGACCATTTA TAAAG — 

CATATGACAAACCATATCAATAAATGTAACTCATTC 2 061 



Qy 2915 CTGTCACCCCATCCCCACT 

CCCCTGGTGGCCTGACTCCCACTCCCTGGTGGCCCCAT 2 9 71 



I I 



I I I 



I I I I 



Db 2 062 TTTTAAAAAAAAAAAAAAAAGGCCACATGTG- 

CTCGAGCTGCAGGTCGCGG 2111 



Qy 2972 CCCCCAG 2978

I I I I I

Db 2112 CCGCTAG 2118

Qy 45

GACTAGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCC 10 0 



Db 212 0 GTCTAGC GGCCGCGAC CTGCAGCTCGAGCAC- 

ATGTGGCCTTTT 2078 



Qy 101 CAGGAGGCGCG- 

TGGTGCGGGTTTCGGCGGCGGCTGAGG 13 8 



I 1 1 1 I III II 1 1 I I 

Db 2 0 77 TTTTTTTTTTTTAAAAGAATGAGTTACATTTATTGATATGG — 

TTTGTCATATGCTTTAT 2 02 0 

Qy 139 AAGAAG-CGCGGGCGGCGCCTTCGGGAGGCGAGCA 

GGCAGCAGTTG 183 

II III I I I I I I I I I I 

I 

Db 2 019 AAATGGTCAC CCTTT — 

GAAAC ATGT AT TAT T AC TAT T TGGGGGAGAGGGGG 1970 

Qy 184 GCCGTGCC- 

GTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGCGTGGCGGC 2 42 

I I I I I I I I I I II I I I I I I I I I I I I 

I 

Db 1969 AC TGTTCATTT T AC AG -GGGAC AAGC AAGAC AGGC T 

CAAGGAGGGAAAGGACAG- 1917 

Qy 243 GCTC— 

GGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGCGAGCGCGTC 299
I I I I I I I I I I I II III I I I I I 

Db 1916 GCTCAAAGTCATCACAGTGTGGG 

GCTGGAATGCAGT 1881 

Qy 300 

TGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGGCGCCCCCGGCGGCGTGT 359 
I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1880 TGC-CCTTC-CTTCTTTCTTTTTGCACATCTTCCGTCTCTAGG 

GTGA 1836 

Qy 360 

GAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCGGCCGCCTCTGGCAGCC 419 
I I I I II II I I I I I I I I I I I I I I I I 

I I I I I 

Db 1835 GGAGGGGTGCAGGCAC — AGGCACCCAAGACAGCTGCGGTC 

CAGCC 1792 



Qy 42 0 CGGGCCCGGGC- 

GAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTC 4 71 

I I I I I I I II III I II II I I I I I I I I I I III 

Db 1791 CCGGCCCCACCTGTGGTCTCAG-TGACGCCCCAGAG- 

GCCCCATCTTCCCCACATAATGA 173 4 

Qy 472 GGCCGCAGCCTCGGCCTC--TGCCTCAGCCGCGGCGTCGT- 

CGCCCGCGGGGGGCGGCGC 52 8 

I I I I I III I I I I II I I I I I I III I I I I 

I 

Db 1733 GGCTGCTCCATCCTCCTCAAAGCC-CAGAC-CTATTTCATAAGCCC 

CAGACC 16 8 4 

Qy 529 

CGAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAGAG 588
I II III I I I I I I III 

Db 1683 C CAC CTTC ACCCAG-GGC 

CCCAAGAGAACAGAGCT 1650 

Qy 589 

GGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGC 648
I I I I I I I I I I II I I I I I I I I I I I II 



Db 16 49 GGAGACACTTCTACT — CC TAGCACTGG- 

ATGCCCTCT 1615 



Qy 649 

CCCCGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGA 708

I I II I I I I I I I I I I I I I I I I I I III I I I I 

I I 

Db 1614 CCCCTCCTGTG AAGCT GTAGGTGGGG-GGTGGAGGAC- 

ACCCCTTTAAGG- 1567 

Qy 709 

CTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCG 768

I I I I I I I III I I I I I I I I I I I I I I I 

I I 

Db 1566 CAGGCTCGGGGG GCTATG-AACTCC-AAGAC-TCCTGG 

AAGCC 1527 

Qy 769 TGATAGTGGCAGCAAGGCTGCCACCGACGCCCA 

GGATGCGAACCAGTGCTGCACT- 823 

I I I I I I I I I I I I I III I I I I I II 

I I I I 

Db 152 6 AGTAAGAAGTGGGAAGGTAGGCAC ACCATTTGGGATTTTTA 

TTTTTCACTG 1476 

Qy 82 4 AGCT — 

GTGAGGATAATGCCCCAGCCACCAGCTACTGTGTGGAGTGCTCGGAGCCTCTGT 8 81 
I I I I I I I I I I I I I I I I I I I I I I I I 

Db 14 75 AGCTAAGAGAGGAGATT CCAGCTA 

TGGAAT 14 46 

Qy 882 

GTGAGACCTGTGTAGAGGCGCACCAGCGGGTGAAGTACACCAAGGACCATACTGTGCGCT 941

I I I I I I III I I I I I II I 

I 

Db 14 45 CC AG AG AG C G T T C T AC AGG AAC AG AAC — 

AGAAAT 1413 

Qy 9 42 CTACTGGGCC AGCCAAGTCTCGGGATGGTG- 

AACGTACTGTCTATTGCAACGTACAC 99 7 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

I I 

Db 1412 TGGCTGGGCCCTGGGCCCAGAGGAGGCCTGGAGCCGCTTACTGTTCAGTGC 

CAG 1359 

Qy 99 8 AAGCATGAACCCCTTGTGCTGTTTTGTGAGAGCTGTGATACTCTCAC- 

CTGCCGAGACTG 1056 

III I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1358 CTTCAT CCTTG GCACTCTG — CCTCTCACACAGCATCG- 

CT- 1321 

Qy 1057 CCAGCT— 

CAATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAAC 1114 

I I I I I II I I I I I I I II II II I I I I I I I I 

Db 132 0 CCGGCTCCCAGAGCCCACAGTGGCAGCCG CT 

GGAGGCTGTGG 1279 

Qy 1115 

CAGCGCAAGCTCCTGGCCTCACTGGTGAAGCGCCTTGGGGACAAACATGCAACATTGCAG 1174

I I I I I I I I I I I I I I I I I I III I II III 



Db 12 78 CAGC AGCTGCTGCCCT- 

TTG 1240 



-GC-CCAAGGAGGGATGTGGGC 



Qy 1175 AAGAGCACCAAGGAGGTTCGCAGCTCAATCCGCCAGGTGTCT 

GACGTACAGAAGC 1229 

III I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1239 GCTCTGA 

C AG T T C AAG C C AG AAAG AG C C T T AG AAG T C AT T C AGG A- C 119 4 



Qy 1230 

GTGTGCAAGTGGATGTCAAGATGGCCATCCTGCAGATCATGAAGGAGCTGAATAAGCGGG 1289

I II I I I I I I I I I I I I I I I I I I I I 

Db 1193 AT TC CCATTTTACAGATGATGA 

AATAAG 1166 



Qy 1290 

GCCGTGTGCTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGC 1349
I I I I I I III I I I I I I I I 

Db 1165 GCTCAGA GAG 

AGCAGGGGC 1147 



Qy 1350 

GGCAGCACTGGACCATGACCAAGATCCAGAAGCACCAGGAGCACATTCTGCGCTTTGCCT 1409



Db 1146 T T T C AAG AAC AAT A- AAGGTGGAGAC T GG AAC T C A- 

GGCCT 1108 

Qy 1410 CTTGGGCTCTG — GAG AG T G AC AAC AAC AC AGC CCTTTTGC 

TTTCTAAGAAGTTGA 1463 



Db 1107 CCTG TCTGCTGAGGGTGTCTGC — 

CCATGCTCTCTCTCAGAGTTTCTGAGTGGTGGT 1053 

Qy 1464 

TCTACTTCCAGCTGCACCGGGCCCTCAAGATGATTGTGGATCCCGTGGAGCCACATGGCG 1523 
II I I I I I I I I I I I I I I I II II 

Db 1052 GTCTGGAGGGGCTGGACACGCAAG GGAGC TG— 

GCCTCGTG 1014 

Qy 1524 

AGATGAAGTTTCAGTGGGACCTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCAAGA 1583
I I I I I II I I I I I I I I I I I I I I I I 

Db 1013 TGTGTCTG TCTGTGCCGGG ATGTGT- 

CTGTGGCTTCTGTCT 974 

Qy 158 4 TTGTGGCAGAGCGT-CCTGGCACTAACTCAACAGGCCCTGCACCCATG- 

GCCCCTCCAAG 16 41 



Db 9 73 CTGAACGTGCCAGGGGTTTTGT GTCCTG 

TGTGTGTCTCTCTGTG 930 



Qy 1642 

AGCCCCAGGGCCCCTGAGCAAGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCAGGA 1701

I I I I I I I I I I I I I III I I I I I I I III 

Db 929 GGT GCTCCTGTG — TGCAGGAACTTGG GGCCATGGC 

CAGAG 891 



Qy 1702 AGGCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCC- 

CCATGTGTCAG 176 0 

III I I I I I I I I I I I II I I I I I I 

I I I 

Db 89 0 AGGGATGGGCCT ATTCCTACAATCTTGGGAGGCCTCCA 

CAG 850 

Qy 1761 

GTGTGAAACGGTCCCGCTCAGGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCAC 1820

I II I I I I I I I I I I I I I I I I I III 

Db 8 49 G CTGAGCTGGTGGG GAGGGTCCTGAAGGGTAGAGTG- 



Qy 1821 

GAGTGAGCCTTGAACGCCTGGACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCA 1880

II I I I I I I I I I I I I I I I I I II I I I I 

I 

Db 811 GATGGGGACTTTA 

CTCCCAGCGGAGAG G AAG AAAC CAG A 773 

Qy 1881 

AGGTCTTCCCAGGCAGTACCACTGAGGACTACAACCTTATTGTTATTGAACGTGGCGCTG 1940

III I I I I I I I I II I I I I I I I I 

I I 

Db 7 72 AGAT AGGCT TGGGG 

CATGGATGGGAAAGGTCTTGGAACT- 734 

Qy 1941 

CCGCTGCAGCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCCCACCCC 2000

Db 733 

AAAGCCCCA 706 

Qy 2001 TGGCTGGCATGGCCATTGTCAAGGAGGAGGAGAC 

GGAGGCTGCCATTGGAGCCCCTC 2 057 

I I I I I I I I I I I I I I I I I 

I I I 

Db 705 AATGGAGTATGGGTCCCTGGAAGTTGACA 

CACTC 6 72 

Qy 2 058 CTA-CTGCCAC- 

TGAGGGCCCTGAGACCAAACCTGTGCTTATGGCTCTTGCGGAGGGT — 2113 

I I I I I I I I I I I I I I I II II 

Db 6 71 TTAGC — CCTCATGGGGGCCCAG CC 

CAGTGGGTAA 639 

Qy 2114 - 

CCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAGTGGCAGCACCAGCTCAGGGCTGGA 2172 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 63 8 CCCCGGGGAAGTGGTCACATGACGTGACACTTCT 

CAGCTGATTGCTTGA 59 0 



Qy 2173 GGTGGTGGCTCCTGAGGGTACCTCAGCC CCAGGTGGTGGCCCGGGAAC 

CCTGG 2225 

II I I I I I I III III II II I I I I I I I II 

I I 

Db 589 GAT CCCCCAGTGTG- 



GCCTGGGCCTTGTCCTTGAGTGAGCCCAGGTTCAAGCCAGG 535 



Qy 2226 ATGACAGTGCCACCATTTGCCGTGTCTGCCAGAAGCC — 

AGGCGATCTGGTTATGTGCAA 2283 

I I I I I I I I I I I I I III I I I I III I I I 

Db 53 4 AAGGCTG-GCCA GCTGCCCAAGGACTGAGGCCTTCTCCTCCT 

CCT 491 

Qy 22 8 4 CCAGTGTGAGTTTTGTTTCCACCTGGACTGTCACCT — 

GCCGGCCCTGCAGGATGTACCA 23 41 

II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 49 0 CCTCTGTG TCCTCCT CGATCATCTTGGCCAGCTC 

CC- 455 

Qy 2342 

GGGGAGGAGTGGAGCTGCTCACTCTGCCATGTGCTCCCTGACCTGAAGGAGGAGGATGGC 2401
I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 454 GGGGCAGCT CCACGT-CTCC 

ACCTGCCATCTGGATCTGGC 416 

Qy 2402 

AGCCTCAGCCTGGATGGTGCAGACAGCACTGGCGTGGTGGCCAAGCTCTCACCAGCCAAC 2461
I I I I I I I I I I III I I I I I I III 

Db 415 T— CTCA-TCTGTCTCGT TCTTG 

GGTAGCCGGTATTTGTCT 3 78 

Qy 2462 

CAGCGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTCACGAACCCTGCCGCCCCCTG 2521 

I I I I I I I Ml II I I I I I I I I I I I I I II 

I I I 

Db 3 77 CGGAAGTG GCT — CCGCAGTGTGGCCCG — 

CTCTGCCTTCCTCTG 337 

Qy 2 52 2 CATCAGCTG- 

GCTACCGACTCCACCTTCTCCCTGGACCAGCCCGGTGGCACCCTGGATCT 2 5 8 0 

I I I I I I I II I I I I I I I I I I I III I I I I I 

I I I 

Db 336 TGTGAACTGTGCATCCCGCTCCATCTTCTCT TCCA — CGAGTTGC 

TTCT 290 

Qy 2581 

GACCCTGATCCGTGCCCGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGA 2640
I I I I I I I I II I I I I I I I I I I I 

Db 289 GATACT CCTC GT — ACTCCTCC 

CGGCTC 264 

Qy 2641 

GTTTGCCCAGGATGTGGGCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGC 2700
I I I I I I I I I III 

I I I 

Db 263 — ATGCCC TGAGC TTC 

GGC 247 

Qy 2701 

AGACGTGCAGTCCATCATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTT 2760

III I I I I I I I I III I I I I I I I I I I I I I I 

Db 2 46 TGCCGACTTGTCCC-CATCTCCCT TATCCTCGC CGCCT— 

CCCAGGCTCC 200 



Qy 2 761 CGGTGACACCAAGTTC-TCTGCTGTGCTGGTGGAGCC- 

CCCGCCGATGAGCCTGCCTGGT 2818 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 199 CAGTG AGGTTCTTCAGCTG 

GCCGCCCACC 171 

Qy 2819 

GCTGGCCTGAGTTCCCAGGAGCTGTCTGGTGGCCCTGGTGATGGCCCCTGAGGCTGGAGC 2878

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 170 -ATGGTCT T C AC CAT G AAC GCCATGGTCTTCGCC 

GGCCGGGGC 129 

Qy 2 8 79 CCCCATGGCCAGC — CCA 

GCCTGGCTCTGTTCTCTGTCCTGTCACCCCATCCCCACT 2 933 

II I I I I I III I I I I I I I I I I I I I I I I 

Db 12 8 ACCGGGCGCCTGCTACCACGCGCCAGGC ACCGGGCA 

CGTCCGC 86 

Qy 2934 

CCCCTGGTGGCCTGACTCCCACTCCCTGGTGGCCCCATCCCCCAGTTCCTCACGATATGG 2993

I I I I I I I I III I I I I I I I I I I I I I I 



-GCCT-ACTTCAGCACTCCAGTAGGCCAA CAGT- 



GG 47 



Qy 299 4 TTTTTACTTCTGTGGATTTAATAAAAAAAACTTCAC — CAGTTCAAAAAAA 3 0 42 

I I I I I I I I I I I III III I I I I I I 

Db 46 AATTCCGCAGCTTTTAGAGCAGAAGTAACACTTCCGTACAGGCCTA 1 

Qy 11 CCGGCGGCGGAGGCCGGGCCGCG-GAGC-CAGGAGTGAC

TAGCAGCAGTTGGCCGT 6 4 



Db 2 495 CTAGCGGCCGCGACC-TGCAGCTCGAGCACATGTG- 

GCCTTTTTTTTTTTTTTTTTACAT 2438 

Qy 65 GCCGTAGCAGCGTCCCGCGCGCGGCGG- 

GCAGCGGCCCAGGAGGCGCGTGGTGCGGGTTT 123 



Db 2 43 7 G T T T AAAT AT AT T T TAT T TC AAGT T AAT AT AAC AGC C AT GAT - ■ 

ATTT 2386 

Qy 12 4 CGGCGGCG- 

GCTGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAGCAGGCAGCAG — 18 0 

II III I I I I I I I I I I I III 

Db 23 85 AAGTGAAATGCTTCGATAAAAACCCCGGTGTCAC — 

T T GG T T AAAAT AT T GG AG AC AG AT 232 8 

Qy 181 

TTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGC- 232 



Db 232 7 TAAATGTTAAGCC — AC TAT AG TAG T G AC T C 

TGGTTCTTTAGACT 22 85 

Qy 233 GCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAG — 

CGGCCCAGCAGTTGGC 287 

I I I I I I I I I I I I I 

Db 2284 ATTGCATG TACCTTGTTAATG AACTACAGGAATGCACA 

TTGAT 22 42 

Qy 288 GGCGAGCGC— GTCTGCGCCTG- 

CGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGGC 343 

III I I I I I I II I I III I I I I I I I I 

Db 22 41 ATAATGCACAATGTCTTCACATGTCTC 

AATCCTCATTCCTCTTCCTTTCAT 2191 

Qy 344 

GCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCG 403

I I I I I I I I I I I I I I I I 

Db 2190 GCCTCC CTAATGCCTCAGTATC — 

CAATATAA 2161 

Qy 40 4 GCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTC 

CGCTGGCGGCGAAAAGCGCTC 459 



Db 216 0 AACTACAAAAACTACT 

GTTCTTTTCGTTTGAGTGGTATTTAATTA 2116 

Qy 46 0 CACCGCCCCTTCGG CCGCAGCCTCGGCCTCTGCCTCAGC 

CGCGGCGTC 507 



Db 2115 

AAACTTCAATTCTATGTTATTTCCACAGTCCAGTAATTTATTTTAAATTTGAGTAATTTC 2056



I I I I 
-TTGTTC — 



Qy 508 GTCGCCCGC 

GGGGGGCGGCGCCGAGGCGCTGGAGCTGCTGGAGCACTGCG 55 7 



Db 2 055 AAAT T C C AC AAAC AAAAC TGAAGAAC AGC AAT AT T T T G T T T G AAT T T C T 

CTCTTCT 2000 

Qy 55 8 GCGTGTGCAGAGAGCGCCTGCGACCCGAGAGGGAGCCCCGCCT — 

GCTGCCCT 608 

I I I I I I I I I I I I I I I I I III 

Db 1999 GTACACTCAGTGATCTAAAACACCACAATA 

TCCAACATACACAAACCTCAGGGAA 19 45 

Qy 609 — GTTTG — 

CACTCGGCCTGTAGTGCCTGCTTAGGGCCCGCGGCCCCCG 653 

I I I I III I III I I I I I I I I I I I 

Db 1944 

GGGTTAGTAAACACACACAAGATTGGAATCATGGTGCTCTTTGCTCCTGAATGGAATGGT 1885
Qy 654 - 

CCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACTGT 712 



Db 1884 CCCACAG-AAAAAGCACAGGATACAGC ACAACATAAG — GGCACC — 

TGTTACATAT 1833 

Qy 713 CCCGTGTGC-AAGCAACAGTGC- 

TTCTCCAAAGACATCGTGGAGAATTATTTCATGCGT- 769 



Db 1832 GAAGTGAGCAAAACATACTAGCATTTTCTATATGCATAATGGGGAA 

ACCTGCATA 1778 

Qy 770 - 

GATAGTGGCAGCAAGGCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCACTAGCTG 82 8 



Db 1777 GGTTAGAGG GCCTTTTACGCTCA-TTTAAAAATCAG 

GCAAGTTG 1735 

Qy 829 TGAGGATAATGCCCCAGCCACCAG 

CTACTGTGTGGAGTGCTCGGAGCCTCTGT 8 81 

I I I I I II III I I I I I I I I I 

Db 1734 

TCTGTAACATTTTTCAATAATCTGGGAAGCACTGCAATCCAGTGTACGATGTGCTGATTA 1675

Qy 8 82 GTGAGACCTGTGTAG 

AGGCGCACCAGCGGGTGAAGTACACCAAGGACCATACTGTG 93 7 



Db 16 7 4 GTGTTGTCTTTGTTGGCACTGACAGTCTTGCATG-GTCATATGCCAGTGTTTTTGCT- 

TT 1617 

Qy 93 8 CGCTCTAC — TGGGCCAGCCAAGTCTCG — 

GGATGGTGAACGTACTGTCTATTGCAAC-G 992 



Db 1616 AGC T T T T C T T T GAAT AAAAC AGAT T T AAATGC AT T T AGAC AAT AC - 

GTATTTGTAAGCTG 1558 



Qy 993 TACACAAGCATGAACCCCTTGTGCTGTTTTGTGAGAG CTGTGATACTCTCA — 

CCTG 1047 



Db 1557 

TTTACATACACTAAATTTAAGAGATTCAATATTAGAGTTTCTTTGTTTCTTTAAACACTA 1498

Qy 10 48 CCGAGACTGCCAGCT C AAT GC C C AC AAG GACCACCAGT 

ACCAGTTCTTAG 109 7 

I I I I I I I I I I I III I I I I I II 

Db 149 7 CAGAG — TG-CAAATCAGGTTCTTCA 

CAACAGATTGAATATTGAGCAGTTCTTTA 1446 

Qy 1098 

AGGATGCAGTGAGGAACCAGCGCAAGCTCCTGGCCTCACTGGTGAAGCGCCTTGGGGACA 1157
III I I I I I I I III I I I I I I 

Db 14 45 AAGA — AAGAGGGGGAAGAAAAAAAG 

CCCAAGTGA 1413 

Qy 1158 

AACATGCAACATTGCAGAAGAGCACCAAGGAGGTTCGCAGCTCAATCCGCCAGGTGTCTG 1217 



Db 1412 ATAAAACATTGAAACTATTCCCC TTCGAAAATAAAT 

TCTA 1373 

Qy 1218 ACGTACAGAAGCGTGTGCAAGTGGATGTCAAGAT-GGCCATCCTGCAGATCATG — 

AAGG 1274 



Db 13 72 AAATGATGTGGAATGTGAAATAAGGTTTTAACATAGGTGATCC — 

AAG T T TAT AG T TAG A 1315 

Qy 12 75 AGCTGAATAAGCGGGGC — CGTGTGCTGGTCAATGATGCCCAGAA 

GGTGACTGAGGG 1329 

II I I I I I I I III I I I I I I I I I I III 

Db 1314 AAC — AAAAAG AAG TCCTTCAT G AAAT 

AAAGGTTACAAGAACACGTTGCCTGTTTT 1261 

Qy 133 0 GCAGCAGGAGCGCCTGGAGCGGCAGCACTGGACCATGACCAAGATCCAGAA — 

GCACCAG 13 8 7 



Db 126 0 CCCCCATTATAAACTG 

AG AAG T G G G T AAAG AC GATGTTTCAGTAC G AAAAT AG 12 0 8 

Qy 13 8 8 GAG 

CACATTCTGCGCTTTGCCTCTTGGGCTCTGGAGAGTGACAACAACACAGCC— 14 41 

II II I I I I I I I I III III II I 

Db 12 0 7 G T G AC T AC AGG AT C AGC GC T T C AT C T C AC AT G C T GTACC 

CAAAGCCAG 116 0 

Qy 1442 - 

CTTTTGCTTTCTAAGAAGTTGATCTACTTCCAGCTGCACCGGGCCCTCAAGATGATTGT 15 0 0 
I I I I I I I I I III III III III 

Db 1159 GCTGTGGCTGTCCATACGGTGG 

TGCGGTATAACCATAACCAAAAGGTG 1112 



Qy 1501 

GGATCCCGTGGAGCCACATGGCGAGATGAAGTTTCAGTGGGACCTCAATGCCTGGACCAA 1560



Db 1111 CCTGGGGAGGGAC — GGCAA CACTGGGTCCT- 

GCTGTCAGCATCAA 1069 



Qy 1561 GAGTGCCGAGGCCTTTGGCAA — 

GATTGTGGCAGAGCGTCCTGGCACTAACTCAACAGGC 1618 

III I I I I III I I I I I I I 
Db 106 8 CTGGGGC-TGACCATAAACAATGGGTTGTGTCT- 
CTGTAGCTTGTTCTTCT 1020 



Qy 1619 CCTGCACCCA 

TGGCCCCTCCAAGAGCCCCAGGGCCCCTGAGCAAGCAGGGCTC 16 71 

II III II I I I I I III III I I I I I 

I I I 

Db 1019 TCTTTTCTCAGTGATTCTGAAGCATCTAATTTATCCA CCTTTGTCAAGTA- 



Qy 1672 

TGGCAGCAGCCAGCCCATGGAGGTGCAGGAAGGCTATGGCTTTGGGTCAGGAGATGATCC 1731 
I I I I I I I I I II I I I I I I I I I I 

I I I I 

Db 966 CTTCATGACC-TGGA — TGAAATAGGGC-ATGGC 

AAAATCC 930 



Qy 1732 

CTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCAGGTGAGGGCGA 1791 
II I I I I I I I I I I I I I I II 

I I I 

Db 929 ATGATATTGTGC CTCCATG CAGTT TCT 

AGGACGA 896 



Qy 1792 

GGTGAGCGGCCTTATGCGCAAGGTGCCACGAGTGAGCCTTGAACGCCTGGACCTGGACCT 1851 
I I I I I I I I I I I I 

I 

Db 895 CATCTG GCCTTAAAAG 



Qy 1852 

CACAGCTGACAGCCAGCCACCCGTCTTCAAGGTCTTCCCAGGCAGTACCACTGAGGACTA 1911 
I I I I I I I I I I I I I I I I I I III 

Db 8 77 C G T AAC AGG T AAAC AGAC AAGC TCCAAAGCA 

CTCT 843 



Qy 1912 

CAACCTTATTGTTATTGAACGTGGCGCTGCCGCTGCAGCTACCGGCCAGCCAGGGACTGC 1971
I I I I I I I III I I I I I I I I I I I I I I 

Db 8 42 CTTTTTTCTTCCTGCAAAAACCACTGCAGGAGTT- 

CTTCAGCCAATTCAGTATCTTT 78 7 



Qy 1972 

GCCTGCAGGAACCCCTGGTGCCCCACCCCTGGCTGGCATGGCCATTGTCAAGGAGGAGGA 2031
I I I I I I I I I I I I I I I I I I I I 

Db 786 AG AT T C AGAAGC ATACTGCATTGCAT 

CCTTGTAA 753 



Qy 2 032 GACGGAGGCTGCCATTGGAGCCCCTC- 

CTACTGCCACTGAGGGCCCTGAGACCAAACCTG 2 090 

I I I I I I I I I III I I I I I 



I 

Db 752 AGGCTGTCTTTCTTGCACAGCTCTAC- 

ACTCTGTTTCCA G 714 



Qy 2091 

TGCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAG — 214 8 

I I I I I I III III III II I I I I I I I I II 

Db 713 CGATTGTTGCCTTTGAAGAG AT AAGC AGC AAT T C T 

CCTGAACTCAATGAGTT 662 

Qy 2149 — 

TGGCAGCACCAGCTCAGGGCTGGAGGTGGTGGCTCCTGAGGGTACCTCAGCCCCAGGT 2 2 06 

II I I I I I I I I I I I III I I I I I I 

I 

Db 661 CATGTTTTTCCAAACGCTGAGCAAGC- 

GAGATATTGTCAAAGTTGTCATAAGCATCTATT 603 

Qy 2207 

GGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCCGTGTCTGCCAGAAGCCAGGC 2266
I I I I I I I I I I I I I I I I I I I III 

Db 6 02 GATGTTCGCAGAGCC TGATAAT 

CTTCTTCTGTAATAAAAAGA 561 

Qy 226 7 GATCTGGTTATGTGCAACCAGTGTGAGTTTTGTTTCCACCTGG- 

ACTGTCACCTGCCGGC 2325 

I I I I III III I I I I I I I I I III III I I I I 

I I 

Db 56 0 TTGTTCAATG-ATTCATTCACAGATTTGTTGTTA — TGGTTCTG — 

AACTGAACGC 510 

Qy 2326 

CCTGCAGGATGTACCAGGGGAGGAGTGGAGCTGCTCACTCTGCCATGTGCTCCCTGACCT 2385

III I I I I I I I I I I I I I I II III I III 

I I 

Db 509 AAATACGGTTTCACCAGTGGT AGCTGTTTAACCTTGC-TGAAATAATTGA- 

CT 459 

Qy 2386 

GAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTGCAGACAGCACTGGCGTGGTGGCCAA 2445
I I I I I III I I I I I I I I I I I I I I I I 

Db 45 8 GCACGAGTGTGAT CCAACC 

GTGGAGACAGCAC 42 7 

Qy 2446 

GCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTCACGA 2505

I I I I I I I I I I II III I I I I I I I I I 



Db 42 6 CATCAGCAAATCATTTAACAACAGAG GCTTG 

AAT T C T AAG TAG A 3 83 

Qy 2506 

ACCCTGCCGCCCCCTGCATCAGCTGGCTACCGACTCCACCTTCTCCCTGGACCAGCCCGG 2565
III II I I I I I I I I I I I I 

Db 3 82 ACTGTATTGC TCTGTAGTATA 

GTTCCACAT 353 

Qy 2566 TGGCACCC- 

TGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGAAGTTGTCACCTCCCT 2 62 4 

I I I I I I I I I I III I I I I I I I I I 

I I I I 



Db 352 TGGCAACCTTGGTAATGAT ATCTTTG- 

CCCT 318 



-AATTG- 



Qy 2625 

ACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCAAGCAATTCAACAAGT 2684

I III I I I I I I I I I I I I I I I I I I 

I 

Db 317 TCT-TTCCAGGCATCAGTT GGATG 

ATTCATCATGG 284 

Qy 2685 

TAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTGCAGCGCTTCTTCGAGACGC 2744

I I I I I I I I I I I I I I I I I I 

Db 2 83 TAATTATGG CATTATC 

ATATTCTTC 259 

Qy 2745 

GCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGGTGGAGCCCCCGCCGA 2804

I I I I I I I I I I I I I I I I I III 

I I 

Db 258 ATACTTGTCATACAAAAACACCA GTTCTG 

CCCAAAGA 222 

Qy 2 8 05 TGAGCCTG-CCTGGTGCTGGCCTGAGTTCCCAGGAGCTGT 

CTGGTGGCCCTGGTGAT 2860 

I I I I I I I III III I I I I II II II II I I I I I 

I 

Db 221 TGAGCTTGTTCTGCAGCT C T T AGC AC C T T GGG AAT AT T C AC T - C TAG AC C AG — 

AAC 168 



Qy 2 861 GGCCCCTGAGGCTGGAGCCCCCAT GGCC 

AGCCCAGCCTGGC 29 01 

I I I I I I I I I I I I I I III II II 

Db 16 7 AGCTCCAGGTGCT 

CCCTCATTTTCT G AGGC T T AAAT T T AGAGT AT AGAAT AGC T AA 112 
Qy 2902 -TCTGTTCTCTGTCC — 

TGTCACCCCATCCCCACTCCCCTGGTGGCCTGACTCCCACTCC 2 95 8 

I I I I I III I I I I I I I I I I I I I III III I 

I 

Db 111 TTCAGTAAACATTCCCATGTGAGCTCG-CTCAAGTCCC — AGTG — 
CTGCTTCCAA C 6 0 

Qy 2959 CTGGTGGCCCCATCCCCCAGTTCCTCACGAT- 

ATGGTTTTTACTTCTGTGGATTTAATAA 3017 

I I I I I I II II I I I I I I I III I I I I I I 
Db 59 ATGGTGATCAGCTCTTCAAAATAGCCACGATCCTGATAGTAGC — 
CAGTAGG 10 

Qy 3018 AAAAAAC T T C AC C AG T 3 033 

II I I I I 

Db 9 CCAACAGT 2 



RESULT 10 

US-10-478-914-17/C 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 



; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 

; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17
; LENGTH: 2198
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-17 

Query Match 27.3%; Score 5339; DB 1; Length 2198; 

Best Local Similarity 41.5%; Pred. No. 5.6e-298; 
Matches 1274; Conservative 0; Mismatches 814; Indels 982; Gaps 180;

Qy 6 GCGGGCCGGCGGCG-GAGGCCG GGCC 

GCGGAG 36 



Db 2198 GC- 

GACCTGCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTATGGTG 214 0 

Qy 3 7 CCAGGAGTGACTAGCAGCAGTTGGCCGTGCCGTAGCAGCGTCC- 

CGCGCGCGGCGGGCAG 95 

III I I I I I III I I I I I I I I I I I 

Db 2139 CCAATTTTAAATAGTTTTATTT 

AAG AC AT TGCATTTTC C AC T T AC AAT AC AG 2 088 

Qy 96 

CGGCCCAGGAGGCGCGTGGTGCGGGTTTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGGCG 155 



Db 2087 -TGTTTATAAAGTGCAATGT TATTTC — CTTCCCCTGT-GCATATGTTC- 

CA 2041 

Qy 156 

CCTTCGGGAGGCGAGCAGGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGG 215 

III I I I I I I I I I I I III I I I I I I I I I 

Db 2 0 40 TATTCAAGTATTGAGAATGCC-CAGT — AACTTACTATAGCAGCTTAACTT- 

TTT 1990 

Qy 216 GCAGCGGCC- 

CAGGAGGCGCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGG 2 7 4 



Db 19 89 AAAACTGCCACAGAA TTTGCTACG 

AATTTAGG 1958 

Qy 2 75 CCCAGCAGTTG GCGGCGAGCGCGTCTGCGCCTGCGC — 

GGCGGGCCCCGCGC 32 4 

II II II II III III I I I I 

Db 1957 



TCCTTCAAATGTTTTAAATGTGTGGAACAATGCTACATCTACACTTGGTTGGCTTAATCA 1898



Qy 325 CCCTCCTCCCC CCCTG 

GGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCG 374 



Db 189 7 ACCTCTTCAATGGTGGGCCCTGAGGAAGCACCACCAGAGG GAGGAGC- 

TCCACCA 18 44 

Qy 3 75 CGGCGGCAGCCTCGGCAGCAGCG-GCCTC — GGCC- 

GCCTCTGGCAGCCCGGGCCCGGGC 43 0 

I I I I I I III II I I I I III I I I I I I I I I I I I I 
Db 18 43 CCAGGAAATCCCC — CAG — GCATTCCTCCTGGCATGCCTCCTGCA- 

CTCTGGTACAGCT 1789 

Qy 431 

GAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCCTTCGGCCGCAGCCTCGGCCTCT 490

III I I I I I I I I I I II I I I I I I I I I I II 

I I I 

Db 178 8 TGGTGAT— GATGGGGTTGCAAA-CTTTCTCCAGCTCTTTCTGTTGATG- 

TTCAAATTCT 1733 

Qy 491 GC— CTCAGCCG- 

CGGCGTCGTCGCCCGCGGGGGGCGGCGCCGAGGCGCTGGAGCTGCT 5 46 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 1732 TCCTTCTCAGCAGTCTGATTCTTATCAAGC 

CAGTTGATAAT 1692 

Qy 547 GGAGCACTGCGGCGTGTGCAGAGAGCGCCTG — 

CGACCCGAGAGGGAGCCCCGCCTGCTG 604 

I I I I I I I I I I I I I I III III I I I I I I I I 

I I 

Db 1691 — TTCATTAC-ACTTGTCCAGA-ATCTTCTGTTTGTCCTCATCGTTAATCTTGCCT- 



Qy 6 05 CCCTGTTTGCACTCGGC — 

CTGTAGTGCCTGCTTAGGGCCCGCGGCCCCCGCCGCCGCCA 662 

I I I I I I I I I III I I I I I I I I I 

I I 

Db 1637 — AAGTTT CTCATCTTCAACAGT TGCTTT 

CATGTTGAAGGCA 159 8 



Qy 663 ACAG-CTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACTG — 

TCCCGTGT 719 

I I I I III I I I I I I I I I I I III I I I I 



Db 159 7 TAGGACTCAAGTGAATTCTTG- 

GATGACACCTTGTCCCTCTGCTTCTCATCT 15 47 



Qy 720 

GCAAGCAACAGTGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCGTGATAGTGGCA 779
I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1546 TC-AGCTT-TGTACTTCTC AGCTTCCTGGA CCATACGT- 

TCAATGTCT 1502 

Qy 780 

GCAAGGCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCACTAGCTGTGAGGATAATG 839 
I I I I I I I I I I I I I I I I I I 



I 

Db 1501 TCCTTGCT-CAAACG — GCCCTTGT- 

GATAGTA 1466 



-CATTAGT- 



Qy 8 40 CCCCAGCCACCAGCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACC- 

TGTGTAGAG 89 8 

II I II I I I I I I I I I I I I I I I I I I I 

Db 1465 ATCTTGTTCTCTTTTCC CGTACTCTTGTCCACAGC- 

AG AG AC AT T GAG T A 1417 

Qy 899 GCGCACCAGCGGGTGAAGTACACCAAGGACCATACTGTGCGCTCTA- 

CTGGGCCAGCCAA 957 

I I I I II I I I I I I I I I III I I I I I I I I I 

Db 1416 TACCATTGG CATCAATGTCAAAA — 

G T G AC T T C AAT C T G AGG AAC AC — 13 72 

Qy 958 GTCTCGGGATGGTGAACGTA 

CTGTCTATTGCAACGTACACAAGCATGAACCCCTTGT 1014 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

I I I I 

Db 13 71 — CTCGGGGTGCAGGAGGTATGCCTGTGAGTTCAAACTTGC-CAAGCAGG 

TTGT 1321 



Qy 1015 

GCTGTTTTGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCCAGCTCAATGCCCACAA 1074
I I I I I I I II I I I I I I I I I III I I I I I 

I 

Db 132 0 TATCCTTTGTCATGGC ACGCTCGCCTTCATAAAC 

CTGAAT A 12 8 0 



Qy 1075 

GGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAGCGCAAGCTCCTGGCCTC 1134
I I I I I I I I I I I I I I I I I I I I I I I III III 
Db 12 79 AGCACACCAG-GCTGGTTGTCAGAATAGGTAGTGAAGGT-CTGTGTCTGCT — 

TGGTAGG 122 4 



Qy 1135 ACTGGTG AAGCGCCT-- 

TGGGGACAAACATGCAACATTGCAGAAGAGCACCAAGGAG 1189 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1223 AATGGTGGTATTACGCTTGATGAGGACAGTCATGACTC- 
CACCA 1181 



Qy 119 0 GTTCGCAG-CTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTG 

TGCAAGTGGATGT 12 45 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1180 GCAGTTTCAAT — ACCAAG 

GG AAAG AGG AG T G AC AT C C AAG AG 1140 



Qy 1246 

CAAGATGGCCATCCTGCAGATCATGAAGGAGCTGAATAAGCGGGGCCGTGTGCTGGTCAA 1305
I I I I I I I I I I I II II 

Db 1139 C AGC AAAT C T T G AAC AT T C T CAG 

ACTTGTCT- 1109 



Qy 13 06 TGATGCCCAG — 

AAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGCAGCACTGGACC 1363 

I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1108 CCAGACAAGATGGCT GCCTGGA-CAGCTGC- 

ACC 1077 



Qy 1364 

ATGACCAAGATCCAGAAGCACCAGGAGCACATTCTGCGCTTTGCCTCTTGGGCTCTGGAG 1423 
I I I I I I III I I I I I I I I I I I I I 

Db 10 76 ATAAGCAA CAGCTTCATCAGGG 

TTGATGCTCTT 1044 



Qy 1424 

AGTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTACTTCCAGCTGCACCGG 1483

I III I I I I I I I I I I I I I I I III II II 

I 

Db 10 43 ATTCAGTTCTTTTCCATT-GAAGAAGT 

CTTGGAGAAGCTTCTG 10 02 



Qy 148 4 GCCCTCAAGATGATTGTGGATCC- 

CGTGGAGCCACATGGCGAGATGAAGTTTCAGTGGGA 1542 

I I I I I I I I I I I I I I I I I 

Db 1001 AATC 

TTGGGGATACGAGTAGAACCAC 9 76 



Qy 1543 

CCTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCAAGATTGTGGCAGAGCGTCCTGG 1602
I I I I I I I I I I I I I I I I I I I I I I I I 

Db 9 75 CAACCAGGACAATATCATGAAT-CTGTGAC — TTGTCTAGTTTG 

GCAT 931 



Qy 1603 

CACTAACTCAACAGGCCCTGCACCCATGGCCCCTCCAAGAGCCCCAGGGCCCCTGAGCAA 1662 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 93 0 CTCGAAGGGCTTT 

CTCTACTGGGTCCAGGGTGCCACGGAACAG 8 8 8 

Qy 1663 GCAGGGCTCTGGCAGCAGCCAGCCCATGGAG — 

GTGCAGGAAGGCTATGGCTTTGGGTCA 172 0 

I IIII III I I I I I I I I I I I I I II

I 

Db 887 GT CAGCATTCAGTTCTTCAAATCGGGCACG — GGTAATGG 

AGGTATA 8 43 



Qy 1721 

GGAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCA 1780
I I I I I I I I I I I I I I I I I I 

Db 8 42 GAAGTCGATTCCTTCATA 

G AG AG AAT C GAT C T C AAT — 807 



Qy 1781 

GGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCACGAGTGAGCCTTGAACGCCTG 1840

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 8 06 ACTGGCCT GGGTGCTGGAAGAGAG 

GGTACGCTTA 7 73 

Qy 1841 

GACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCAAGGTCTTCCCAGGCAGTACC 1900
I III I I I I I 

I I I I I I I 

Db 772 G CACG 

TTCACAAGCAGTACG 753 



Qy 1901 

ACTGAGGACTACAACCTTATTGTTATTGAACGTGGCGCTGCCGCTGCAGCTACCGGCCAG 1960

II I I I I I I I I I I I I I I I I I II 

Db 752 GAGGCGTCTTACAGCTCTCTTGTTCT CACTGATGT 

CCTT 714 

Qy 1961 

CCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCCCACCCCTGGCTGGCATGGCCATTGTC 2020

I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 713 CTTATGCTTGCGCTT AAACTCAG 

CAATAAAATGGTTGACCATTCGGTTGTC 663 

Qy 2021 

AAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCTCCTACTGCCACTGAGGGCCCTGAG 2080

II I I I I I I I I I I I I I I 

Db 662 AA AATCTTCTCC ACCCAAGTGGG 

TGTC 636 

Qy 2081 

ACCAAACCTGTGCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCCCCGCCTGGCC 2140
III III III I I I I I I I I I I 

I I 

Db 635 TCCA GCTGTAGACTT GACCT CAAAGATTCCAT 

CC 602 

Qy 2141 TCACCTAGTGGCAGCACCAGCTCAGGGCTGGAGGTGGTGGCTCCT- 

GAGGGTACCTCAGC 2199 

I I I I I I I I III I I I I I I I I I I I 

Db 601 TCA-ATAGTG AGGAT 

T G AC AC AT C AAAAG T GC C AC C - T 566 

Qy 2200 

CCCAGGTGGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCCGTGTCTGCCAGAA 2259 

I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 565 CCCAGGT CAAAGATGA GC— ACGTTT— C- 

TTTCTGC 534 

Qy 2260 

GCCAGGCGATCTGGTTATGTGCAACCAGTGTGAGTTTTGTTTCCACCTGGACTGTCACCT 2319 

II I I I I I I I I I I I II I I I I 

I 

Db 533 TC CAACCT TTTTGT 

CTAAGCCGTAAGCA 506 

Qy 232 0 GCCGGCCCTGCAGGATGTACCAGGGGAGGAGTGGAGCTGCTCACT 

CTGCCATGTGC 2375 

I I I I I I I I I I I I I I I I I I I I I 

Db 505 ATAG CAGCAG CAGTTG 

GCTCATTAATAATTCTAAGTAC 46 8 

Qy 23 76 TCCCTGACCTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTGCAGACAGCA- 

CTGGC 2434 

I I I I I I I I I I I I I I I I I I 

Db 46 7 ATTGAGACCAGCA ATAGTTC 

CAGCATCT 440 



Qy 2435 

GTGGTGGCCAAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGCCCTA 2494



1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 

Db 439 TTGGTAGCCTGACGCTGA-GAGTCAT- 

TGACC 392 

Qy 2495 

TTCTGTCACGAACCCTGCCGCCCCCTGCATCAGCTGGCTACCGACTCCACCTTCTCCCTG 2554

II I I I I I I I I I I I I I I I I I I I I I 

I 

Db 391 -ACAGCATTGGTAACAGTCTTCCCAAG 

GTAGGCTTCTGCAATTTCCTTCATCTTT 338 

Qy 2 555 GACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGG- 

AGAAGTT 2 613 

II I I I I I I I III I I I I I I I I I I I I 

Db 33 7 GTC AGAACCATAGA — AGAC 

ACCTCCTCTGGATAGAAGCT 300 

Qy 2614 

GTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCA 2669

I I I I I I I I I III I I I I I I I I I I I I 

I 

Db 299 TTTGGTCT-CTCCCTTGTATTCTAC TTGGACCT TGGGCC 

TGCC 258 

Qy 26 70 AGCA — 

ATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTG — 2 725 

I I I I I I I I I I I III I I I I III I I I I I I I I I I I 

Db 257 AGCATCATTCACCACCATAA AGGGCCAATGT TTCAT- 

ATCAGACTGGA 211 

Qy 2726 CAGC-GCTTCTTCGAGAC- 

GCGCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTG 27 83 

I I I I I I I I I I I I I I I I I I I I I I I II II 

I I I 

Db 210 CAACAGCATCATCAAATCTGCGTCCAATC-AGACGTTTGGCATCAAAAA 

CTG 160 

Qy 2 78 4 TGCTGGTGGAGCCCCCGCCGATGAGCCTGCCTGGTGCT 

GGCCTGAGTTCCCAGGAG 2 839 

I I I I I I I I I I II I I I I I I I I I I I I III 

I 

Db 159 TGTTGGTGGGGT TCATTG CAACTTGATTCTTTGCGGCATCA 

CCGATCAA 111 

Qy 2840 

CTGTCTGGTGGCCCTGGTGATGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTG 2899

III I I I I I I I I I I I I II I I I I I II I II 

Db 110 CCGTTCAGTGTCC GTAAAGGCGACATAGCTTGGAG TG 

GTTCGGTTT- 65 

Qy 2900 

GCTCTGTTCTCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGCCTGACTCCCACTC — 2 95 7 

I I I I I I III I I II I I I I I I I I I I I I I I I I I 

Db 6 4 -CCCTGATCATTGGCAATTATC TCGACTTTTCCGTGCTGG- 

AAAAC AC C C AC AC AA 11 



Qy 2958 --CCTGGTGG 2965

III II

Db 10 GACCAGTAGG 1

Qy 51

CAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAGGCGC 110 

I I I I I I I I I I II I I I I I I I 

I 

Db 1 CA-CTGTTGGCC TA CTGGCTACTATC 

AGGATC 31 



Qy 111 

GTGGTGCGGGTTTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGGCGCCTTCGGGAGGCGAG 170
I I I I I III I I I I I I I I I I I 

Db 32 GTG — GCTATTTT G AAG AGC T GAT — C AC CAT — 

GTTGG 6 4 



Qy 171 

CAGGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGGAG 230

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 65 -AAGCAGCACTGGGACTTG AGCG AGCTCAC- 

ATGGGAATGTTTACTGAAT 112 

Qy 231 

GCGCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCGGC 290

II I III I II I I I I I I I 

I 

Db 113 T AGC TAT T C T AT AC TC T AAAT T T AAGC CTCAGAAAATG- 



Qy 



291 GAGCGCGTCTGCGCCTGCGCGG — 



CGGGCCCCGCGCCCCTCCTCCCCCCCTGGGCGCCCC 348 

I I I I I III I I I I I I I I II I I III I I I I 

Db 154 GAGCAC— CTG GAGCTGTTCTGGTCTAGAGTGAATATT 

CCCAAGGTGCTAA 2 02 

Qy 349 

CGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCGGCCGC 408

I I I I III I I I I I I I I I I I I II 



Db 2 03 GAGCTGC AGAA CAAGCTCATCTTTG GGCAGAA 

CTGGTGT 241 

Qy 409 

CTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCC 468
III III I I I I I I I I I I I 

I 

Db 242 TTTTGTAT 

GACAAGTATGAAGAATATGATAATGCCATAATTACCATGATGAATCA 2 96 

Qy 469 TTCGGCCGCAGCCTCGGCC- 

TCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCG 52 7 



Db 29 7 TCCAACTGATGCCTGGAAAGAAGGGCAATTCAAAGATATCATTACC— AAGGTTGC- 

CA 352 

Qy 528 

CCGAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACCCGAGA 58 7 

III II I I I I I I I I I I I I III III I 
Db 353 AT G T G G AAC T AT A- C T AC - AG AGC AAT AC AG T - 
TCTACTTAGAATTC 396 

Qy 58 8 GGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTG- 

TAGTGCCTGCTTAGGGCCCGCG 6 46 

I I I I I III I I I I I I I I I I I I II II 

I I 

Db 397 — AAGCCTCTGTTGTTAAATGATTTG CTGATGGTGCTGTCT 

CCACG 440 



Qy 6 47 G CCCCCGC-CGCCGCCAA CAGCTCGG- 

GGGACGGCGGGGCGGCGGGCG 692 



Db 441 GTTGGATCACACTCGTGCAGTCAATTATTTCAGCAAGGTTAAACAGC — 

TACCACTGGTG 49 8 

Qy 693 ACGGCACCGTG 

GTGGACTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGACATCG 7 49 

I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 499 A AAC CGTATTTGCGTT C AG T T C AG AAC CAT AAC AAC A 

AATCTGTGAATGAATCA 552 

Qy 750 

TGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGCCCAGGATGCGA 8 09 
III I I I I I III I I I I I I I II III 

I I I I I 

Db 553 TTGAACAATCTTT T T AT T AC AG AAG - AAG AT TATCA — GGCTC — 

TGCGA 59 7 



Qy 810 ACCAGTGCTGCACTAGCTGTGAGGATAATGCCCCAGCCACC — 

AGCTACTGTGTGGAGTG 86 7 



Db 59 8 ACAT CAATAG 

ATGCTTATGACAACTTTGACAATATCTCGCTTG 640 

Qy 86 8 CTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCAC- 

CAGCGGGTGAAGTACACCAAGG 926 



Db 6 41 CTC — AGCGTTTG-GAAAAACAT G AAC T C AT T GAG T - 

TCAGGA 6 79 

Qy 92 7 ACCATACTG- 

TGCGCTCTACTGGGCCAGCCAAGTCTCGGGATGGTGAACGTACTGTCTAT 9 85 



Db 680 GAATTGCTGCTTATCTCTTCAAAGGCAAC — AATCGCTGGA 

GTGTAG 73 0 

Qy 9 86 TGCAACGTACACAAGCATGAACCCCTTGTGCTGTTTTGTGAGAGCTGTGATACTCT- 

CAC 1044 



Db 731 AGCTGTG C AAGAAAGAC AGCC T T - T AC AAGGATGC - 

AATGCAGTATGCTTCTGAAT 78 4 

Qy 1045 

CTGCCGAGACTGCCAGCTCAATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGC 110 4 



Db 785 CTAAAGATACTG AATTGGC — TGAAGAACTCCTG CAGTGGTTT 

TTGC 829 

Qy 1105 AGTGAGGAACCAGCGCAAGCTCCTG- 

GCCTCACTGGTGAAGCGCCTTGGGGACAAACATG 1163 



Db 83 0 AGGAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTA 

GTTACGATCTTT 8 83 

Qy 116 4 CAA — 

CATTGCAGAAGAGCACCAAGGAGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGT 12 21 

III I I I I I I I I I I I I I I I I 

Db 884 TAAGGC CAGATGTCGTCCTAG 

AAACTGCATGGAGGC 919 

Qy 1222 ACAGAAGCGTGTGCAAGTGGATGTCAAGATGGCC 

ATCCTGCAGATCATGAAGGAGCT 12 78 

III II I I I I I I I I I I I I I I I I I I I I I 

Db 92 0 ACAATATC 

ATGGATTTTGCCATGCCCTATTTCATCCAGGTCATGAAGGA 96 8 

Qy 1279 

GAATAAGCGGGGCCGTGTGCTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGA 133 8 



Db 969 GTACTTGACAAAGGTG- 

GATAAATTAGATGCTTCAGAA 10 05 



III I II 
-AACAGA— 



III I I I I I 
CCT — 



Qy 1339 

GCGCCTGGAGCGGCAGCACTGGACCATGACCAAGATCCAGAAGCACCAGGAGCACATTCT 139 8 

II II III I I I I I I I I I I I I III I I I I I 

Db 10 06 T C AC T G AG AAAAGAAG AAGAAC AAGC T AC AGAG AC AC 

AACCCATT— 1050 

Qy 1399 GCGCTTTGCCTCTTGGGCTCTGGAGAGTGACAACAACACAGCCC — 

TTTTGCTTTCTAAG 1456 

III II III I II I I I I I III I I I I I 

Db 1051 — GTTTATGGTC 

AGCCCCAGTTGATGCTGACAGCAGGACCCAGTGTTGC 109 7 

Qy 1457 AAGTTGATCTACTTCCAGCTGCACCGGGC-CCTCAAGATGATTGTGG — 

ATCCCGTGGAG 1513 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 109 8 CGTCC — CT-CCCCAGGCACCTT TTGGTTATGGTTATACC 

GCA 113 7 

Qy 1514 

CCACATGGCGAGATGAAGTTTCAGTGGGACCTCAATGCCTGGACCAAGAGTGCCGAGGCC 15 73 



Db 113 8 CCAC CGT-ATGGA CAG CCACA — GCCTGG-CTTTGGGTAC 

AGCA 1177 

Qy 1574 TTTGGCAAGATTGTGGCAGAGC — GTCCT GGCACTAA— CTC- 

AACAGGCCCTGCAC 1625 

III I I I I I I I II I I I I I I I I I II III I 

Db 1178 TGTG AGAT 

GAAGCGCTGATCCTGTAGTCACCTATTTTCGTACTGAAACATCGT 1230 

Qy 1626 CCATGGCCCC-TCCAAGAGCCCCAGGGCCCCTGAGCAAGCAGG 

GCTCTGGCAGC 16 78 



Db 1231 CTTTACCCACTTCTCAGTTTATAATGG 

GGGAAAACAGGCAACGTGTTCTTGTAAC 12 85 

Qy 16 79 AGCCAGCCCATGGAGG — TGCAGGAAG 

GCTATGGCTTTGGGTCAGGAGATGATC 173 0 

I I I I I I I I I I I II 

Db 1286 

CTTTATTTCATGAAGGACTTCTTTTTGTTTCTAACTATAAACTTGGATCA- 
Qy 1731 

CCTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCAGGTGAGGGCG 179 0 



Db 1336 CCT ATGTTAAAA CCTTATTTC 

ACATTCCACATCATTTTA G 13 75 

Qy 1791 AGGTGAGCGGCCTTATGCGCAAGGTGCCACGAGTGA — 

GCCTTGAACGCCTGGACCTGGA 18 48 

I II I I I I I I I I I I II 

Db 1376 AATTTA TTTTCG- 

AAGGGGAATAGTTTCAATGTTTTATTCACTTGGGCTT 142 4 



I I I I III 

1335 



Qy 18 49 CCTCACAGCTGACAGCCAGCCACCCGTC-TTCAAGGTCTTCCCAGGCAGTACCA- 

CTGAG 19 06 



Db 1425 TTTTTCTTCC — CCCTCTTTCTTTAAAGAACTGCTCA — 

ATATTCAATCTGTT 1473 

Qy 19 0 7 GACTACAACCTTATTGTTATTGAACGTGGCGCTGCCGCTGCA — 

GCTACCGGCCAGCCAG 196 4 



Db 1474 GTGAAGAACCTGAT TTGCACT 

C T G TAG T G T T T AAAG AAAC AAAG 1517 

Qy 1965 GGACTGCGCCTGCAGGAACCC CTGGTGCCCCACCCCTGGCTGGCATGGCC — 

AT 2016 



Db 1518 AAACT-CTAATATTGAATCTCTTAAATTTAGTGTATGTAAAC- 

AGCTTACAAATACGTAT 15 75 

Qy 2017 

TGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCTCCTACTGCCACTGAGGGCCC 2 0 76 



Db 1576 TGTCTAAATG CATTTAAATCTGTTTTATT — 

C AAAGAAAAGC T 1616 

Qy 2 0 77 T GAG AC C AAAC - 

CTGTGCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCCCCGCC 2135 



Db 1617 AAAGCAAAAACACTG-GCATATGAC-CATGC — AAG 

ACTGTCAGTGCCAACA 166 4 

Qy 2136 TGGCCTCACCTAGTGGCAGCACC- 

AGCTCAGGGCTGGAGGTGGTGGCTCCTGAGGGTACC 2194 



Db 1665 AAGAC AAC AC T AAT — C AGCACATCGTACA 

AGTGCT 170 7 

Qy 2195 TCAGCCCCAGGTGGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCC- 

GTGTCTG 2253 



Db 170 8 T CCCAGATTAT TGAAAAATG— TTACAG 

ACAACTTGCCTGATTTTT 1751 

Qy 2254 CCAGAAGC CAGGCGATCTGGTTATGTGCA 

ACCAGTGTGAGTTTTGTTTCC 23 03 

I III I I I I III I I I I I I I I I 

Db 1752 AAATGAGCGTAAAAGGCCCTCT- 

AAC C T AT GC AGG T TTCCCCATTAT GC AT AT AGAAAAT 1810 

Qy 23 0 4 ACCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGG 

AGGAGTGGAGCTGCT 236 0 

III III III I I I I I I I I I I I I I I I 

Db 1811 G-CTAGTATGTTT — TGCT — CACTTCA- 

TATGTAACAGGTGCCCTTATGTTGTGCTG — 1862 



I I I I I III 
-CTGGA TTGC-- 



Qy 2361 

CACTCTGCCATGTGCTCCCTGACCTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTG 2 42 0 



I I 1 1 1 1 1 1 I 1 1 1 1 1 1 II II I 

Db 1863 TATCCTGTGCT TTTTCTGTGGGA CCATT- 

CCATTCAGGAG 19 01 

Qy 2421 

CAGACAGCACTGGCGTGGTGGCCAAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTG 2 4 8 0 

I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 19 02 CAAAGAGCAC CATGAT-TCCAATCT 

TGTGTGTGTT 1935 

Qy 2481 

TACTGCTGGCCCTATTCTGTCACGAACCCTGCCGCCCCCTGCATCAGCTGGCTACCGACT 2 5 4 0 

II I I I I I I I I I I I I I I I I I I I I I 

Db 1936 TA CTAACCCT TCCCTGAGG 

TTTGTGTATGTTGGATA 19 71 

Qy 2541 

CCACCTTCTCCCTGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCC 2 6 0 0 

II I I I I I I I I I I I I I III 

Db 1972 TT GTGGTGTTTTAGAT CACTGA 

GTG 1996 

Qy 26 01 TCCAG GAGAAGTT — 

GTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATG 2 65 4 

I I I I I I I I I I I II I I I I I I I I I I I I 

I I 

Db 199 7 T AC AG AAG AG AG AAAT T C AAAC AAAAT AT TGCTGTTCTTC AGTTT 

TGTTTG 2047 

Qy 2655 TGGGC — 

CGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTC 2 712 

III II II I I I I I I I I I I I I I I I II I 
Db 2 0 48 TGGAATTTGAAATTACTCAAATTTAAAATAAATTACT — GGACT — 
GTGGAAAT 2 09 7 

Qy 2713 

CATCATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACC — 2 7 7 0 
I I I I I I I I I I II I I I I I I I I I I III 
Db 2 09 8 - AAC AT AG AAT T G AAGT T T T AAT T AAAT AC - 

C AC T C AAAC G AAAAG AAC AG TAG T T T T T G 2155 

Qy 2771 

AAGTTCTCTGCTGTGCTGGTGGAGCCCCCGCCGATGAGCCTGCCTGGTGCTGGCCTGAGT 2 83 0 

I I I I I I I I I I II II I I I I I I I II II 

I I I I 

Db 2156 TAGTTTTATATTG- GAT AC T G AGGC AT TAG — GGAG- 

GCATGAAAGGAAGAGGAATGAG- 2210 

Qy 2 831 TCCCAGGAGCTGTCTGGTG — GCCCTGGTGATGGCCCCTGAGGCTG- 

GAGCCCCCATGGC 2887 

I I I I I I I I I I I I II I III II 

Db 2211 G AT T GAG AC AT G T G AAG AC AT T G T G 

CATTATATCAATGTGCATTCCTGTA 226 0 

Qy 2888 

CAGCCCAGCCTGGCTCTGTTCTCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGC 2 9 4 4 

I III III II III I I I I I I I I I I I 

Db 2261 



GTTCATTAACAAGGTACATGCAATAGTCTAAAGAACCAGAGTCACTACTATAGTGGCTTA 232 0 



Qy 2945 CTGACTCCCA CTCCCTGGTGGCCCC- 

ATCCCCCAGTTCCT 2983 



Db 2321 

ACATTTAATCTGTCTCCAATATTTTAACCAAGTGACACCGGGGTTTTTATCGAAGCATTT 23 8 0 

Qy 29 8 4 CAC-GATATGGTTTTTACTT — CTGTGGATTT 

AATAAAAAAAACTTCACCAGT 3 033 

III I I I I II I I I I II I I I I I I I I I 

Db 2381 

C AC T T AAAT G AAC AAAT CATGGCTGT TAT AT T AAC T TGAAAT AAAAT AT AT T T AAAC AT G 2 4 4 0 

Qy 3034 TCAAAAAAAAAAAAAAAAAAAAA 3056 

I I I I I I I I I I I I I I I I I I 
Db 2441 TAAAAAAAAAAAAAAAAAGGCCA 2463 
Qy 277 

CAGCAGTTGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCC 336 

I I I I I I I I I I I I I I I I I I 

Db 2336 CAGCT CGAGCACATGTG-GCCT 

TTTTTTTTT 2307 



Qy 337 

CCTGGGCGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGC 396 

I II II I I I I I I I I I I III 

I 

Db 2306 TTT TTTTTTTTGTATTCTCAG 



AGCTGCCAGGAGTGCATC 226 8 



Qy 39 7 GGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCG- 

CTGGCGGCGAAAAGC 455 

I I I I I I Mill II I I I I I I I I 

Db 226 7 G AGCCTGTAATTTCCTGTTCTC 

TGAATCCCCCATCTTTCTGC A 2225 

Qy 456 

GCTCCACCGCCCCTTCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCG 515 

I I I I I I II I I I I I I I I I I I I I I I I I II I 

I I 

Db 222 4 GCTCCA-AGCTTTGTGTCCCACAGCCTGTGACTCTG- 

TGCTAACAAATCGCTATTGTCC- 216 8 



Qy 516 

CGGGGGGCGGCGCCGAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCC 5 75 
I I I I I I I I I I I I I II I I I I I I I I I I 

Db 216 7 AGTGGGGCGAATGGTGGCTGGA ACTAAAGAATTGC TGT- 

CTGGTTTCTAT 2119 



Qy 576 TGCGACCCGAGAGGGAGCCCCGCCT GCTGCCCTGTTTGCACTCGGCCTG 

TAGT 62 8 

I III I I I I I I I II I I I I I I I I I I I I 

Db 2118 TCAAATCC AGG T AGC G AG AT AT AT G AAT GG AC T T T T C G - 

AATCGTCATGTGAATAAC 2 063 



Qy 629 GCCTGCTTAGGGCCCGCGGCCCCCGCCGCCGCCAACAGCTCGG — 

GGGACGGCGGGGCGG 6 86 

I I I I I I III I I I I I I I I I I I I I I I 

I I 

Db 2 062 GTCTGCTC — GGCATGAAGGCTCAG AGCCA — 

TGCTAGGAAGGATTAACTCGTAGG 2011 

Qy 687 CGGGCGAC-GGCA- 

CCGTGGTGGACTGTCCCGTGTGCAAGCAACAGTGCTTCTCCAAAGA 7 4 4 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2010 CTGACCACTAACATCCTTTGTGGT ACGAGGG— AGAAACA- 

TTCCCAAGT 196 4 



Qy 745 

CATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGCCCAGGA 8 0 4 
I I I I I I I I I I I I I I I I I I I I 

I 

Db 1963 -ATCAT TTTATTCACACTTAATTTT 

CTATCCCATACCCCCAAAA 1921 



Qy 8 05 TGCGAACCAGTGCTGCACTAGCTGTGAGGATAATGCCCCAGCCAC 

CAGCTACTGTG 86 0 

II I I I I I I I I I I I I I I I I 

I 

Db 192 0 TAAG GCTAGCTAT 

TTAATTAGTTGGCCGCTTTTCTCTTAATTTT 1877 

Qy 861 

TGGAGTGCTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCACCAGCGGGTGAAGTACA 92 0 
I I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 18 76 TAGTGTTTCTGTTGATAATGTGTGAGT-TTGGGAAAATGCTAAGTAGC 



TTTTCA 1824 



Qy 921 CCAAGGACCATACTGTGCGCTCTACTGGG CCAGCCAAGTCT — 

CGGGATGGTG 9 71 

I I I I I I I I I I I I I II I I I I III 

I I 

Db 1823 

CTTAGAACACTGTTATTTTCTCTTTAAAGTTTTCTACCTTACATTTATTATAGCATAGTT 176 4 

Qy 9 72 AACGTACTGTCTATTGCAACG-TACA-CAAGCATGA- 

ACCCCTTGTGCTGTTTTGTGAGA 1028 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1763 ATC T T T AT AGC AT AG AT GC AG AAAG T AAG AG AG AG C T T G T — 

TTTTTCAAGAAA 1712 

Qy 1029 G CTGTGATACTCTCACCTGCC 

GAGACTGCCAGCTCAATGCCCACAAG 10 75 

I I I I I I I I I I I III I I I I I I I I 

I I I I 

Db 1711 

ACAACCCTTTAAAATACTTTCCAACCCATGAAGGGAAAAATCCTCCTTTTTTCCCCCAAG 1652 

Qy 10 76 GACCACCAGTACCAGTT-CTTAGAGGATGCAGTGAGGAACCAGCGCAAGCTCC 

TGGC 1131 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 1651 TGCATTC — T AC T T AT T AC T T TGCATT 

TTTCTCCCAAAGTC 1613 

Qy 1132 CTCACTGGTGAAGCG-CCTTGGGGACAA 

ACATGCAACATTGCAGAAGAGCACCAAG 1186 

I II III I II I I I I I I I I I I I I I I I I 

I I I I I 

Db 1612 CAAATTTATGCAAAGAAAATAGAAACAAGTTCAAATGCAA TGCATTA 

AC C AAA 1560 

Qy 1187 GAGGTTCGCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTGTG- 

CAAGTGGATGT 12 45 

I II II III II I II II I I I I I I I I I I I I I 

Db 1559 TA AAACAAGTC — TCCTTCAAATTAGGAACCAACCTAAGCATTTGTAAAGTG — 

TAG 1507 

Qy 12 46 CAAGATGGCCATCCTGCAGA — 
TCATGAAGGAGCTGAATAAGCGGGGCCGTGTGC T 12 99 

II II I I I I III I I I II II I I I I I 

I 

Db 1506 CAGAAT C AG AAT T C T T T T AAAAAT TAGATTT — G G AAC C - 

TGAACTATAT 146 0 

Qy 1300 

GGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGCAGCACTG 1359 

I I I I I I I I I I I I I I II II III 

I I I 

Db 1459 AATTCATAATTCTCATT 

TTTCTGTGGAAAATTATTTTATCTTTCTCCTGTATACCTG 14 03 

Qy 136 0 - G AC CAT G AC C AAG ATCC AGAAGC ACC AGGAGC ACATTCTGCGCTT 

TGCCTCT 1411 

I I I I I I I I III II II I I I I I I III I I 



I 

Db 1402 AAAAAATGTCCATAGGCTTAAAGGGTCA — TGCTTTTACATTC 

CTTCCATATCACA 13 49 



Qy 1412 

TGGGCTCTGGAGAGTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTACTTC 14 71 

I I I I I I I I I I I I I I I I I I I I III 

I 

Db 13 4 8 GGTACTATGAAGTAAGG AGACTTTTAGGTTTCTT 

TTTGTCTTAAAC 1303 

Qy 14 72 -CAG-CTGCACCGGGCCCTCAAGATGATTGTGGATCCCGTGG AGCCACA- 

TGGCG 1523 

I I I I I I I I II I I I I I I I I I I I I I I 

I I I I 

Db 13 02 TCAGACAGCTTTGTAAGC 

AG T AG T G T G TAG AT T AC AAG AG T T AG AC AAAAGC AGGCG 12 46 



Qy 152 4 AGA— 

TGAAGTTTCAGTGGGACCTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCA 15 8 0 
II I I I I I I I I I I III 

Db 12 45 C G AC T GAG AAG AG TTGGTGGG GGA 

GAAGC 1217 



Qy 1581 

AGATTGTGGCAGAGCGTCCTGGCACTAACTCAACAGGCCCTGCACCCATGGCCCCTCCAA 16 4 0 

I I I I I I I I I I I I I I I I I I I I I I II I I III 

I I 

Db 1216 TTGGGGCA CTTCCTGTC ACTCAACA 

CATTCCAGATCACTAAAA 1174 

Qy 1641 GA— 

GCCCCAGGGCCCCTGAGCAAGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCA 169 8 

I I I I I I I I I III I I I I I I III II II 

I 

Db 1173 AATTTCCACA CCCTCTGCATTCC CCCTTGC CCACTCC 

AGTTCCC 1130 



Qy 1699 

GGAAGGCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTC 175 8 

III III II I I I I I I I III I I I I 

I I 

Db 1129 GGTA TTT TC 

TGATTCCATATGTTGTGGTATTTACCATACTTC 10 8 8 

Qy 1759 AGGTGTGAAACGGTCCCGCTC-AGGTGAGGGCGAGGTGAGCGGC — 

CTTATGCGCAAGGT 1815 

I I I I I I I II I I I I I I III I III 

Db 1087 TC — TCCCTCACTAGGCTCTGGCAAG 

ACTGCTTCAGAGGGGATGCAT 10 43 



Qy 1816 GCCACGAGTGAGCCTTGAACGCCTGGACCTGGACCTCACAGCTGACAG — 

CCAGCCACCC 18 73 

II II II I I I I I I I I I I I I I I I I I I I 

Db 10 42 TCCTTTAGATTGCAC — AAAGC — GGAGCTGGG- 

AAAATGGCTGGCAGTTTCAGAATCTA 988 



Qy 18 74 GTCTTCAAGGTCTTCCCAGGC 

AG T AC C AC T G AGG AC T AC AAC C T T AT TGT T AT TGAA 1930 



ACATCCATGT- 



Qy 1931 

CGTGGCGCTGCCGCTGCAGCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGT 199 0 
II I I I I I I I I I 

I 

Db 9 49 CC CTACC CGCC 

CCCCCGCT 931 



Qy 1991 GCCCCACCCCTG— 

GCTGGCATGGCCATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTG 2 0 4 8 

III I I I I I I I I I I I I II II I I I I 
Db 93 0 — CCCGCCCCTGCAGCTGGC-TGACC — 
TGTCT 903 



Qy 2 0 49 GAGCCCCTCCTACTGCCACTGAGGGCCCTGAGACCAAACCTG — 

TGCTTATGGCTCTTGC 2106 

III I I I I I I I I I I I I I I III I I I I I I I I I I 

I I I 

Db 9 02 CACCCACTGCTGGC-CTATCGAAC 

GGCCAGGACTGTCTGGTTTTGGCTCGTGC 851 

Qy 2107 

GGAGGGTCCTGGTGCTGAGGGTCCCCGCCTGGCCTCACCTAGTGGCAGCACCAGCTCAGG 2166 

II I I I I I I I I I I I II II III 

I 

Db 850 CTTT — GTCCATGTCTGGCTT AGTTCCT- 

CTCTG- 820 

Qy 2167 

GCTGGAGGTGGTGGCTCCTGAGGGTACCTCAGCCCCAGGTGGTGGCCCGGGAACCCTGGA 2226 

II I I I I I I I I I I III III 

I I 

Db 819 TCTATGCTTGCCTCTACCCCC ACCG 

CCCCAGG- 78 8 



Qy 222 7 TGACAGTGCCACCATTTGCCGTGTCTGCCAGAAGCCAGGC- 

GATCTGGTTATGTGCAACC 22 85 

II I I I I I I I I II I I I I I I I I I II 

Db 78 7 CGG CACAA GTGTTTG GC C AC AC AAAAC TAG AG AT - • 

AGAA 750 



Qy 2286 

AGTGTGAGTTTTGTTTCCACCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGG 23 45 
I I I I I I I I I I II II I I I I I I 

Db 7 49 AAGGTGGTAAAAACTTCAAACTTTTCTAAATTCT 

CCAACAGTTTATTTCT 70 0 



Qy 23 46 AGGAGTGGAGCTGCTCACTCTGCCATGTGCTCCCT- 

GAC 2383 

I I I I I II III I I I I I I I 

Db 699 TGTGAATTTCTTCCTTCT- 

T T AAAT AC T C C AT T T TAAG AAAAC AAAAAAAT T AAT T A 644 



Qy 23 8 4 -CTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTGCAGACAG- 

CACTGGCGTGGTGG 2 441 

I I I I I I I I I I I I I I I I I I I I I I II 

Db 6 43 TCTAAAGGCAAAGAATGGAA — 



AGCAACCTTTGTGTTCCTTATAATAACTGACTT 591 



Qy 2 4 42 CCAAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGC- 

CCTATTCTGT 2500 

I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I I I I I 

Db 59 0 CATAACTCTCTCCAG CTGC-GTTATGGGA- 

TGTGTATAAAAAGCTTCTGTTCTG- 539 

Qy 2501 CACGAACCCTGCCGCCCCCTGCATCAGC-TGGCTACCGACTCCACCTTC 

TCCCTGG 2555 

I I I I I I I I I I I I I III I I I I I I I III 

Db 53 8 — AGAACAAAGGAG- 

C AC G T GC AG AAAT GAG AC G AAAAAAT C C AC TGAC AGT AT TC C AT T 4 82 
Qy 2556 

ACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGAAGTTGT 2 615 

II I II I I I I I I I I I I I I I I I III 

I 

Db 481 AC AC AAAT T AC T T AAAAGAT T T TAGT- 

CAAGCCCCTCAACAGATTCAATTTT 431 

Qy 2616 CACCT-CCCTACAGCTCCCCA 

CAGGAGTTTGCCCAGGATGTGGGCCG 2661 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 430 

AAAATGGCTTTTAGTTAAAAAAAAAAAATTGAAAGTGCTTACCCAGTAAAAGAACCGAAG 3 71 

Qy 2662 CATGTTC AAGCAATTCAACA 

AGTTAACTGAGGACAAGG 2699 

I I I I III I I I I I I I I I I I I I 

I I 

Db 370 

TAGTCCTGAACTGTTACGTAAGACTTTTTACAGTTGGATCTTTGTCAAAAGGGGATGGGG 311 

Qy 2 70 0 CAGACGTGC-AGTCCATCATCGGCCTGCAGCGCTTCTTCGAGACGC 

GCATGAACGA 2 754 

I I I I I I I I I I I I II I I I I I I II II II 

I I 

Db 310 GTGATGGGAGAAAGCAGCAACGACAATCAAAAAAG-TTCGAGCTGCTGTGGCTAGAG- 

GA 2 53 

Qy 2 755 GGCCTTCGGTGACACCA AGTTCTCTGCTGTGCTGGTGGAGC- 

CCCCGCCGATGAG 2808 

I I I I I I I III I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2 52 CAACTTCTGTGTTTCCAGATAGGATTCT-TGCTGTAGAAATGGAACTTCCAGCC 

AG 198 

Qy 2 8 09 C-CTGCCTGGTGCTGGCCTGAGTTCCCAGGAGC 

TGTCTGGTGGCCCTGGT 2857 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 19 7 CACAGCAT CCTG T C C C AG TAG AG AAAT GAGTTTGTCAGT T AAAAC - 

AAA 150 

Qy 2858 

GATGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTCTGTTCTCTGTCCTG 2 917 
I II II I I I I I I I I I I I I I II II I 

Db 149 AAAAAAAT TAG AT AC T GG AAC C C AGGCTAGACGA 

GGTATTGAAC 106 



Qy 2918 TCACCCCA 

TCCCCACTCCCCTGGTGGCCTGACTCCCACTCCCTGGTGGCCCCATCC 2 9 73 

I I I I I III I I I I I I II I I I I I I I I I I I I I I 

Db 105 -CGCGCCAGATTTCCTTGCAGCCCTGTCTGCTCAGCTCGCATTCC — 
AGTAGGCCAA 52 

Qy 29 74 CCCAGTTCCTCACGATATGGTTTTTACTTCTGTGGATTTAATAAAAAAAACTTCAC — 

CA 3031 



Db 51 — CAGT GCTCGA GG 

AATTCCGCAGCTTTTAGAGCAGAAGTAACACTTCC 5 

Qy 3032 GTTC 3035 

I I I 

Db 4 GTAC 1 
Qy 223 

CCCAGGAGGCGCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAG 2 82 



Db 1 CCTA CT-GGTCTTGTGTGGGTGTTTTC 

CAGCACGGAA- 36 

Qy 283 

TTGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGG 3 42 
I I I I I I I I I II I I I I III I I I I I I 

Db 3 7 AAG T C G AG AT AAT - T GC C AAT G AT C AG - 

GGAAACCGAACCACTCCAAGC 83 



Qy 343 

CGCCCCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTC 4 02 
I I I I I I I I III II III I 

I I 

Db 8 4 TATGTC-GCCTTTACGGACACTG-AACGGTTG- 



Qy 403 

GGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCAC 462 

II I I I I I I I I I I III III 

I I I I 

Db 117 GGT GATGCC GCAAAGAATCAAGTTGC 

AATGAACCCCAC 154 



Qy 463 

CGCCCCTTCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGG 52 2 
I I I I I I I I I I I I II 

Db 155 C AACACAGTTTTTG ATGCCAAA 

CGTCTGATTG 186 



Qy 523 

CGGCGCCGAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAGAGAGCGCCTGCGACC 5 82 

I I I I I I I I I I I I I I I III 

I 

Db 187 -GACGCAGATTTGATGATGCTGTTG TCCAG- 



Qy 583 

CGAGAGGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCC 6 42 

I I I I I I I I I I I I I I I II I III II 

I I I I 

Db 218 TGATATGAAACATTG GCCCTTTATG GTGGTGAATGATGCTGGC- 

AGGCC 265 



Qy 643 

CGCGGCCCCCGCCGCCGCCAACAGCTCGGGGGACGGCGGGGCGGCGGGCGACGGCACCGT 7 02 
I I I I I I I III I I I I III II 

I 

Db 266 CAAGGTCCAAGTAGAATACAA GGGA 

GAGACCAAAAGCTTCTATCCAGA 313 



Qy 7 03 GGTGGACTGTCCCGTGTGCAAGCAACAGTGCTTCT- 

C C AAAG AC AT C G T GG AG AAT T AT T 761 

III I I I I I I I I I I I I I I I I I I 

Db 314 GGAG GTGT CTTCTATGGTTCTGACAAAGA — 

TGAAGGA 349 



Qy 762 

TCATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCA 821 
I I I I I I I III I I I I I I II 

I 

Db 350 AATTGCAGAA GCCTAC CTTGGGAAG AC- 

TGTTACC 3 83 



Qy 822 CTAGCTGTGAGGATAATGCCCCAGCCACCAGC- 

TACTGTGTGGAGTGCTCGGAGCCTCTG 880 

I I I I I I I I II I I I I I I I I I I I II 

Db 384 AATGCTGTG GTCACAG-TGCCAGCTTACTTTAATGA 

CTCTCAGCGTC — 429 



Qy 881 

TGTGAGACCTGTGTAGAGGCGCACCAGCGGGTGAAGTACACCAAGGACCATACTGTGCGC 9 4 0 

I I I I I I I I I I I I I I I 

Db 43 0 AGGC TACCAAAG ATGCTG 

GA 449 

Qy 941 

TCTACTGGGCCAGCCAAGTCTCGGGATGGTGAACGTACTGTCTATTGCAACGTACACAAG 10 0 0 

I I I I I I I I I I I I I I I I I I III II 

I I 

Db 450 ACTATTG CTGGTCTC AATGTACT 

TAGAATTATTAATGAG 48 8 

Qy 1001 

CATGAACCCCTTGTGCTGTTTTGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCCAG 106 0 

I III I I I I I I I I III 

I 

Db 489 C CAACT GCTG 

CTGCTATTGCTTACGG 514 

Qy 1061 

CTCAATGCCCACAAGGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAGCGC 112 0 

III I I I I I I I I I I I I I I I I I I I I II 

I 

Db 515 CTTA — GACAAAAAGG TTGGA 

GCAGAAAGAAACGTGCTC 551 

Qy 1121 AAGCTCCTGGCCTCACTGGTGAAGCGCCTTGGG 

GACAAACATGCAACATTGCAGAAG 1177 

I II I I I I I I I I I II I I I I I I I I I I I I I I I I II 

I 

Db 552 AT-CT-TTGACCTGGGAGGTG — GCACTTTTGATGTGTCAATCCT- 

CACTATTGAGGATG 6 06 

Qy 1178 AGCACCAAGGAGGT TC- 

GCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTG 1232 

III I I I I I II I I I I I I III II II I I I I I 

I 

Db 607 - G AAT CTTTGAGGT C AAG T C T AC AGC TGGAGACACCC 

ACTTGGGTGGAGAAGATTT 661 

Qy 1233 TG-CAAGTGGATGTCAAGATGGCCATCCTGCAGATCATGAAGGAGCTGA — 

ATAAGCGGG 12 89 

I I I I I I I I I I I I I III I I I I I I I I I I I 

Db 662 TGACAA CCGAATGGTCAAC 

CATTTTATTGCTGAGTTTAAGCGCA 705 

Qy 1290 

GCCGTGTGCTGGTCAATGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGC 13 49 

II I I I I I III I II I I I I I I I I I I I I 

Db 706 AGCATAAGAAGGACAT CAGTGAG-AACAAGAGAGCTGTAAGA- 

CGCCT 751 

Qy 1350 

GGCAGCACTGGACCATGACCAAGATCCAGAAGCACCAGGAGCACATTCTGCGCTTTGCCT 14 09 

I I I I I I I I I I I I I I I I I 



CTTGTGAACGTGCTAAGCGTACCCT 78 4 



Qy 1410 CTTGGGCTCTGGAGAGTGACAACAACACAGCCC-TTTTG 

CTTTCTAAGAAG 1459 

II II I I I I I I I I I I I I I I I I I 

I I I I 

Db 785 CT CTT 

CCAGCACCCAGGCCAGTATTGAGATCGATTCTCTCTATGAAG 831 

Qy 146 0 TTGATCTACTTCCAG — CTGCA 

CCGGGCCCTCAAGATGATTGTGGATCCCGTGGAG 1513 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 832 -GAATCGACTTCTATACCTCCATTACCCGTGCCC GATT 

TGAAGAA 8 75 

Qy 1514 CCACATGGCGAGATGAAGTTTCAGTGGGACCTCAATGCCTGGACCAAG- 

AGTGCCGAGGC 15 72 

I III II II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 8 76 CTGAATGCTGACCTG TTCCGTGGCAC CCTGGACCCAGTAGAG- 

AAAGC 922 



Qy 15 73 CTTTGGCAAGATTGTGGCAGAGC — GTCCTGGCACTAACTCAACA 

GGCCCTG 1622 

III I I I I I II II II I I I I I I I I I 

Db 923 CCTT— CGAGAT 

GCCAAACTAGACAAGTCACAGATTCATGATATTGTCCTGGTTGG 9 76 



Qy 1623 CACCCATGGCCCC TCCAAGAG 

CCCCAGGGCCCCTGAGCAAGCAGGGC 1669 

I I I I I I I I III II I I I I I I II II 

I 

Db 977 

TGGTTCTACTCGTATCCCCAAGATTCAGAAGCTTCTCCAAGACTTCTTCAATGGAAAAGA 1036 
Qy 1670 TCTGGC 

AGCAGCCAGCCCATGGAGGTGCAGGAAGGCTATGGCTTTGGGTCAGGAGA 172 5 

III I I I I I I I I I II I I I I I I I I I I III I 

I I 

Db 103 7 ACTGAATAAGAGCA-TCAACCC TGATG — AAGCTGTTGCTTATGGT — 

GCAGC 1084 



Qy 172 6 TGATCCCTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCAG- 

GTG 1784 

II I I I I I I I I II I I I I I I I I III 

Db 10 85 TG TCCAG-GCAG CCATCTTGTCTGGAG 

ACAAGTCTGAGAA 1123 



Qy 1785 AGGGCGAGGTGAGCGGCCTTATGCGCAAGG- 

TGCCACGAGTGAGCCTTGAACGCCTGGAC 1843 

I I I I I II I I I I I I I I I I I 

Db 112 4 TGTTCAAGATTTGC TGCTCTTGGATGTCAC- 

TCCTCTTTCCCTTGGTA 1170 



Qy 18 44 CTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCAAG- 

GTCTTCCCAGGCAGTACCAC 19 02 

I I I I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 1171 TTGAAACT-GCTGGTGGAGTCATGACTGTCCTCATCAAGCGTAATACCA- 



CCATTCCTAC 122 8 



Qy 19 03 TGAG — 

GACTACAACCTTATTGTTATTGAACGTGGCGCTGCCGCTGCAGCTACCGGCCAG 196 0 

I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1229 C AAG C AG AC AC AG AC C T T CACTACCTATTCTG- 

ACAACCAG 126 8 

Qy 1961 CCAGGGACTGCGCCTGCAG GAACCCCTG- 

GTGCCCCACCCCTGGCTGGCATGGC 2 013 



Db 1269 CCTGGTG-TGCTTATTCAGGTTTATGAAGGCGAGCGTGCCATGACAAAGGATAACA — 

AC 1325 

Qy 2014 C— 

ATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCTCCTACTGCCACTGAG 2 0 71 

I I I I I I I I II II II II I I I I I I I II 

Db 1326 CTGCTTGGCAAGTTTGA ACT— 

CACAGGCATACCTCCTGCACCCCGAGGT 13 73 



Qy 2 0 72 GGCCCTGAGACCAAACCTGT- 

GCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCC 213 0 

I I I I I I I II II I I I I I I I I I I I I I I I 

Db 13 74 GTTCCTCAGATTGAA GTCACTTTTGAC 

ATTGATGCCAATGGTAT 1417 



Qy 2131 

CCGCCTGGCCTCACCTAGTGGCAGCACCAGCTCAGGGCTGGAGGTGGTGGCTCCTGAGGG 219 0 

II I I I I I I I I I I II II III III I 

Db 1418 ACTCAATGTCTCTGCT-GTGG — 
AC AAG AG T AC GGG AAAAG AG AAC AAG AT T AC T A 1470 

Qy 2191 

TACCTCAGCCCCAGGTGGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCCGTGT 2 2 5 0 



Db 1471 T C AC T AAT G AC AAG GGCCGTTTGAGCAAGGAAGACATTG — AACGTATG 

GT 1519 

Qy 2251 CTGCCAGAAGCCAGGCGAT 

CTGGTTATGTGCAACCAGTGTGAGTTTTGTTTCCA 2304 

I I I I I I I I I III I I I I I I I I I I I I I I I I 

Db 152 0 C— CAGGAAGCTGAGAAGTACAAAGCTGAAGATGAG-AAGCAGAGGGA- 

CAAGGTGTCAT 15 75 



Qy 2305 

CCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGGAGGAGTGGAGCTGCTCACT 236 4 

I I I I I I I I I I I I I I I III 

I I 

Db 1576 CCAAGAAT-TCACTTG — AGTCCT ATGC- 

CT 1602 



Qy 2365 

CTGCCATGTGCTCCCTGACCTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTGCAGA 2 42 4 
I I I I I I I I I III I I I I I I I 

I I 

Db 16 03 T C AAC AT G AAAGC AACTGTTGAAGATG AGAAACTTCA 

AGG 1642 



Qy 2425 

CAGCACTGGCGTGGTGGCCAAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACT 2 4 8 4 



Db 16 43 CAAGATTAACGATGAG 

GACAAACAGAAGA T 1672 

Qy 2 485 GCTGGCCCTATTCTGTCACGAACCCTGCCGCCCCCTGCATCAGCTGGCT 

ACCGACTC 2541 



Db 16 73 TCTGGACAAG TGTAATGAAAT 

TATCAACTGGCTTGATAAGAATC 1716 

Qy 2542 

CACCTTCTCCCTGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCT 2 6 01 
I I I I II I I I I I I I I I I I I I I I 

Db 1717 AGACTGCT GA GAAGGAA 

G AAT T T G AAC AT C AAC AG AAAG AG 175 7 

Qy 26 02 CCAGGAG- 

AAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCC 2 66 0 

I I I I I I I I I I I I I I 1 I I I I I I III I I I I 

Db 1758 -CTGGAGAAAGTTTGCAAC-CCCATCATCACCAAGCTGTA 

CCAGAGTGCAGGAG 18 09 

Qy 2661 G C AT G T T C AAGC AAT T C AAC AAG T T AAC T GAG G 

ACAAGGCAGACGTGCAGTCCATC 2 716 



Db 1810 GCATG — CCAGGAGGAATGC CTGGGGGATTTCCTGGTGGTGGAGC — 

TCCTCC 1858 

Qy 2 717 ATCGGCCTGCAGCGCTTCTTCGAGACGCG- 

C AT G AAC GAGGCC T T C GG T G AC AC C AAG T T 2 775 

III I I I I I I I I I III III I I I I I I I I I I 

Db 1859 CTCTG GTGGTGCTTCCTCAGGGCCCACCATTGAAGAGG 

TTGATTAAGCCAA 19 09 

Qy 2776 CTCTGCTGTGCTGGTGGAGC 

CCCCGCCGATGAGCCTGCCTGGTGCTGGCCTGAG 2 829 



Db 1910 — CC-AAGTGTAGATGTAGCATTGTTCCACACATTTAAAAC-ATTTGAAG — 

GACCTAAA 1963 

Qy 2830 

TTCCCAGGAGCTGTCTGGTGGCCCTGGTGATGGCCCCTGAGGCTGGAGCCCCCATGGCCA 2 8 89 



Db 196 4 TTCGTAGCAAAT-TCT-GTGGC AGTTTTAAAAAGTTAAGCTG 

CTATAGTAA 2 012 

Qy 2890 

GCCCAGCCTGGCTCTGTTCTCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGCCTGAC 2 9 49 



Db 2 013 G — TTACTGGGC ATTCTCAATACT- 

TGAATATGGAACATATGCACAGGGGAAGGAAA 2 066 



Qy 2950 TCCCACTCCCTGGTGGCCCCATCCCCCAGTTCCTCACGATATGGTTTTT 

ACTTCTG 3005 

I I I I I I I I I I I I I I I I I I III 

Db 2 06 7 TAACATT GCACTTT 

ATAAACACTGTATTGTAAGTGGAAAATGCAA 2111 

Qy 3 0 06 TGGATTTAATAAAA AAAACTT — CACCAGTTCAAAAAAAAAAAAAAAAAAAAA 

3056 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2112 T G T C T T AAAT AAAAC TAT T T AAAAT TGGC ACC AT AAAAAAAAAAAAAAAAAAAAAAAA 
2169 
Qy 114 GTGCGG GTTTCGGCGGCGGCTGAGGAAGAAGC- 

GCGGGCGGCGCCTTCGGGAGGCG 16 8 

I I I I I I I I I III II I I I I I I I I I III 

I I 

Db 1 GTACGGAAGTGTTACTTCTGC-TCT AAAAGCTGCGG— AATTCCT- 

CG 44 



Qy 169 

AGCAGGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGCGGGCAGCGGCCCAGG 2 2 8 

I I I I I I I I I I I I I I I I III I I I I I I I I I I I I 

I 

Db 45 AGCA CTGTTGGCC-TACTGGA-ATGCGAGCTGAGC-AGACAGG — 

GCTGCAAGGA 9 4 



Qy 229 

AGGCGCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCGGCCCAGCAGTTGGCG 2 8 8 

I I I I I I I I I I I I I 



Db 95 AATC TGGC- 

GCGGTTCAATA 113 



Qy 289 

GCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCCCCCCCTGGGCGCCCC 3 4 8 

I I I I I I I I I I I I I I I I I III II 

Db 114 -C CTCGTCT-AGCCT GGGTTCC- 

AGTATCTAATTTTTTTTTTG 153 

Qy 349 

CGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAGCAGCGGCCTCGGCCGC 4 0 8 
I I I I I I III 

Db 154 TTTTAACTGACAAACTC 

ATTT 174 

Qy 409 

CTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAAAGCGCTCCACCGCCCC 46 8 

I I I I I I I I I I I III I I I I I I I I I I I I I I I 

Db 175 CTCT ACTGGGACAGGAT GCTGTGCTGGCTG-GAA 

GTTCCA T 214 

Qy 469 TTCGGCCGC- 

AGCCTCGGCCTCTGCCTCAGCCGCGGCGTCGTCGCCCGCGGGGGGCGGCG 52 7 

III I I I I I II II II I I I I I I I I I 

I 

Db 215 TTCTACAGCAAGAATC CTATCT-GGAAACACAGAAGTTGTCCTC 



Qy 52 8 CCGAGGC-GCTGGAGCTGCTGGAGCACTGC — 

GGCGTGTGCAGAGAGCGCCTGCGACCCG 584 

II I I I I I I I I I I II I I I I I I I I III 

I 

Db 261 CCACAGCAGCTCGAACTTTT TTGATTGTCGT-TGCTGCTTTCTCC- 

CATCAC- 310 

Qy 585 

AGAGGGAGCCCCGCCTGCTGCCCTGTTTGCACTCGGCCTGTAGTGCCTGCTTAGGGCCCG 6 4 4 
I I I I I I I I I I II II II II 

Db 311 CCCCATC CCCTTTTGACA AAGATCCAACT- 

GTAAAA 345 

Qy 6 45 CGGCCCCCGCCGCCGCCAACAGCTCGGGG-GACGGCGGGGC 

GGCGGGCGACGGCAC- 699 

II II I I I I I I I I I II III I I I I I 

I I I I 

Db 3 46 AGTCTTACG TAACAGTTCAGGACTACTTCGGTTCTTTTACTGG- 

GTAAGCACT 39 7 

Qy 70 0 — CG TGGTGGACTGT CCCGTGTGCAA — GCAACAGT 

GCTTCTC 738 



Db 398 

TTCAATTTTTTTTTTTTAACTAAAAGCCATTTTAAAATTGAATCTGTTGAGGGGCTTGAC 45 7 

Qy 739 

CAAAGACATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAGGCTGCCACCGACGC 79 8 
III II I I I I I I I I I I I I I I I I I III 

I 

Db 45 8 TAAAATCTTTTAAGTAATTTGTGTAAT — GGAATACTGTCAG — 



TGGATTTTTTCGTCTC 513 



Qy 799 C CAGGATGCGAACCAGTG CTGCAC — 

TAGCTGTGAGGATAATGCCCCAGCCAC 8 49 

I I I I I II II I I I I I I I I I I I I I I 

I I I 

Db 514 ATTTCTGCACGTGCTCCTTTGTTCTCAGAACAGAAGCTTT 

TTATACACATCCCAT 56 8 

Qy 850 CA- 

GCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCACCAGC 9 0 8 

III III I I I I I I II I I I I II III I I I I 

Db 569 AAC G C AG C T G GAG AG AG T T AT - G AAG — TCAGT TATTATAAGGAACA- 

CAAA 616 

Qy 909 

GGGTGAAGTACACCAAGGACCATACTGTGCGCTCTACTGGGCCAGCCAA G 95 8 

I I I I I I I I I I I I I I I I I I II II 

Db 617 GGTTGCTTT CCATTCTTTGC-CTTTA 

GATAATTAATTTTTTTGTTT 661 

Qy 959 

TCTCGGGATGGTGAACGTACTGTCTATTGCAACGTACACAAGCATGAACCCCTTGTGCTG 1018 

III I I I I I I II I III I I I I I I I I III III 

I 

Db 662 T C T T AAAATGGAGT AT T T AAAGAAGGAAG AAA — T T C AC AAG AAAT AAAC TG T T GG — 

AG 717 

Qy 1019 TTTTGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCCAGCTCAA 

TGCCC 1070 

II III I I I I I I I I I II I I I I I I I II 

I I 

Db 718 AATTTAGAAAAGTTTGAAGTTTTTACC-ACC 

TTTTCTATCTCTAGTTTTGTGTGGCC 773 

Qy 1071 

ACAAGGACCACCAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAGCGCAAGCTCCTGG 113 0 

I I I I I I I I I III I I I I I I I I I I I I I I II 

I 

Db 774 A AACACTTGTGCCG CCTGGG GCGGTGGGGGTAGAG-GCAAG — 

CATAG 818 

Qy 1131 CCTCACTGGTGAAGCGCCTTGGGGACAAACATGCAACATTGCAGAAGAGC 

ACCAAGG 1187 

I I I I I I I I I I I I I I I I II I I I I I II 

I I 

Db 819 AC AGAGAGGAACT AAGCCAGACATGGACAAAGGC — 

ACGAGCCAAAACCAGA 86 8 

Qy 1188 AGGTTC — 

GCAGCTCAATCCGCCAGGTGTCTGACGTACAGAAGCGTGTGCAAGTGGATGT 12 45 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 869 CAGTCCTGGCCGTTCGATAGGCCAG 

CAGTGGGTG- 9 02 

Qy 1246 

CAAGATGGCCATCCTGCAGATCATGAAGGAGCTGAATAAGCGGGGCCGTGTGCTGGTCAA 13 05 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 9 03 -AGACAGGTCAGCCAGC TGCAGGGGCGG GAGCGGGG 



GGGCGGGTAGG 9 48 



Qy 1306 

TGATGCCCAGAAGGTGACTGAGGGGCAGCAGGAGCGCCTGGAGCGGCAGCACTGGACCAT 1365 



Db 9 49 GACATGGATGTG — TGAGGTGCT-CATGCGTGC- 

GATCGT 984 

Qy 1366 G AC C AAG AT C C AG AAGC - 

ACCAGGAGCACATTCTGCGCTTTGCCTCTTGGGCTCTGGAGA 142 4 



Db 9 85 GA-CTAGATTCTGAAACTGCCAG — 

CCATTTTCCCAGCTCCGCTTTGTGCAATCTAAAGG 1041 

Qy 1425 

GTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTACTTCCAGCTGCACCGGG 14 8 4 



Db 10 42 AATGCATCCC CTCTGAAGCAG-TCT — TGCCAG- 

AGCCTAGTG 1080 

Qy 1485 CCCTCAAGATG- 

ATTGTGGATCCCGTGGAGCCACATGGCGAGATGAAGTTTCAGTGGGAC 15 43 

I I I I I I I I I I I I I I I I I III I I 

Db 1081 AG G GAG AG AAG T AT GG T AAAT AC C ACAACAT ATG — 

GAATCAG 1121 

Qy 1544 CTCAATGCCTGGACCAAGAGT — GCCGAGGCCTTTGGCAAGATTGTGGCAG- 

AGC 1595 



Db 1122 - 

AAAAT ACCGGGAAC TGGAGTGGGCAAGGGGGAATGCAGAGGGTGTGGAAAT T T T T T AGT 118 0 

Qy 1596 GTCCTGGCACTAACTCA ACAGGCCCTGCACC CATGGCCCCTCCAAGAGC — 

CCC 1647 



Db 1181 

GATCTGGAATGTGTTGAGTGACAGGAAGTGCCCCAAGCTTCTCCCCCACCAACTCTTCTC 12 4 0 

Qy 16 48 AGG-GCCCCTGAGCAAGCAGGGCTCTGG 

CAGCAGCCAGCCCATGGAGGTGCAGGAA 1702 

I I I I I I I I I I I I I I I I I I I I 

Db 1241 

AGTCGCGCCTGCTTTTGTCTAACTCTTGTAATCTACACACTACTGCTTACAAAGCTG 12 9 7 

Qy 1703 

GGCTATGGCTTTGGGTCAGGAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTCAGGT 1762 
III III III I I I I I I I I I I I I I I I I 

Db 129 8 T C T GAG T T T AAG AC A — AAAAGAAACCTA — AAAGT 

CTCCTTACTTC 1340 

Qy 1763 

GTGAAACGGTCCCGCTCAGGTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCACGA 182 2 

I I I I I I I I I I I II II III 

Db 13 41 ATAGTAC — CTGTGATATGGAAGGAATG 

TAAAAGCA 13 7 4 



Qy 1823 

GTGAGCCTTGAACGCCTGGACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCAAG 18 82 
I I I I I I I I I I I I I I I I I I I II I III I 

Db 13 75 -TGACCCTTTAA-GCCT 

ATGGACATTTTTTCAGGTATACAGGAGAAAGATAAAATA 142 8 



Qy 18 83 GTCTTCCCAGGCAG T AC C AC T G AGG AC T AC AAC C T - 

TATTGTTATTGAACGTGGCGCTGC 19 41 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1429 ATTTTCC ACAG- 

AAAAATGAGAATTATGAATTATATAGTT 146 7 



Qy 1942 

CGCTGCAGCTACCGGCCAGCCAGGGACTGCGCCTGCAGGAACCCCTGGTGCCCCACCCCT 2 0 01 

I I I I I I II II I I I I I I I 

I I 

Db 146 8 CAGGTTCC 

AAATCTAATTTTTAAAAGAATTCTGATTCTGCTACACT 1513 

Qy 2002 

GGCTGGCATGGCCATTGTCAAGGAGGAGGAGACGGAGGCTGCCATTGGAGCCCCTCCTAC 2 061 
I I I I I I I I I I I I 

I I I I I 

Db 1514 TTACAAATGCTTAGG TTGGT 

TCCTAA 1539 



Qy 2 062 TGCCACTGAGGGCCCTGAGACCAAACCTG — 

TGCTTATGGCTCTTGCGGAGGGTCCTGGT 2119 



Db 1540 T TTGAAG GAGACTTGTTTTATTTGGTTAATGC- 

ATTGCATTTGAACTTGTT 1589 

Qy 2120 GCTGAGGGTCCCCGCCTGGCCTC— 

ACCTAGTGGCAGCACCAGCTCAGGGCTGGAGGTGG 2177 



Db 1590 TCT-ATTTTCTTTGCATAAATTTGGACTTTG-GGAGAAAAATGCAAAGTAAT- 

AAGTAGA 16 46 

Qy 2178 TGGCTCCTGAGGGTACCTCAGCCCCAGGTGGTGGCCCGGGAACCCTGGAT — 

GACAGTGC 2 235 

I I I I I I I I I I III I III I I I I I I 

Db 16 47 ATGCACTTGGGGGAAAAAAGG AGGATTTTTCCC 

TTCATGGGTTGGAAAGT — 1696 



Qy 2236 CACCATTTGCCG TGTCTGCCAG — 

AAGCCAGGCGATCTGGTTATGTGCAACCAGTGT 2290 

III I I I I I I I II I I I I III 

Db 169 7 ATTTTAAAGGGTTGTTTTCTTGAAAAAACAAGCTCTCT- 

CTTAC 1739 



Qy 2291 GAGTTTTGTTTC — 

CACCTGGACTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGGAGG 23 4 8 

I I I I I I I I I I I I I I III 

Db 1740 TTTCTGCATCTATGCTATAA — AGATAACTATGC- 

TATAATAAATGTAA 1785 



Qy 23 49 AGTGGAGCTGCTCACTCTGCCATGTGCTCCCTGACCTGAAGGAGGAGGATGGCAG 



II II III 



Db 1786 GGTAGA AAAC T T TAAAG A- 

G AAAAT AAC AG T G T T C 1819 



Qy 2 406 TCAGCCTGGATGG- 

TGCAGACAGCACTGGCGTGGTGGCCAAGCTCTCACCAGCCAACCAG 2 46 4 

III III III I I I I I I I I I I I I I I I I 

I I 

Db 182 0 T AAG — T G AAAAGC T AC — TTAGCATTTTC CCAAACTCACACAT- 

TATCAA 1865 



Qy 2465 

CGGAAATGTGAGCGTGTACTGCTGGCCCTATTCTGTCACGAACCCTGCCGCCCCCTGCAT 2 52 4 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1866 CAGAAA C AC T A- AAAAT T AAGAGAAAAGCGGCC AAC T - 

AAT 1904 

Qy 2525 CAGCTGGCTACCGACTCCACCTTCTCCCTGGACCAGCCCGG — 

TGGCACCCTGGATCTG- 2581 

I I I I I I I I I I I II I I I I I I I I 

I I 

Db 19 05 TAAATAGCTA GCCTTATTTTGGG 

GGTATGGGATAGAAAATTAAGTGTGA 1953 

Qy 2582 — ACCCTGATCCGTG CCCGC CTCCAGGAGAAGTT GTCA- 

CCT 2620 

I I I I I I I I I I I I III I I I I I I I I I 

I I I 

Db 1954 

ATAAAATGATACTTGGGAATGTTTCTCCCTCGTACCACAAAGGATGTTAGTGGTCAGCCT 2 013 

Qy 2621 

CCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCAAGCAATTCAAC 2 6 8 0 

I III II I I I I I I I I I I I I I I I II 

Db 2014 ACGAGTTAATCCTTCCTAG CATGGCTCTGAGCC 

TTCATGCCGAGCAG- 2 06 0 

Qy 26 81 AAGTTAACTGAGGACAAGGC-AGACGTGCAGTCCA- 

TCATCGGCCTGCAGCGCTTCTTCG 2 73 8 

I I I I I I I I I I I II I I I I I I I I I I II I I I I I I 

I 

Db 2 061 ACGTTATTC ACATGACGATTCGAAAAGTCCATTCATATATCT 

CGCTACCTGG 2112 



Qy 2739 

AGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGGTGGAGCCCC 2 79 8 

I I I I I I I I I I I I I I I I I I I III I I I I 

I 

Db 2113 ATTTG AAT AG AAAC C AGACAGCAA-TTCTTT — AGTTCC 

AGCCAC 2154 



Qy 2799 

CGCCGATGAGCCTGCCTGGTGCTGGCCTGAGTTCCCAGGAGCTGTCTGGTGGCCCTGGTG 2 85 8 
I I I I I I I I I I I I I I I I I I I I I 

Db 2155 C ATTCGCC — CC ACTGG 

ACAATAGCGATTTGTTAGCACAG 2192 



Qy 2859 

ATGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTCTGTTCTCTGTCCTGT 2 918 
III I I I I I I III I I I I I I I III 

Db 2193 — AGTCAC — AGGCTG 

TGGGACACAAAGCTTGGAGCTG 2226 

Qy 2919 CACCCCATCCCCACTCCCCTGGTGGC — 

CTGACTCCCACTCCCTGGTGGCCCCATCCCCC 2976 



Db 2227 

CAGAAAGATGGGGGATTCAGAGAACAGGAAATTACAGGCTCGATGCAC- 2 2 7 4 

Qy 2977 

AGTTCCTCACGATATGGTTTTTACTTCTGTGGATTTAATAAAAAAAACTTCACCAGTTCA 3 036 



Db 2 2 75 TCCTGGCAGCTCTGAGAATACAAAAAAAAAAAAAAAAAAAAAGGC — 

CACATGTGCT 232 9 

Qy 3037 AAA 3039 

I 

Db 2330 CGA 2332 
Qy 4 7 CTAGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGC- 

GCGCGGCGGGCAGCGGCCCAGGA 105 



Db 2 0 83 CTAGC GGCC GCGACCTGCAGCTCG- 

AGCACATGT 2 051 



Qy 106 GGCGCGTGGTGCGGGT 

TTCGGCGGCGGCTGAGGAAGAAGCGCGGGCGG 153 

I I I I I I I I I I I 

Db 2050 GGC-CTTTTTTTTTTTTTTTTTATTTTGC- 

TGCTCTGGTGGTTAGT 2002 



Qy 154 

CGCCTTCGGGAGGCGAGCAGGCAGCAGTTGGCCGTGCCGTAGCAGCGTCCCGCGCGCGGC 213 
I I I I I III I I I I I I I I I I 

Db 2001 CTCCTACTATAAACATG ATTTTCCTT 

CTTCCCTT 1968 



Qy 214 

GGGCAGCGGCCCAGGAGGCGCGTGGCGGCGCTCGGCCTCGCGGCGGCGGCGGCGGCAGCG 2 73 
I I I I I I II I II III 

Db 1967 CAGAGTCC TGTC AATCTTTACTTTTTTTCTCATTAAT — 

AT AC AG AAG T T 192 0 



Qy 274 

GCCCAGCAGTTGGCGGCGAGCGCGTCTGCGCCTGCGCGGCGGGCCCCGCGCCCCTCCTCC 333 
I I I I I I I I I I I I I I I I III 

Db 1919 ATACTACAGATACATTCAAGCTCAT 

GACTGCTAAAAACTTCCTGAAATGAGCCTTA 1864 



Qy 334 CCCCCTGGGCGCC- 

CCCGGCGGCGTGTGAATGGCGGCCTCCGCGGCGGCAGCCTCGGCAG 392 
I I I I I I III I 

Db 1863 ATTCAT — ACGAATCATTTCTATTTGT 

TCTTTTTTTTTTGAGACAGACT 1817 



Qy 393 

CAGCGGCCTCGGCCGCCTCTGGCAGCCCGGGCCCGGGCGAGGGCTCCGCTGGCGGCGAAA 452 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1816 CTTACTCTGTCGCC-CAGGCTG- 

GAGTGCACTGGCATGATCTCGGCTCACTGCAA — 1764 



Qy 453 AGCGCTCCACCGCCCC 

TTCGGCCGCAGCCTCGGCCTCTGCCTCAGCCGCGGCGTC 5 0 7 

I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 1763 CCTCTGCCTCACGAGTTAGGT — 

CAAGCAATTCTTCTGCCTCAGCCTC 1718 



Qy 508 

GTCGCCCGCGGGGGGCGGCGCCGAGGCGCTGGAGCTGCTGGAGCACTGCGGCGTGTGCAG 56 7 



Db 1717 CCGAGGAGCTAGGATTACAGGT-CCCTGCCAC- 

CA- 1685 

Qy 56 8 AGAGCGCCTGCG ACCCGAGAGGGAGCCCCGCC- 

TGCTGCCCTGT 610 

I I I I I I I I I I I I I I I I I I I 

Db 1684 

TGCCTGAGTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAAG 162 9 

Qy 611 TTGCACTCGGCCTGTAGTGCCTG — CTTAG — 

GGCCCGCGGCCCCCGCCGCCGCCAACAG 666 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 



Db 162 8 CTGGTCTCG AATTCCTGACCTCAGGTGATCCGC — 

CCACCTCAGCCTCCCAAAG 1577 



Qy 667 CTCGGGGG — 

ACGGCGGGGCGGCGGGCGACGGCACCGTGGTGGACTGTCCCGTGTGCAAG 72 4 

I I I I II Mil I I II INI II I I I I II II I 

Db 15 76 TGCTGGGATTAC GGGC-GTGAGCCACGGTGCC TGGCC — 

TCTATTTTTTATT 1528 

Qy 725 

CAACAGTGCTTCTCCAAAGACATCGTGGAGAATTATTTCATGCGTGATAGTGGCAGCAAG 7 8 4 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 1527 TTAAATACCTTTTGCCCTGAAATC TATGTCCT-C- 

TGATTAT 148 8 

Qy 785 GCTGCCACCGACGCCCAGGATGCGAACCAGTGCTGCACTAGCTGTGAGGATAATGCC- 

CC 843 

I I I I III I I I I I I I I I I I I I I I I 

I 

Db 148 7 CAACAC TGCTTACCAGTTTTATTTTTGTTG- 

GTATTTATTTCATTC 1443 

Qy 844 

AGCCACCAGCTACTGTGTGGAGTGCTCGGAGCCTCTGTGTGAGACCTGTGTAGAGGCGCA 9 03 

I I I I I I I I II III I I I I I I I 

Db 1442 AATC-CCAGCT CT ACAACT 

TACCAGTATA 1415 

Qy 904 CCAGCGGGTGAAGTACACCAAGGACCA — 

TACTGTGCGCTCTACTGGGCCAGCCAAGTCT 961 

II I I I II I I I I I I I I I I I II I 

I 

Db 1414 AAAATTTGGATCAGTTACT-TAATCT-TGCAGTTTCTTCCT — 



Qy 962 

CGGGATGGTGAACGTACTGTCTATTGCAACGTACACAAGCATGAACCCCTTGTGCTGTTT 1021 
I I I I I I I I I I I I I I I I I I III III I 

I 

Db 13 72 AAAAATGG-AATAATATTATCTAT CATATAAA ATG — 

CCCAGTTAACAAATA 132 4 

Qy 1022 

TGTGAGAGCTGTGATACTCTCACCTGCCGAGACTGCCAGCTCAATGCCCACAAGGACCAC 10 81 
II III I I I I I I I I I I I I I 

I I I 

Db 1323 TG AAC T AAC AC C T AC AAT G — TGCCAG 

GCAC 1295 



Qy 1082 

CAGTACCAGTTCTTAGAGGATGCAGTGAGGAACCAGCGCAAGCTCCTGGCCTCACTGGTG 1141 

II I I I I I I I I I III II II I I I I I I I I I I I I I 

Db 129 4 CA TTC-TAGGGAATAGAATGT- 

AAATAAGTTCAAGTTCTTGACCTGAATAAGT 1244 

Qy 1142 AAGCGCCTTGGGGACAA- 

ACATGCAACATTGCAGAAGAGCACCAAGGAGGTTCGCAGCTC 12 0 0 

I I I I I I I I I I I I I I I II I I I I II 

I 

Db 12 43 T T AC AT T T TAG T G AG AAGAC AGAC AAT AAATC AAT AACATG 



TC- 



-C 1200 



Qy 1201 

AATCCGCCAGGTGTCTGACGTACAGAAGCGTGTGCAAGTGGATGTCAAGATGGCCATCCT 12 6 0 
I I I I I I I I III I I I I I I I I 

Db 1199 ATTATGCCAGAT AGTTT 

TAAATGTCACAA 1171 

Qy 1261 GCAGATCATGAAGGAGCTGAATAAGCGGGGCCGTGTGCTGGTCAATGATGCCC 

AG 1315 

III II I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1170 — AGAAAAAGGGGGAGC — AATCAG ATAGTGATGG 

AAG T T AC AC T C TAG AG 112 4 

Qy 1316 AAGGTGACTGAGGGGCAGCAGGAG CGCCT-GGAG CGGCAGCACTG- 

GACC— 1363 

I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1123 AGGGT 

GG AC AG AAAAAG T T T C T C T T AGG AG AT G AC AT C T G AAC AG AG AT C T AA 10 71 
Qy 1364 

ATGACCAAGATCCAGAAGCACCAGGAGCACATTCTGCGCTTTGCCTCTTGGGCTCTGGAG 1423 

I I I I I II I I I I I I I III I I I I I III 

I 

Db 10 70 AT G AC T GAG T AC AAACCCCAATGGAAAA GTGATTGG 

TGTGTTG 1028 

Qy 1424 

AGTGACAACAACACAGCCCTTTTGCTTTCTAAGAAGTTGATCTACTTCCAGCTGCACCGG 14 83 
I I I I I I I I I I I I I I I I I I 

Db 1027 

TGAAACAGCAACACAGCCAGTTTG 10 0 4 

Qy 148 4 GCCCTCAAGATGATTGTGGATCCCGTGGAGCCACATGGC- 

GAGATGAAGTTTCAGTGGGA 15 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 10 03 AC T AG AT G A GTAGATCAGAATGACTGAGGTGGG 

ACTGTGG — 964 

Qy 1543 

CCTCAATGCCTGGACCAAGAGTGCCGAGGCCTTTGGCAAGATTGTGGCAGAGCGTCCTGG 16 02 
III I I I I I I II I I I I I I 

I I I I I 

Db 963 — TCA T C AAAG AG AT C AG A GAGGTAG 

CCTGG 935 

Qy 16 03 CACTAACTCAACAGGCC- 

CTGCACCCATGGCCCCTCCAAGAGCCCCAGGGCCCCTGAGCA 1661 

I I I I I I I III I I I I I I I I I I I I I 

Db 93 4 GA AGGC C AC AT C AC AG AAT G T C C T T 

TGGGCCTCTGTAAG 896 



Qy 1662 

AGCAGGGCTCTGGCAGCAGCCAGCCCATGGAGGTGCAGGAAGGCTATGGCTTTGGGTCAG 1721 
III I I I I I I III I I I I I I I I I I I 

I I I I 

Db 895 AGC GTTTTG — AAAAAACAGTTCT GAGCA — AAGGTTA 

GTCAT 857 



Qy 1722 

GAGATGATCCCTACTCAAGTGCAGAGCCCCATGTGTCAGGTGTGAAACGGTCCCGCTCAG 17 81 

I I I I I I I III I I I I I I III I I I 

I 

Db 856 TTAACTTGACTTACTCTTTAGAAGGATCACTT-TGGCTATTGAGTGGCAAAC 

AACTG 801 

Qy 1782 

GTGAGGGCGAGGTGAGCGGCCTTATGCGCAAGGTGCCACGAGTGAGCCTTGAACGCCTGG 18 41 

III I I I I I I I I III III I I I I I 

I I 

Db 800 GAGA AGTGGTTAGGGG AGCACTGTGGGC AGGAATAGAA 

GG 761 

Qy 1842 

ACCTGGACCTCACAGCTGACAGCCAGCCACCCGTCTTCAAGGTCTTCCCAGGCAGTACCA 19 01 

I III I I I I I I I I I I I I III I I I I I I 

Db 76 0 A GGA AAAGCAGACTG-GAGCC TAATC 

CAGTACTT 72 8 

Qy 1902 

CTGAGGACTACAACCTTATTGTTATTGAACGTGGCGCTGCCGCTGCAGCTACCGGCCAGC 1961 



Db 72 7 CAGATGA- 

GAAATGACAGATTTGC 705 



Qy 1962 CAGGGACTGCG 

CCTGCAGGAACCCCTGGTGCCCCACCCCTGGCTGGCATGGCCATTG 2 018 

I I I I I I III I I I I I I I I II I I I I 

Db 70 4 CTGGTACATCATTTCCT ATCCCT — 

TTACTTATAACTTATTGGTGTATTTTTAT 653 

Qy 2 019 TCAAGGAGGAG 

GAGACGGAGGCTGCCATTGGAGCCCCTCCTACTGCCACTGAGG 2 0 72 

I I I I I I I I I I I I I I I II 

Db 652 T T T AAG T G T AAAT T T T T AT AAAT G AAGC AT AAT T T G AAG 

ATTGA — 6 09 

Qy 2073 

GCCCTGAGACCAAACCTGTGCTTATGGCTCTTGCGGAGGGTCCTGGTGCTGAGGGTCCCC 2132 

I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 608 TATGTTCGGACTTGTGTTTT CTATT ATCATGGTGTC 

TCTCC 568 

Qy 2133 

GCCTGGCCTCACCTAGTGGCAGCACCAGCTCAGGGCTGGAGGTGGTGGCTCCTGAGGGTA 2192 

I I I I I I I I I I I I I I I I I 

I I 

Db 56 7 ACATACCTTTTCC CTTTAGTTCT 

GCTTCTTATTGTG 532 

Qy 2193 

CCTCAGCCCCAGGTGGTGGCCCGGGAACCCTGGATGACAGTGCCACCATTTGCCGTGTCT 2 2 52 

I I I I I I I I I I I II I I I I I I 

I I I 

Db 531 CT CCTCAAGT CT— ATTTTTGT TCTTTTCCC — 



Qy 2 2 53 GCCAGAAGCCAGGCGATCTG — 

GTTATGTGCAACCAGTGTGAGTTTTGTTTCCACCTGGA 2310 

I III I I I I I I I I I I I I I I I II 

I I I 

Db 499 AC TTCTACTGTTTTGAACTGTATATGCTATTTCTATTT- 

TGCCAGGA 45 4 

Qy 2311 

CTGTCACCTGCCGGCCCTGCAGGATGTACCAGGGGAGGAGTGGAGCTGCTCACTCTGCCA 23 7 0 

I I I I I I I I I I I I I I I 

Db 453 — GTTACC CCCT 

TCAATTTTCGT 433 

Qy 2371 

TGTGCTCCCTGACCTGAAGGAGGAGGATGGCAGCCTCAGCCTGGATGGTGCAGACAGCAC 2 43 0 

I I I I I I I I I I I I I I I I 

I I I 

Db 432 TGTACTCTCT CAGTCTCA 

AACAC 410 

Qy 2431 

TGGCGTGGTGGCCAAGCTCTCACCAGCCAACCAGCGGAAATGTGAGCGTGTACTGCTGGC 2 49 0 

I I I I I I I I I I I I I I I I I I 

Db 409 ATAC CCATGCTCTCTTC 

TGTCCTTCTCTT 381 

Qy 2 491 CCTATTCTGTCACGAACCCTGCCGCCCCCTGCATCAGCTGGCTACCGACTCC- 

ACCTTCT 2549 

I I I I I I I I I I I I I I I I I I III 

I I I I 

Db 380 CATAT CATCTTCTAGAATTAGTT — CTACC 

CCGATTTTCT 343 

Qy 2550 

CCCTGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCCCGCCTCCAGGAGA 2 6 09 

II III I I I I I I I I I I I 

Db 3 42 ACC 

AATTTATTTTATTTCCAATCAGTACATTGATTCAG 3 05 

Qy 2610 

AGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTGGGCCGCATGTTCA 2 669 

I I I I I I II I III I I I I I I I I I I I I I I I 

Db 304 ATTTATCA ATATGTTT T AC C A 

ATTTTATCATTTTTTTTGGAGGCTTGTT — 2 56 

Qy 2670 

AGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATCATCGGCCTGCAGC 2 72 9 

I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 255 ATTTA — AGGTTCATT TTTCTTGGAGTACAT 

CCTTCAGA 219 

Qy 2 73 0 

GCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTCTCTGCTGTGCTGG 2 7 89 
I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 218 AGTTCTTTCA GCAAGAAC TCATGGTTGTTA ATCTTTG 

AG 180 



Qy 2790

TGGAGCCCCCGCCGATGAGCCTGCCTGGTGCTGGCCTGAGTTCCCAGGAGCTGTCTGGTG 2849

I II II I I I I I I I I I I I I I I I I I 

Db 179 T CCTTTG TTTCTGGTGTTTTAACTTTTTCCC 

CTCACT 143 

Qy 2850 GCCCTG- 

GTGATGGCCCCTGAGGCTGGAGCCCCCATGGCCAGCCCAGCCTGGCTCTGTTC 2 9 0 8 

I I I I I I I I I I I I I I III I III I I I I II 

I 

Db 142 — CTTGAGTAATAG TTTAGCTGGG CAT-AAAATTTCAGATTGGCAGT- 

TAC 96 

Qy 2909 

TCTGTCCTGTCACCCCATCCCCACTCCCCTGGTGGCCTGACTCCCACTCCCTGGTGGCCC 2 96 8 

I I I I I I I I I I III I I I I I I II 

I 

Db 95 TTTTCCTTGG C AC T AC G AAG AT G CCGCTCC 

GTTATCT 59 

Qy 2969 CATCCC— 

CCAGTTCCTCACGATATGGTTTTTACTTCTGTGGATTTAATAAAAAAAACTT 3 02 6 

II I I I I I III I I I I I I I I I I I I I I I I I 

Db 5 8 TAT T AC AGC C AAT — AT T AC — TATTGTT 

CCTTTGTAGGTAATCTCCAGTAGGC— 9 

Qy 3027 CACCAGT 3033 

II I I I I 
Db 8 CAACAGT 2 
RESULT 1

US-10-478-914-17/C 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17
; LENGTH: 2198
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-17 

Query Match 43.2%; Score 1402.5; DB 1; Length 2198; 

Best Local Similarity 39.3%; Pred. No. 8e-84; 

Matches 398; Conservative 0; Mismatches 172; Indels 443; 
Gaps 59; 

Qy 2 TTTTTTTTTTTTTTTTTTAAAG 

TCTTTAGTATATTTATTTG 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2163 

TTTTTTTTTTTTTTTTTTATGGTGCCAATTTTAAATAGTTTTATTTAAGACATTGCATTT 210 4 

Qy 43 T AT AAAG AG T AA 

ACAAAGTGCATAT 6 7 

I I I I I I I I I I I I I I I I 

Db 2103 

TCCACTTACAATACAGTGTTTATAAAGTGCAATGTTATTTCCTTCCCCTGTGCATATGTT 2 0 4 4 

Qy 6 8 AGAGTGGCCA 

CAGGTT T 8 4 



Db 2043 

C C AT AT T C AAG T AT T G AG AAT GC C C AG T AAC T T AC T AT AGC AGC T T AAC T T T T T AAAAC T 19 8 4 

Qy 85 GACACAGA GACCTT— GGTGATGTA GG— 

CTATG-A 115 

I I I I I I I I I I I I I I I I I II 

Db 1983 

GCCACAGAATTTGCTACGAATTTAGGTCCTTCAAATGTTTTAAATGTGTGGAACAATGCT 192 4 

Qy 116 ACAAATTTAAATGG CAACTTCATTG 

CTG C 144 



Db 1923 

ACATCTACACTTGGTTGGCTTAATCAACCTCTTCAATGGTGGGCCCTGAGGAAGCACCAC 186 4 

Qy 145 CA CTGAACCA 

ATCCTG 160 



Db 1863 

CAGAGGGAGGAGCTCCACCACCAGGAAATCCCCCAGGCATTCCTCCTGGCATGCCTCCTG 18 0 4 



Qy 161 AATTTGGGCTCAAC-AGGTGAAAA GT

AACAATATCAAACGAATACT 205

II II III I I I I I I II III I I I I I III 

Db 1803 

CACTCTGGTACAGCTTGGTGATGATGGGGTTGCAAACTTTCTCCAGCTCTTTCTGTTGAT 17 4 4 

Qy 2 06 AAA CAGCATAACAAAAAGATT TTCAGACTCTTGG 

TCATAA- 2 45 

III I I I II I I I I I I I I I I I I I I I 

I 

Db 1743 

GTTCAAATTCTTCCTTCTCAGCAGTCTGATTCTTATCAAGCCAGTTGATAATTTCATTAC 16 8 4 

Qy 2 46 AGACCGTAATC GTTCACATTG — 

AATCAATGACTAAACATTTTTGATTA 292 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1683 ACTTGTCCAGAATCTTCTGTTTGTCCTCATCGTTAATC- 

TTGCCTTGAAGTTTCTCATCT 1625 

Qy 293 CC— CAGCTACCTCCA AGCAAACTGAAAACTGTCTAGTGGATCCT— GAAGTC- 

CAT 344 

I I I I I I I I I II I I II I I I I I I I I I I I I I I I 

I 

Db 162 4 TCAACAGTTGCTTTCATGTTGAAGGCATAGGAC — 

TCAAGTGAATTCTTGGATGACACCT 156 7 

Qy 3 45 AGTGCCTCT AG CCGGGTCT 

TTCAA 368 

I I I I I I I I I I I I I I 

I I I I I 

Db 1566 

TGTCCCTCTGCTTCTCATCTTCAGCTTTGTACTTCTCAGCTTCCTGGACCATACGTTCAA 150 7 

Qy 369 GTG TTGCACCACAGG 

GTGATGATTGATGGTA 399 

II I I I I I I II I I I I I I I I I I I I 

Db 1506 - 

TGTCTTCCTTGCTCAAACGGCCCTTGTCATTAGTGATAGTAATCTTGTTCTCTTTTCCC 1448 

Qy 40 0 AAAACAGGGA 

TCAACCCTTGTAGATCGGTGGTAAGTATGGA 440 

I I I I I I I I I I I I I I III II II II 

Db 1447 G T AC T C T T G T C C AC AGC AGAGAC AT TGAGT AT AC CATTGGC- 

ATCAATGTCAAAAGTGAC 13 89 

Qy 441 AACCCTCT-AAGAACA GTGCAG— CGTATG— TG- 

GTATTCAGACT 48 0 

I I I I I I I I I I I I I I I I I I I I I III I I I I I I I 

Db 1388 

TTCAATCTGAGGAACACCTCGGGGTGCAGGAGGTATGCCTGTGAGTTCAAACTTGCCAAG 132 9 

Qy 481 — GGTT GC AT AC - AGC AT T C A- 

AAACCAGT G 50 7 

I I I I III I I I I I I I I I I I I I 

I 

Db 1328 

CAGGTTGTTATCCTTTGTCATGGCACGCTCGCCTTCATAAACCTGAATAAGCACACCAGG 12 69 

Qy 50 8 CTG GAATAG CT TGCCCCAAAG 

TGGTAGAGTTAT- 540 

III I I I I I I II III II I I I I I I I I 



Db 1268 

CTGGTTGTCAGAATAGGTAGTGAAGGTCTGTGTCTGCTTGGTAGGAATGGTGGTATTACG 12 09 



Qy 541 AAAAG GAT AT AC AT T G AC GTTTCTTAAAAGCATGTGTAA 5 79 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 12 08 C T T GAT G AGG AC AG T C A- T G AC T C C AC C AGC AG T T T C — AATACCAAGGGAAA 1159 



RESULT 2 

US-10-478-914-17 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17
; LENGTH: 2198
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-17 

Query Match 43.1%; Score 1397; DB 1; Length 2198;

Best Local Similarity 41.7%; Pred. No. 2e-83;
Matches 379; Conservative 0; Mismatches 191; Indels 338;
Gaps 55;

Qy 1 CTTTTTTTTTTTTTTTTTTAA — 

AGTCTTTAGTATATTTATTTGTATAAAGAGTAAACAA 5 8 



Db 464 

CAATGTACTTAGAATTATTAATGAGCCAACTGCTGCTGCTATTGCTTACGGCTTAGACAA 523 

Qy 59 A GTGCATATAGA 

GTGGCCACAGGTTTGACACAGAGA 94 

I I I I I I I I I III III I I I I I I I I I I 

Db 52 4 AAAG G T T G G AGC AG AAAG AAAC G T G — CTCATCTTTGAC- 

CTGGGAGGTGGCACTTTTGA 58 0 

Qy 95 CC TTGGTGATGTAGGCTATGA — AC AAAT T T AAA — TGG 

CAA 132 



Db 581 

TGTGTCAATCCTCACTATTGAGGATGGAATCTTTGAGGTCAAGTCTACAGCTGGAGACAC 6 4 0 

Qy 133 CTTCATTGCTG CCACTG AACCAATC 

CTGAATTTGG 167 

I I I I I I I I I I I I I I I I I I I I I I I 



Db 641 

CCACTTGGGTGGAGAAGATTTTGACAACCGAATGGTCAACCATTTTATTGCTGAGTTTAA 7 0 0 

Qy 16 8 GCTCAA CAGGTGAAAAGT AACAA 

TATCAAACGAATACTAAACAG 211 

I I I I I III I III I I I I I I I I I I I II II 

I 

Db 701 GCGCAAGCATAAGAAGGACATCAGTGAGAACAAGAGAGCTGTAAGACGCCTCC- 

GTACTG 759 

Qy 212 CATAACAAAAAGATTTTCAGACTCTTGGT CATAAAGACC GTAATCG- 

TT 259 

II II II I I I I I I II I I I I I I I I I 

I I 

Db 760 

CTTGTGAACGTGCTAAGCGTACCCTCTCTTCCAGCACCCAGGCCAGTATTGAGATCGATT 819 

Qy 26 0 CACATT G AATC AATGAC T AAAC AT T T T T GAT T AC C C - 

AGCTACCTCCAAGCA-A 311 

II I I I I I I I I I I I I I I I I I I I I II II II 

I 

Db 82 0 CTCTCTATGAAGGAATCGACTTCTATACCTCC ATTACCCGTGC — 

C C GAT T T G AAG A 8 74 

Qy 312 ACTGAAAACTGTCTAG TGGATCCTG AAGTCCATAG 

TGC 349 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 875 

ACTGAATGCTGACCTGTTCCGTGGCACCCTGGACCCAGTAGAGAAAGCCCTTCGAGATGC 93 4 

Qy 350 C — TCTAGCCGGGTC TTTCAAGT GTTG 

CACCA 3 79 

I I I I I I I I I I I I I I I I I I 

I I 

Db 935 

CAAACTAGACAAGTCACAGATTCATGATATTGTCCTGGTTGGTGGTTCTACTCGTATCCC 99 4 

Qy 380 CAGGGT GATGATT GATGGTAAA-AACAG 

GGATCAA 413 

I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 995 

C AAG AT T C AG AAGC T T C T C C AAG AC T T C T T C AATGGAAAAGAAC T G AAT AAG AGC AT C AA 105 4 

Qy 414 CCCT TGTAGAT CGGTGGT 

AAGTATGGAAACCC 4 45 

I I I I I I I I I I I I I I I I I I I I I 

Db 1055 

CCCTGATGAAGCTGTTGCTTATGGTGCAGCTGTCCAGGCAGCCATCTTGTCTGGAGACAA 1114 

Qy 4 46 -TCTAAGAA CAG TGCAGC GTATG 

TGGTATTCA 476 

I I I I I I I II I I I I I I I I I I I I I I I I 

I 

Db 1115 

GTCTGAGAATGTTCAAGATTTGCTGCTCTTGGATGTCACTCCTCTTTCCCTTGGTATTGA 117 4 

Qy 477 GACTGGT TGCATA 

CAGCATTCAAAAC 502 



Db 1175 

AACTGCTGGTGGAGTCATGACTGTCCTCATCAAGCGTAATACCACCATTCCTACCAAGCA 123 4 

Qy 5 03 CAGTGCTGGAATAGCT TGCC — CCAAAGTGGT 

AGAGTTAT — 540 

I I I I I I I I I I I I I I I I I I I I I 

Db 1235 

GACACAGACCTTCACTACCTATTCTGACAACCAGCCTGGTGTGCTTATTCAGGTTTATGA 12 9 4 

Qy 541 AAAAGGATATACA 

CTTAAAAGCA 5 72 



Db 1295 AGGCGAGCGTGCCAT G AC AAAGG AT A- 

ACAACCTGCTTGGCAAGTTTGAACTCACAGGCA 1353 

Qy 573 TGTGTAAT 580 

I I I 
Db 1354 TACCTCCT 1361 



RESULT 3 

US-10-478-914-64/C 

; Sequence 64, Application US/10478914 
; Patent No. 7335755 
; GENERAL INFORMATION: 
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64
; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-64 

Query Match 42.4%; Score 1375; DB 1; Length 2120; 

Best Local Similarity 40.3%; Pred. No. 6.5e-82; 
Matches 362; Conservative 0; Mismatches 199; Indels 338;
Gaps 48; 

Qy 1 

CTTTTTTTTTTTTTTTTTTAAAGTCTTTAGTATATTTATTTGTATAAAGAGTAAACAAAG 6 0 



Db 2 0 83 CCTTTTTTTTTTTTTTTTAAAAGAATGAGTTACATTTATTGATAT 

GGTTTGTCATA- 2 02 8 



-TTGAC — GTTT 

I I I I I I I I II 



Qy 61 TGCATATAGAGTGGCCACAGGTTTGACACA- 

GGTGATGTAGG 109 



-GAGACCTT- 



Db 2 02 7 TGCTTTATAAATGGTCACC- 

C T T T GAAACAT GTATTATTACTAT T TGGGGGAGAGGGGGA 1969 

Qy 110 CTAT GAACAAATTTAAATGG 

CAACT 13 4 

III I I I I I III 

Db 1968 

CTGTTCATTTTACAGGGGACAAGCAAGACAGGCTCAAGGAGGGAAAGGACAGGCTCAAAG 19 09 

Qy 135 TCATTGC TGCCACTGAA CCAATC 

CTGAATTTGGGCTCAAC 174 

I I I I I II I I I I I I I I II I I I I I 

Db 1908 

TCATCACAGTGTGGGGCTGGAATGCAGTTGCCCTTCCTTCTTTCTTTTTGCACATCTTCC 18 49 
Qy 175 

AGG T G AAAAG T AAC AAT AT C AAAC G AAT AC T AAAC AGC AT AAC AAAAAG AT T T 2 2 7 

I I I I I II I I I I I I I I I I I I 

I 

Db 18 48 GTCTCTAGGGTGAGGAGGGGTGCAGGCACAGGCACCCAAGACAGC 

TGCGGT 1798 

Qy 22 8 TCAGACTC TTGGTCATAAAGAC 

CGTAATCGTTCAC A 263 

I I I I I I I I I I I I I I I I I I I 

I 

Db 1797 

CCAGCCCCGGCCCCACCTGTGGTCTCAGTGACGCCCCAGAGGCCCCATCTTCCCCACATA 173 8 

Qy 26 4 TTGA ATCAATGACTAAACATTTTTGATTA— CCCAG- 

CTACCT 3 03 

III I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1737 

ATGAGGCTGCTCCATCCTCCTCAAAGCCCAGACCTATTTCATAAGCCCCAGACCCCACCT 16 7 8 

Qy 304 CCA-AGCAAACTGAAAACTGT-CTAGTGGATCCTGAAGTCCATAG 

TGCC 350 

III II I I I I I I I I I I II I I I I I I I 

I I I I 

Db 16 77 TCACCCAGGGCCCCAAGAGAACAGAGCTGGAGACACTTCTACTCC- 

TAGCACTGGATGCC 1619 



Qy 351 TCTAGCCGGGTCTTTCAAG-TGTTG 

CACCAC 380 

II II I I I I I I I I I I I I I I I 

Db 1618 - 

CTCTCCCCTCCTGTGAAGCTGTAGGTGGGGGGTGGAGGACACCCCTTTAAGGCAGGCTC 156 0 



Qy 381 — AGGGTGATGA TTGATG GTAAAAACAGGGAT 

ACC 415 

III I I I I I I I I I I I I I I I I I I 

I I I 

Db 1559 

GGGGGGCTATGAACTCCAAGACTCCTGGAAGCCAGTAAGAAGTGGGAAGGTAGGCACACC 15 0 0 

Qy 416 CTTGTAGAT CGGTGGTAAG 

TATGGAAACC 444 



Db 1499 

ATTTGGGATTTTTATTTTTCACTGAGCTAAGAGAGGAGATTCCAGCTATGGAATCCAGAG 14 4 0 

Qy 4 45 CTCTAAGAACAGTGCAGCGTATG TGGTATTCAGA 

CTGGT 483 



Db 1439 AGCGTTCTACAGGAACAGAACAGAAATTGGCTGGGCCCTGG- 

GCCCAGAGGAGGCCTGGA 13 81 

Qy 48 4 — TGCATACAGCATTCAAAACCAG TG CTG- 

GAATAG 516 



Db 13 8 0 GCCGCTTACTG — 

TTCAGTGCCAGCTTCATCCTTGGCACTCTGCCTCTCACACAGCATCG 1323 

Qy 517 CTT — GC CCCAAAGTGGTAG 

AGTT 538 

II II I I I I I I I I I I I I I 

Db 1322 

CTCCGGCTCCCAGAGCCCACAGTGGCAGCCGCTGGAGGCTGTGGCAGCAGCTGCTGCCCT 12 63 

Qy 539 ATAAAAGGATAT ACATTGACGTTTCT 

TAAAAGCATGTGTAAT 58 0 



Db 1262 

GCCCAAGGAGGGATGTGGGCTTGGCTCTGACAGTTCAAGCCAGAAAGAGCCTTAGAAGT 12 0 4 



RESULT 4 

US-10-478-914-61/C 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61
; LENGTH: 2437
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-61 



Query Match 42.4%; Score 1374; DB 1; Length 2437;

Best Local Similarity 40.6%; Pred. No. 1.6e-81;

Matches 354; Conservative 0; Mismatches 216; Indels 302;
Gaps 44;



Qy 1 CTTTTTTTTTTTTTTTTTTAA-AGTCT TTAGTA 

TATTTATTTG 42 



Db 2415 

CTTTTTTTTTTTTTTTTTTAAGAATCACAGCAATCCAAGCAAAGTACCTCACTGAGTAGG 2356 

Qy 43 TATAAAGA GTAAACAAAGT GCATATAGAGT 

GGCCACAGGTT 83 



Db 2355 

TATCAAGACCCTTCAGGCAGAATTCCATCATTCTCGCAATTAGTGACAGGGACGCTGG-T 2 2 9 7 

Qy 8 4 TGACACAGAGACCTTG— GTGATG 

TAGGCTATGAACAAATTTAAATG 12 8 



Db 2296 TGTGGAGGAGTATTTGGAGTGGTGGCAGAAAATTGCATCGGC- 

ATTTCCAGCTTGAGTAG 223 8 

Qy 129 G CAACTTCATTGCTGC— CACTGAA-CCAA- 

TCCTGAATTTGGGCTCAACAGGTGAA 181 



Db 223 7 GTCTTATCTTCTTAGCGGATGCTCAGAAGCCAAGGTCTGGATGCAGTCG- 

GTCAGTTGCA 2179 

Qy 182 AAGTAAC AAT AT C AAAC G AAT AC 

TAAACAGCATAACAAAAAG 223 



Db 2178 AA- 

TAACAGCTCAGGAATGTTAAGTGGCTTCAGGAGTCGTGTGGACAGCACGATTACCTG 212 0 

Qy 22 4 ATTTTCAGACT CTTGGTCATAAAGACCGTAATCG — TTCACAT- 

T G AAT C AAT G AC 2 76 



Db 2119 

GGTTCCTGGCTTCAACGATGCCACAAGATCTTTGACCATTTTCACTTCAGAGACTGTGAC 2 06 0 

Qy 2 77 TAAACATTTT TGATTAC CCAGCT- 

ACCTC 304 



Db 2059 

CCCACCTACCACAAAGAGGATCAGGAGGGGGTAGTCACTAGGATGAGGCCGGCTCACCTT 2 0 0 0 

Qy 3 05 CAAG — CAAACTGAAAACTGTCT 

AGTGGATCCTGAAGTCCATAGTGC 349 

III II I I I I I I I I I I I I I I I I I I I I 

Db 1999 

CATGAACATGCTAAATCCAGTTTTAAGGAGATCAGTGAGGCCTGAAGACATGTGTTCAAT 19 4 0 

Qy 350 CTCTAGC CGGG TCTTTC 

AAGTGTTGCACCACAGG 3 83 

I I I I I I I I I I I I I I I I I I 

I I 



Db 1939 

ATCAACGGAATCTGGCCTCTCGGGATGAAATATTTCCTCCACAACTTGCTTCAACAATGG 18 8 0 

Qy 384 GT GATGATTGATGG 

TAAAAAC 40 4 

I I I I I I I I I I I I I I I 

Db 1879 

CTTATAAGATGCCTGGTGGGTATGATTTCCAGGAACATATACAGACTTAAACTGTTTCAG 182 0 

Qy 405 AGGGATCAACC 

CTTGT 420 



Db 1819 

GAGACTCCGAGCTCCAGCAATATCCCGAAGTGAAGTAAAGAGTTCATCCACGGCAATTTT 176 0 

Qy 421 AGATCGGTGGTAAGTATG GAAACCCTCTAAGAAC 

AGTGCAGCGTATG 46 7 



Db 1759 

GGATTTGTGAAATGTCAGATTAATTGAAGAGTCCCAGTCCGTAATTTTTTGCAGCAAAGG 17 0 0 

Qy 46 8 T GG T AT T C AG AC T GG T T GC AT AC AGC A- 

TTCAAAACCAGTGCTGGAATAGCT 518 

II III I I I I I I I I I I I I I I I I I I I I II 

Db 1699 TGACAATC 

CAGATTCCTCACAGAAGACCTGAGCCAATGCTTTCTTGACTTTTTCTTC 16 43 

Qy 519 TGCCCCAAAGTGGT AGAGTT ATAAAAGGATATACAT 

TGA- 557 



Db 1642 

TGCTTCACACAGGTCTTTGTCTACCGTGAGCTCTCCAGTGACAGAATAAATATATATGAG 15 83 

Qy 558 CGTTTCTTAAAAGCATGTGTAAT 58 0 

I I I I I I II I I I I I 
Db 1582 AAGGATCAGCAGTTCCTCAGGGC TGTAGT 155 4 



RESULT 5 

US-10-478-914-21/C 

; Sequence 21, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 21
; LENGTH: 3589
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-21 

Query Match 42.4%; Score 1373.5; DB 1; Length 3589; 

Best Local Similarity 45.5%; Pred. No. 1.4e-80; 

Matches 338; Conservative 0; Mismatches 230; Indels 175; 
Gaps 33; 

Qy 1 CTTTTTTTTTTTTTTTTTTAAAGTCTTTAGTATATTTATTTGTATAAAG 

AG 51 



Db 3567 CTTTTTTTTTTTTTTTTTTT 

TTTTTTTTTTTTGAAAGAATCCAGCAGATATTTAT 3513 

Qy 52 TAAACAAAGTGCATATAGAGTGGC CACAGG 

T T T G AC AC AG AG AC C T 9 7 

I I I I I I III I II I I I I I I I II I I I I I 

Db 3512 

T AAGC AAG AT GAAAG T GAAAT T AC AAAC AC AGGTC AAC T T T TAAAC TCAGCACTCTGTTG 3453 

Qy 9 8 TGGTGATG TAGG CTATG 

AACAAATTTAAATGGC 13 0 



Db 3452 

GAGTGGAGGTGCACGGTCCTTCATCATAGGCAGCCTATGCGAGATGCATCTTAGGAAGGG 3393 

Qy 131 AACTTCATTGCTGCCACTGAACCAATCCTGAATTTGGGCT 

CAACAGGTGAAAAGTAA 187 



Db 3392 AGCTT — 

TCGCTGCTCAGAAATCAAAGCTCCATCGGAGGTGTCCTACTGGAGGCATCAGA 3335 

Qy 188 CAA TATCAAACG — AAT AC T AAAC AGC AT AAC — AAAAAG AT - 

TTTCA- 230 



Db 3334 

CAACAAGCTAAATGACGTTAGGGCTACACAACACAAAGGGGAAAGTTGACAACAATTCAG 32 75 

Qy 231 -GACTCTTGGTCATAAAGACCGTAATCGTT-CACATTGAATCAATGA 

CTAAACATTT 2 85 



Db 32 74 GGGCTTTGAGTAGTCAAGACAATTAGCTTAGTACTTCAGGTCAATAAATGCTACA- 

ATTT 3216 

Qy 2 86 TTGATTACCCAGCT ACCTCCAAGC — AAACTGAAAAC- 

TGTCTAGT 328 

II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3215 ATGGGCAATTAGCTGTAAAATGGCCTAC- 

AGCTGAAACACTATTCTTGTCTTATTTCACG 3157 

Qy 329 GGATCCTGAAGTCCATAGTGCCTCTAGCCGG GTCTTT 

CAAGTG- 3 71 

I I I I I I III I I I I I I I I I I I II 



Db 3156 

ATTTCATGGAGATTAAAGATAATCTAAACAGATTAAGGCCTTAATCTTTAATCCAGGTGA 3 09 7 

Qy 3 72 TTGCA 

CCACAGGGTGATGATTGATGGTAA 40 0 



Db 3096 

GTAATTTTGTTTGTAG T AAAAAGC AT AAGAAAT AAT TC TC AGC AT AT T ATGT AAAAG AAA 3 03 7 

Qy 401 AAACAGGGATCAACCCTTG-TAGAT CGGTGGTAAGTA 

TGGAAACCCTCTAAG 451 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3 036 AAGCAGTGA — 

AAACCTTGTTTGGTCTTCCAGTGGCAAGGATAGTTGAACAACTTCAAGA 2 9 79 

Qy 452 AACAGTGCAGCGTATGT 

GGTATTCAGACTGGTTGCATACAGCATTCAA 499 



Db 2978 

AGCTGAGAGAAGTTTGTTCCCAATTTACAAGTATTTTGACATATAGATAAACAAAAGCAA 2 919 
Qy 50 0 AACCAGT 

GCTGGAATAGCTTGCCCCAAAGTGGTAGAGTTATAAAAGGATATACATTG 556 

I I I I I I I I I I I I I III II II I 

Db 2918 

AAC C AG T T AAAC C T T AAAAG AT C AT C T G AAT AAG AT C C T G AC AAT TAT AAT TTTACATCC 2 859 

Qy 557 ACGTTTCTTAAAAGCATGTGTAA 579 

III II I I I I I I I I I 
Db 2858 ATGTGAATTTCAAGCAATTGTTA 2836 



RESULT 6 

US-10-478-914-20 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 

; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20
; LENGTH: 2495
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-20 



Query Match 42.3%; Score 1371; DB 1; Length 2495;

Best Local Similarity 39.1%; Pred. No. 3e-81;

Matches 381; Conservative 0; Mismatches 190; Indels 404; 
Gaps 54; 

Qy 1 CTTTTT TTTTTTTTTTTTTAAAGTCTTTAGTATAT 

TT 37 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 220 CATCTTTGGGCAGAACTGGTGTTTTTGTATGACAAGTATGAAGAATATGATAATGCCATA 279

Qy 3 8 AT T T G T AT AAAGAGT AAAC AAA GTGCAT — ATAGAGTGGC CACAGGT 

TT 84 

III I I I I I I I I I I I I I I I I I I III I I I I I 

I I 

Db 280 

ATTACCATGATGAATCATCCAACTGATGCCTGGAAAGAAGGGCAATTCAAAGATATCATT 339 

Qy 85 G AC AC AG AG AC C T TGGTGATGTA GGCTAT 

GAACAAATTTAAATGGCAA 132 

III II III III I I I I I I I I I I I I 

I 

Db 340 ACCAAGGTTGCCAATGTGGAACTATACTACAGAGCAATACAGTTCTACTTAGAATTCAAG 399

Qy 133 CTTC ATTGCTG CCACTG AAC 

CA 154 

III I I I I I I I I I I I II 

I I 

Db 400 

CCTCTGTTGTTAAATGATTTGCTGATGGTGCTGTCTCCACGGTTGGATCACACTCGTGCA 459 

Qy 155 ATCCTGAATTTGGGC TCAACAGGTGAAA 

AGT- 185 

II I I I I II I I I I I I I I I I 

I I I 

Db 460 

GTCAATTATTTCAGCAAGGTTAAACAGCTACCACTGGTGAAACCGTATTTGCGTTCAGTT 519 

Qy 186 AACAATATCA AAC GAAT -ACT AAAC AG CAT AAC — 

AAAAA 222 

I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 520 CAGAACCATAACAACAAATCTGTGAATGAATCATTGAACAATCTTTTTATTACAGAAGAA 579

Qy 2 23 GAT T T TC AGAC TC T - TGGTCATAAAGA 

CCGTAATCGTTCACATTGAATCAATG 2 74 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 580 

GATTATCAGGCTCTGCGAACATCAATAGATGCTTATGACAACTTTGACAATATCTCGCTT 639 

Qy 2 75 AC T AAAC AT T T TTGATTACC CAGC- 

TACCTC— C 305 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 640 

GCTCAGCGTTTGGAAAAACATGAACTCATTGAGTTCAGGAGAATTGCTGCTTATCTCTTC 699 



Qy 3 06 AA — GCAA ACTGAAAA 

CTGT 323 

II I I I I I I I I I I I I I I 

Db 700 

AAAGGCAACAATCGCTGGAAACAGAGTGTAGAGCTGTGCAAGAAAGACAGCCTTTACAAG 759 

Qy 32 4 CTAGTGGATC CTGAAG TCC- 

ATAG 3 46 



Db 760 

GATGCAATGCAGTATGCTTCTGAATCTAAAGATACTGAATTGGCTGAAGAACTCCTGCAG 819 

Qy 347 TGCCTCTAGCCGG GTCT- 

TTCAAGTGTTGC 3 75 

I I I I I I I I I I I I I I I 

Db 820 

TGGTTTTTGCAGGAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTACCTGTTACGAT 8 79 

Qy 3 76 ACCA CAGGGTGATGATTGATGG — 

TAAAAACAGGGA 409 

III II I I I I I I I I I I I 

Db 880 

CTTTTAAGGCCAGATGTCGTCCTAGAAACTGCATGGAGGCACAATATCATGGATTTTGCC 939 

Qy 410 TCAACC CTTGTAG 

ATCGGTGGTAAGTATGGAAAC- 4 43 



Db 940 

ATGCCCTATTTCATCCAGGTCATGAAGGAGTACTTGACAAAGGTGGATAAATTAGATGCT 999 

Qy 444 CCTCTAAGA ACAGTG CAGC 

GTATGTGGT 471 



Db 1000 TCAGAATCACTGAGAAAAGAAGAAGAACAAGCTACAGAGACACAACCCATTGTTTATGG- 1058
Qy 4 72 AT TC AG AC 

TGGTTGCATACAGCATTCAAAACCAGTGCTGGAATAGCTTGCCCCA — 52 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1059 — TCAGCCCCAGTTGATGCTGACAGCA GGACCCAGTGTTGCCGTCCCT — 

CCCCAGG 1111 

Qy 52 6 AAG TGG-TAGAGTTATA 

AAAGGATATACATTGACGTTTCTT 565 

I I I I I I I I I I I I I I I I II III 

Db 1112 CACCTTTTGGTTATGGTTATACCGCACCACCGTATGGACAGCCACAGCC- 

TGGCTTTGGG 1170 

Qy 566 AAAAGCATGTGTAAT 580 

I I I I I I I I I II 
Db 1171 TACAGCATGTGAGAT 1185 



RESULT 7 

US-10-478-914-20/C 

; Sequence 20, Application US/10478914 
; Patent No. 7335755 



; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20
; LENGTH: 2495
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-20 

Query Match 42.2%; Score 1367; DB 1; Length 2495; 

Best Local Similarity 41.8%; Pred. No. 5.9e-81; 
Matches 365; Conservative 0; Mismatches 211; Indels 298; 
Gaps 45; 

Qy 1 

CTTTTTTTTTTTTTTTTTTAAAGTCTTTAGTATATTT 3 7 

I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 
Db 2459 CTTTTTTTTTTTTTTTTTACATGT- 

T T AAAT AT AT T T T AT T TC AAGT T AAT AT AAC AGCC 2401 

Qy 3 8 ATTTGT ATAAAGAG TAAACAAAGTG- 

CATATAG 69 

I I I I I I I I I I I III I I I I I I I 

Db 2400 

ATGATTTGTTCATTTAAGTGAAATGCTTCGATAAAAACCCCGGTGTCACTTGGTTAAAAT 23 41 

Qy 7 0 AGTGGCCACAGGTT TGACAC 

AGAGACCTTGGTGATGTAGGCT 111 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2340 

ATTGGAGACAGATTAAATGTTAAGCCACTATAGTAGTGACTCTGGTTCTTTAGACTATTG 2 2 81 

Qy 112 - AT G AAC AAAT TT AAAT GGC AAC T TCATTG CTGC 

CAC- 147 

I I I I I I I I I III I I I I I III 

I I I 

Db 2280 

C AT G T AC C T T G T T AAT G AAC T AC AGGAATGC AC AT TGAT AT AATGC AC AAT G T C T T CAC A 2 2 21 

Qy 148 TGAACCAATCCTGA ATTTGGGCTCAACAGGTG-AAAAGTA- 

ACAATATCA 195 

II I I I I I I I I II II I II I I I I I I I I I I 

I 

Db 2220 

TGTCTCAATCCTCATTCCTCTTCCTTTCATGCCTCCCTAATGCCTCAGTATCCAATATAA 2161 

Qy 196 A AC G AAT AC T AAAC AGC AT AAC 

AAAAAGATTTTCAGACTCTT 23 7 

I I I I I I I I I III I I I I I I I I I 



Db 216 0 AACTACAAAAACTACTGTTCTTTTCGTTTGAGTGGTATTTAATTAAAACTTCA 

ATTCTA 2102 

Qy 23 8 GGTCAT AAAGACC-GTAAT CGTTCACATTGAAT 

CAATGACT 2 77 



Db 2101 

TGTTATTTCCACAGTCCAGTAATTTATTTTAAATTTGAGTAATTTCAAATTCCACAAACA 2 0 42 

Qy 2 78 AAAC ATTTTTGATT ACCCAG- 

CT— 299 

I I I I I I I I I I I I I I I I 

Db 2041 

AAACTGAAGAACAGCAATATTTTGTTTGAATTTCTCTCTTCTGTACACTCAGTGATCTAA 19 82 

Qy 300 ACCTC CAAGCAAACTGAAAACT 

GTCTAGT 328 

I I I I I I I I I I I I I I I I I I I I 

Db 1981 

AACACCACAATATCCAACATACACAAACCTCAGGGAAGGGTTAGTAAACACACACAAGAT 192 2 

Qy 329 — GGATCCTGAAGTCCATAGTGCCT— 

CTAGCCGGGTCTTTCAAGTGTTGCAC 377 

I I I I I I I III I II II I I I I II I I I I 

Db 1921 

TGGAATCATGGTGCTCTTTGCTCCTGAATGGAATGGTCCCACAGAAAAAGCACAGGATAC 1862 

Qy 3 78 — CACAGGGTGATGATTGATGGTAAAAA — CAGGGATCAACCCTTG-TAG 

ATCG 426 

I I I I III I I I I I I I I I I I I I II III 

Db 1861 

AGCACAACATAAGGGCACCTGTTACATATGAAGTGAGCAAAACATACTAGCATTTTCTAT 18 02 

Qy 42 7 GTGGTAAGTATGGAAACCCTCTAAGAACAGTG 

CAGCGTATGTGGTATTCAGA 478 



Db 1801 

ATGCATAATGGGGAAACCTGCATAGGTTAGAGGGCCTTTTACGCTCATTTAAAAATCAGG 17 42 

Qy 4 79 CTGGTTGCATACAGCATT CA AAAC C A 

GTG 507 



Db 1741 

CAAGTTGTCTGTAACATTTTTCAATAATCTGGGAAGCACTGCAATCCAGTGTACGATGTG 16 82 

Qy 50 8 CTGGAATAG CTT GCCC CAAAGTGGTAGAGTTATA 

AAAGGATA 5 49 



Db 1681 CT- 

GATTAGTGTTGTCTTTGTTGGCACTGACAGTCTTGCATGGTCATATGCCAGTGTTTT 1623 

Qy 550 TACATTGACGTTTCTT AAAAGCATGTGTAA 579 

I I I I I I I I I I I I I I I I I I I I 

Db 1622 TGCTTTAGCTTTTCTTTGAATAAAACAGATTTAA 1589 



RESULT 8 

US-10-478-914-60 

; Sequence 60, Application US/10478914
; Patent No. 7335755
; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 60
; LENGTH: 2022
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-60 

Query Match 42.1%; Score 1366.5; DB 1; Length 2022; 

Best Local Similarity 37.9%; Pred. No. 2.1e-81; 
Matches 369; Conservative 0; Mismatches 191; Indels 413; 
Gaps 45; 

Qy 1 CTTTTTTTTTTTT 

TTTTTTAAAGTC 25 

II I I I I I I I I I I I I I I I I I I I 

Db 743 

C T AAT TATTTATTTATATT G AAAT AAT TGAT T T AAC AT T T T T T AAAGTC AAAG T T C T GG A 8 02 

Qy 2 6 TT TAGTATATTTATTT 

GTATAAAGAGTAAACAAAGT 61 

II I I I I I I I I I II I I I I I I I I I I 

I 

Db 803 

GATAATTTGCCCAGTAGTACATTTTATTGCTGCAAGCAAAGTTTAAAGTGATATAATTGA 862 

Qy 62 GCATATAGAGTGGCC-ACAGGTT TGAC 

ACA 9 0 

II III III I I I I I I III III 

Db 863 GCC- 

AAAGTATTGCTGACAAGTTATTTCAAGCATGTCATTTACATTACTTTGTTATTTGT 921 

Qy 91 — GAGACCTTGGTG 

ATGTAGGCTATGAACAAATTTAAATGGCAACTTC 136 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 922 GTGTGACGCAGGTGGAAGCTATAAAGGCCATGTGGATATTT 

TGGCACCTACTGTTCA 9 78 

Qy 13 7 ATTGCTGCCACTGAAC 

CAATCCTGAATTTGGGCT 170 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 979 

AGAGTTGGCTGCCCTTGAAAAGGAGGCGCAGACATCTTTCCTGCATCTTGGCTACCTTCC 103 8 



Qy 171 CAACAG- 

GTGAAAAGTA 186 



Db 1039 

T AAC C AGC T G T T C AG AAC C T T C T GAT T T T T AC AT T T AC TGAAT AAGAT T TGAGT AAT G AA 109 8 

Qy 187 A C AAT AT C A 

AACGAATACTAAACAGCA T 214 

I I I I I I II I I I I I I I I I 

Db 1099 

AGTCTGTAGTCTTAAAACTCTAAAACAGTTGTACTGCTTCCAAGCAGCAGTATTTATAGT 115 8 

Qy 215 AAC AAAAAG AT T T T CAGACTCTTG 

GTCATAAAGACCGTAATCGT 258 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1159 AAC-GTAAGCTATTAATGCTAACTCTTGCATGTCA- 

AG AAAC AT TAG T C T T AGG AAT T C T 1216 

Qy 2 59 TCACATTGAA 

TCAATGACTAAACATTTTTGA 2 89 

I I I I I I I I I I I I I I I I I II III 

Db 1217 

TCAAAAAATGGCATCCCAATGAAAATAAATTTGATGACTATATTTTCATGAAGGTTTGTG 12 76 

Qy 29 0 TTACCCAGCTACCTCCAAGCAA 

ACTGAAAACTGTCTAGTG 329 

III I I I I II I I I I I I I I 

I 

Db 1277 

TCTTATTTTAAAGTTATATTGATATATTTTTTCTATTTCTTTTTTAAGAACAGTATG— G 133 4 

Qy 33 0 GATCCTGAAGTCCA TAGTGCCTCTAGCCG- 

GGTCTTTCAAGTGTTGCACC 378 

II I I I I I I I III I I I I I I I I I I I I III 

I I I 

Db 1335 GCTTATGAAGTAGAATTTATGGGTATGTGAATCTGGCAGAGGACTTAC 

GTGGAACC 139 0 

Qy 3 79 ACAGGG 

TGA 387 

II II 

I I I 

Db 1391 

ACTCGGGAATATTCTAAAAGTAGGTTTTCAGATGGCTAAGGTTGTCTATGTGTATATTGA 145 0 

Qy 388 TGATTGA TGGTAAAAACAGGGA TC AACCC 

TTGTAGATCGG— 42 7 

I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 1451 

AGCTAGAGGAGAGTTGGAACATGAAGGGAAATTCGATGATCCCAATGTAGAAGAACTGCT 1510 

Qy 42 8 TGGTAAGTATGGAAACC CTCTAAG 

AACAGTGCAGCGTATGTGGTA 4 72 

I I I I I I I I I I I I I I I I I II I I I I II I 

Db 1511 

TGGTTAGTTTGGAAGCATGGAAGTTTTGAGGGAGTCAGTAAAGGTTCTGTATCTAAGGAC 15 7 0 



Qy 4 73 TTCAGACTGGT TGC ATACAGC 

ATTCAAAACCAGTGCT 509 

I I I I I I I III I I I I I I I I I I I I I I 

Db 1571 

TGATGACTGATGTGATGGTGCCAGTGAAGATGTATCTTCTTTTTATGACCCTTGCTTTCC 163 0 
Qy 510 -GGAATAGCTTGCCCC — 

AAAGTGGTAGAGTTATAAAAGGATATACATTGACGTTTCTTA 566 

I I I I I I I I I I I I II III I I I I I II 

I I I I I 

Db 1631 AGATGATGCATACCCATTAAAGT AGAACTAATCTGCTGTCCCT — 

ACAGGTCTTA 16 83 

Qy 567 AAAGCATGTGTAA 579

I I I I I III 
Db 1684 AAATCAAACTTAA 1696 



RESULT 9 

US-10-478-914-63/C 

; Sequence 63, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 63
; LENGTH: 1813
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-63 

Query Match 41.9%; Score 1360; DB 1; Length 1813; 

Best Local Similarity 36.6%; Pred. No. 3.4e-81; 

Matches 389; Conservative 0; Mismatches 187; Indels 486; 
Gaps 53; 

Qy 1 

CTTTTTTTTTTTTTTTTT 18 

I I I I I I I I I I I I I I I I I I 

Db 1791 

CTTTTTTTTTTTTTTTTTGGGGAGAACACTTTGCTTTGGCTATAAAAAAGGAGTTTCTGA 1732 

Qy 19 TAAAGTCT TTAGT ATATT- 

TATT 4 0 



Db 1731 

AAATATACAATATGAGTACTATGGAGACTACTAGGATTAGTGCTGAATCCTTATTATTTT 16 72 



Qy 41 TGTATAAAG AGTA 

AACAAAGTG 62 

I I I I I I I III I I I I I I I I 

Db 1671 

TGTCTACAGTAAGGTGGGTAAGGTAATCCAACACAGTGGCAGCCAGCCAGCCATACAGTT 1612 
Qy 63 

CATATAGAGTGGC 75 

IIII I I II

Db 1611 

GTTAATTCTTATCACATACAAAATGTCTTTAGCATGGAATAAAACATTATACATTTCTAA 1552 

Qy 76 -CACAGG TTTG 

AC AC AG AG AC C T T GG T GAT G T AGGC — 110 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1551 

GCTCAGGCTGAATCCTTTGCAAATCTTTTAAAAGCACAGAAAAATTAAGTATTTAAAATG 1492 

Qy 111 TATGAACAAATT TAAATGGCA-ACTTCA 

TTG 140 

I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1491 

TTTGGACAAATATTTTCTCAAATAAGAAAATCTGAACTCTGAATGTCATACTCCAATTTA 1432 

Qy 141 CTGCC ACTGAACCAAT CCTG 

AATTTGGGCT 170 

I I I I I I I I I III III I I I I III 

Db 1431 CTACCCACAACACTAACTAAAATAATATACCAGCAGCATTTTTAGCTATATGATCAGAAT 1372

Qy 171 CA — ACAGGTGAAAAGTAACAATA 

T C AAAC G AAT AC T 205 

II III I I I I I I I I I I I I I I I II 

Db 1371 

ACTTGCATTACACTTGTAATGGAACCATACACATATTCTCTATCTCTCTCTGCCATAGAT 1312 

Qy 2 06 — AAACAGCATAACAAAAA GATTTTCAG ACTC — 

TTGGTC 241 

I I I I I I I I I III I I I I I I I I I I 

I I 

Db 1311 

AAATGCTGCATAACTTAAATACTGCTCTTCTGTGCTATTAATACAGCAACTCAAATATTC 12 52 

Qy 2 42 ATAAAGACCGTAATC GTTCACATT 

GAATCAATG 2 74 

I I I I I I I I I I I I I I I II 

I 

Db 1251 

AAAAACATAGTAAGCTGCAACCTGTGAATCTAGTATTTTCCCTCTTTTTACAACTCAAAG 1192 

Qy 2 75 ACTAAACAT TTTTGATTACC CAGC 

TACCTCCAAGCAAAC 313 

I III I I I I I I I I I I I I I I I I I I 

I 

Db 1191 GCATCCCATAGATGATGGTTACAAATCAGCACAGGATTAAAATAGTGCCCCCAA- 

CACAT 1133 

Qy 314 TGAAAACTG— TCTAGTGGAT CCTGAAGT — CCATAGTGCCT 

CTAGCCGGGTCT 363 



Db 1132 TAAAAGCTGACTCTGCTGGGTGTGGCTTAAGTAAACATCTCGTCTCAACTGGACAG- 

CT 1075 



Qy 36 4 TTCAAGTGTTGCA-CCACAGG GTGA 

TGATT 392 

I I I I I I I I I I I I I I I I 

Db 1074 

ACCCAGTAACACAGCCACAGAAACCGAAAATATGTGAGTACATAAGTAAAGTGCTTCTTT 1015 

Qy 393 GATGGTAAA AACAGGGATCAACC 

CTTGTAGATCGGTGGTAAGT 435 



Db 1014 

TGGAGGGGGTGGGAAACTAAGCCTCCATGTTTCATCTATGCTTGATGCTCAATTGTTTCT 955 

Qy 436 ATGGAAACCCTCTAAGA — ACAGTGC — AGCGTATGTG 

GTATTCA 4 76 

II I I I I I I I I I I I I I III II III 

Db 954 ATACTGTACTTCTGAAATGACAGTGCCTTGAGTCAATGCTGAACATTGTTCAAGAAACCA 895

Qy 477 — GACTGG T TG -CAT AC AGC AT 

TCAAAA 501 



Db 894 

AGGACAGAGAAATGTTACTTGTCCTACTTCCTCTCCTTTATTTTATGCTCCCTGTCTACA 835 

Qy 502 CCAGTGCTG GAATAGCTTGCCCCAAAGTGGTAG — 

AGTTATAAAAGG — 546 

II III I I I I I I I I I I I I I I III 

Db 834 

ACATGGCCGTTCTTACAAACATTTTGCTTTCATAAAAGAGGTAATCATTTAAATCAAGAC 7 75 

Qy 547 -ATATACATTG ACGTTTCTTAAAAG-CATGTGTAA 5 79 

I I I I I I I II I I I I I I I I I III 

Db 774 TATGTGCAATGCGAACTCACACTGCATTACAGTCATCAATAA 733 



RESULT 10 
US-10-478-914-62 

; Sequence 62, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 62
; LENGTH: 1946
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-62 

Query Match 41.9%; Score 1359.5; DB 1; Length 1946; 

Best Local Similarity 43.2%; Pred. No. 5.4e-81; 

Matches 369; Conservative 0; Mismatches 195; Indels 291; 
Gaps 55; 

Qy 1 CTTTT TTTTTTTTTTTTTTAAAGTCTTTAGT — ATATTT 

ATTTGTAT 45 



Db 424 

CTTTACACTGTTTGCTTTCTCTGGAACATTTTTAGTTAAAATTTCATAATGCAGTTGCAC 4 83 

Qy 46 A- AAG AG T AAAC AAAG TGCATATAGAGT 

GGCCACAGGTTT 8 4 

III I I I I I I I I I I I I II III I I I I 

Db 484 

ACAAATGAAGACACAGATGGCTGCATCCTCCGTCTCTTCCCCTCGTTTACAGGAAGCTGC 5 43 

Qy 85 GACACAGAG ACCTTGGTGATGTAGGCTATGA 

AC 117 



Db 544 

GGATCAGGGAGGGGTGTTAGGGTTACCCACATGGTAAGGGCAGAGACAAGAGGGGACCCC 6 03 

Qy 118 AAATTTAAATG — GCAACT — TCATTGCTG C CAC- 

TGAACCAATCCT 159 



Db 604 

AGTTTTCCATGCTGCACATGGTCATTGCTGGGGACTGAGGTTTGCACATCACCCTGCCCT 663 

Qy 16 0 GAATTTGGGCTCAACAGGTGA-AAAGT 

AAC AAT AT C AAAC GAAT ACT AAAC 2 09 



Db 664 G— 

TTCTCCCTCCGCTGGGGAGAAAGTCAGGGATGGAGCAAGCTGCAGCATCTTCTGAA- 72 0 

Qy 210 AGCATAACAAAAAGATTTTC-AGACTCTTGGT CATAAAGACC- 

GT 252 



Db 721 

AAAGAAAAGGTGGCCTTGTCTCCAGGTCTCCCCTCAAGTCCCACCTTCCCATAGT 7 75 

Qy 2 53 AATCGTTCAC ATTGA AT C AAT G ACT AAAC 

ATTT 285 

II III I I I I I I I I I I I I I I 

Db 776 

TTTCTGCCACTTCTCTTGATTTTCCTCTCTGCCACTTCTCTTGATTTTCCTCTCTGCCAC 835 

Qy 2 86 TTGA-TTACC CAGCTAC CTCC — 

AAGCAAACTGA 316 



I I I I I I I I I I I I I I I I I I I I I I I I I

Db 836 

TTCTCTTGATTTTCCTCTCTGCAGCTGCTTTGAGGTGGGTTTTCTCCAGATGCACACTTT 895 

Qy 317 AAACTG TCTAGTGGA 

TCCTGAAG 339 

III I I I I I I I I I I I I I 

Db 896 

CCCCTGCTTTGCGTCCTTATTCTGGTAGAAGCACAATCTAAAGCTCATTAAGGGAACTAA 955 

Qy 3 40 TCCAT — AGTGC CTCTAGC CGGGTCT-TTCAAGTGTTGCACCA 

CAGGG 384 

I I I I I I I I I I I I I I I I I I II I I I I I I II 

I 

Db 956 

TCAATTCTGTGCATGGCGCTAGCTCAGCAGATCACCACACAGGCAGCACTATTAGCAAGT 1015 

Qy 3 85 TGATGATTG ATGGTAAAAACAGGGATC AACCCTTGTAGATC — 

GGT 428 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1016 

CGGTGCTTAACACATGGCACTTCCATGAATCGATATGGAGCCCGTGTAGAACAAGGCATG 10 75 

Qy 42 9 GGTAAGTATGGAAACCCTCTAAG AACAG-TGC AGCGTATGTGGTA — 

TTC 475 

III II III I I I I I I I I I I I I I I I III 

I I I 

Db 1076 

GGTTTTTTTCTCTTCCCATTAAGAAAAACTGATGCCAAAAATAACTTCTCAGATATTTTC 1135 

Qy 4 76 A GAC TGGTTGCATACAGCATT 

CAAAACCAGTGCTGGAATAGC 517 

I III II I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1136 AAGTATGACTTTTATGAAGGGAAAAAGCATTTTTGTTTGCAAAATCA-TGCTTCAGT- 

GC 1193 

Qy 518 TTGCCCCAAAGTGGTAGAGTTATAAAAGGATATA CATTGAC- 

GTTTCTTAAA — 568 

III I I I I I I I I I I I I III I I I I I I I I I I 

Db 119 4 AGGCC AGTTGT- 

GAATTGTGATGGCTTTTATTTCTCCTGGGGCTGTAACTTTAAGG 12 4 8 

Qy 569 AGCATGTGTAA 579

II II II II 
Db 1249 GTTTAGAATTTGGAA 1263 

RESULT 11 
US-10-478-914-64 

; Sequence 64, Application US/10478914
; Patent No. 7335755
; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64
; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-64 

Query Match 41.8%; Score 1357; DB 1; Length 2120; 

Best Local Similarity 36.6%; Pred. No. 1.3e-80; 

Matches 345; Conservative 0; Mismatches 217; Indels 380; 
Gaps 42; 

Qy 1 CTTTTTTTTTTTT 

TTTTTTAAAGTCTTTAGTATATTTATTTGTATAAA 48 



Db 1130 CTTTATTGTTCTTGAAAGCCCCTGCTCTCTCTGAGCCTT — 

ATTTCATCATCTGTAAAAT 1187 

Qy 49 GAG — T AAAC AAAG T G - C AT AT AG AGT GGC C AC AGG T T T G AC 

ACAGAGACCTTGGTG 102 

II I I I I I I I I I I I I I I I I I 

Db 1188 

GGGAATGTCCTGAATGACTTCTAAGGCTCTTTCTGGCTTGAACTGTCAGAGCC 

Qy 103 ATGTAGGCTATGAACAAATTT — 

AAATGGCAACTTCATTGCTGCCACTGAACCAATC — C 158 



Db 12 41 AAGCCCACATCCCTCCTTGGGCAGGGCAGCAGC — 

TGCTGCCACAGCCTCCAGCGGC 1295 

Qy 159 TG — AATTTGGGCTC AACAGGT GAAAAGTAACAA- 

TATC 19 4 



Db 1296 

TGCCACTGTGGGCTCTGGGAGCCGGAGCGATGCTGTGTGAGAGGCAGAGTGCCAAGGATG 1355 

Qy 195 AAAC GAAT ACT AAAC AG 

C AT AAC AAAAAG AT T T 22 7 



Db 1356 

AAGCTGGCACTGAACAGTAAGCGGCTCCAGGCCTCCTCTGGGCCCAGGGCCCAGCCAATT 1415 

Qy 22 8 TCAGACTCTTGGTCATAAAGACCGTAATC— GTTCACATT GAATC- 

AATGACTAAA 280 



Db 1416 TCTGT-TC- 

TGTTCCTGTAGAACGCTCTCTGGATTCCATAGCTGGAATCTCCTCTCTTAG 14 73 






I I I I I I 
1240 



Qy 281 C- 

TCT 325 



- AT T T T T GAT T AC C C AGC TACCTCCAAGCAAACTGAAAACTG 



Db 1474 

CTCAGTGAAAAATAAAAATCCCAAATGGTGTGCCTACCTTCCCACTTCTTACTGGCTTCC 1533 

Qy 326 AGTGGATCCTGAAGTCCATAGTGCCTCTAGCC 

GGGT 361 

III I I I I I I I I I I I I I I I I I I I I I I I 

Db 153 4 AG- 

GAGTCTTGGAGTTCATAGCCCCCCGAGCCTGCCTTAAAGGGGTGTCCTCCACCCCCC 1592 

Qy 362 CTT TCAAGTGTT 

GC 375 

III I I I I I I I 

I I 

Db 1593 

ACCTACAGCTTCACAGGAGGGGAGAGGGCATCCAGTGCTAGGAGTAGAAGTGTCTCCAGC 1652 

Qy 3 76 AC CACAGGGTGATGAT 

TGATGGTAAAAACAGGGATCAACCC 416 

I I I I I I I I I I I I I I I I I I I 

I 

Db 1653 

TCTGTTCTCTTGGGGCCCTGGGTGAAGGTGGGGTCTGGGGCTTATGAAATAGGTCTGGGC 1712 

Qy 417 TTGTAGATCGGTGG TAAGTATGGAA ACC 

CTCTA 449 

I I I I I I I I I I I I I I I I II 

I I 

Db 1713 

TTTGAGGAGGATGGAGCAGCCTCATTATGTGGGGAAGATGGGGCCTCTGGGGCGTCACTG 17 72 

Qy 450 AGAACA GT GCAGC 

GTATGTG G 470 

I I I I I II I I I I I I I I I I 

Db 1773 

AGACCACAGGTGGGGCCGGGGCTGGACCGCAGCTGTCTTGGGTGCCTGTGCCTGCACCCC 1832 

Qy 471 TATTCAGACTGGT TGCA 

TACAGCATTCAAA 50 0 



Db 1833 TCCTCACCCTAGAGACGGAAGATGTGCAAAAAGAAAGAAGGAAGGGCAACTGCATTCCAG 1892

Qy 501 ACCAGTGCTG 

GAATAGCTTG 52 0 

I I I I I I I I I I I I 

Db 1893 

CCCCACACTGTGATGACTTTGAGCCTGTCCTTTCCCTCCTTGAGCCTGTCTTGCTTGTCC 1952 

Qy 521 CCCCAAAGTGGTAGAGTTA 

TAAAAGGATA 5 49 

I I I I I I I I I I II I I I I I I 

I 

Db 1953 

CCTGTAAAATGAACAGTCCCCCTCTCCCCCAAATAGTAATAATACATGTTTCAAAGGGTG 2 012 

Qy 55 0 TACAT TGACGTTTCTTAAAAGCATGTGTAA 5 79 

III I I I I III I I I I I I I 



Db 2 013 ACCATTTATAAAGCATATGACAAACCATATCAATAAATGTAA 2 054 



RESULT 12 

US-10-478-914-57/C 

; Sequence 57, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 57
; LENGTH: 2336
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-57 

Query Match 41.8%; Score 1356; DB 1; Length 2336; 

Best Local Similarity 39.3%; Pred. No. 2.6e-80; 
Matches 367; Conservative 0; Mismatches 198; Indels 368; 
Gaps 46; 

Qy 1 CTTTTTTTTTTTT TTTTTTAAAGT 

C T T TAG TATATTTATT 4 0 



Db 1372 

CTTTTACATTCCTTCCATATCACAGGTACTATGAAGTAAGGAGACTTTTAGGTTTCTTTT 1313 

Qy 41 TGTATAAAG AGTAAACA-AAGTGCATA TAGAGT — 

GGCCA — 7 7 

I I I I I I I I I I I I I I I I II I I I I I 

Db 1312 

TGTCTTAAACTCAGACAGCTTTGTAAGCAGTAGTGTGTAGATTACAAGAGTTAGACAAAA 12 53 

Qy 7 8 -CAGGTTTGACACAGAGACCTTGGTGATGTAG— GCTATG 

AAC 117 



Db 1252 

GCAGGCGCGACTGAGAAGAGTTGGTGGGGGAGAAGCTTGGGGCACTTCCTGTCACTCAAC 1193 

Qy 118 AAATT TAAATGGCAACTTCAT-TGC — 

TGCCACTGAACCAATC 157 

I I I I I I I I I I I I I I I I I I I I I II 

Db 1192 

ACATTCCAGATCACTAAAAAATTTCCACACCCTCTGCATTCCCCCTTGCCCACTCCAGTT 1133 



Qy 158 - 

CTCAACAGG- 177 



-CTGAAT TTGGG- 



Db 1132 

CCCGGTATTTTCTGATTCCATATGTTGTGGTATTTACCATACTTCTCTCCCTCACTAGGC 10 73 

Qy 178 — TGAAAAGTAACAATATCAAACG AATACT 

AAACAGCATAAC 217 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 10 72 TCTGGCAAG — 

ACTGCTTCAGAGGGGATGCATTCCTTTAGATTGCACAAAGCGGAGCTGG 1015 

Qy 218 -AAAAAG ATTTTCAGACTCTTGGTCATAAAGACCGTAATCGT 

TCACA 2 63 

I I I I I I I I I I I II I I I I I I I I I I I II

I I I I I 

Db 1014 GAAAATGGCTGGCAGTTTCAGAATCTAG 

TCACGATCGCACGCATGAGCACCTCACA 959 

Qy 26 4 TTGAATCAATGACTAAACATTTTTGATTACCCAGCTACCTCC 

AAGCAAACTG 315 

I I I I I I I II I I I I I I I I I I III III 

Db 958 CATCCATGTCCCTACCC 

GCCCCCCCGCTCCCGCCCCTGCAGCTGGCTGACCT 907 

Qy 316 AAAACTGTCTAG 

TGGATCCTG 336 

I I I I I I I I I I I I I I I I 

Db 906 

GTCTCACCCACTGCTGGCCTATCGAACGGCCAGGACTGTCTGGTTTTGGCTCGTGCCTTT 8 4 7 

Qy 33 7 AAGTC CATAG TGCCTCTA 

GCCGGGT 361 

I I I I I I I I I I I I I I I I II 

I I 

Db 846 

GTCCATGTCTGGCTTAGTTCCTCTCTGTCTATGCTTGCCTCTACCCCCACCGCCCCAGGC 7 8 7 

Qy 362 CTTTCAAGTGTT — GCACCAC 

AGGGTGATGATTGATGGTAAAAAC 404 

I I I I I I I I II III III I I I I I I I I I I I I I 

Db 786 

GGCACAAGTGTTTGGCCACACAAAACTAGAGATAGAAAAGGTGGTAAAAACTTCAAACTT 72 7 

Qy 405 AGGGATCAACC CTTGTAGA 

TCGGTGG 43 0 

I I I I I I I I I I I III 

Db 726 

TTCTAAATTCTCCAACAGTTTATTTCTTGTGAATTTCTTCCTTCTTTAAATACTCCATTT 66 7 

Qy 431 TAAGTATGGAAACC CTCTAAGAAC 

AGTGCAGCGTATGTG 469 

I I I I I III I I I I I I I I I I I I I I I I 

Db 666 TAAGAAAACAAAAAAATTAATTATCTAAAGGCAAAGAATGGAAAGCAACCTTTGTGTTCC 607

Qy 470 G T AT T C AG AC T GG T T GC AT AC AGC AT T C AAAAC C AG T GC T GG AAT 

AGCTT 519 

III I I I I I I I I I I III II I I I I I 

I I I I I 

Db 606 



TTATAATAACTGACTTCATAACTCTCTCCAGCTGCGTTATGGGATGTGTATAAAAAGCTT 5 4 7 



Qy 52 0 GCCCCAAAGTG G TAG AG T T AT AAAAG G 

ATATACATTGAC 55 8 

I I I I I I II II I I I I I I I I I I 

Db 546 

CTGTTCTGAGAACAAAGGAGCACGTGCAGAAATGAGACGAAAAAATCCACTGACAGTATT 4 8 7 

Qy 559 GTTTCTTAAAAGCATGT-GTAA 579 

I I I I I I I I I I I I I I I 
Db 486 CCATTACACAAATTACTTAAAAGATTTTAGTCA 454



RESULT 13 
US-10-478-914-4/C 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4
; LENGTH: 3210
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-4 

Query Match 41.5%; Score 1347; DB 1; Length 3210; 

Best Local Similarity 37.3%; Pred. No. 6.5e-79; 

Matches 384; Conservative 0; Mismatches 188; Indels 458; 
Gaps 56; 

Qy 1 CTTTTT TTTTTTTTTTT TTAAAGTC 

TTTAGTATA 3 4 



Db 2519 

CTTTATGCGCTGAGGTTTTAGTTATACAAAAACCCCTTCCAGTCAAACAGAAAAAATTAA 2 46 0 

Qy 35 TTTATTTG T AT AAAGAGT AAAC AAA GTGC — 

ATATAGAGTGG 7 4 

III II I I I I I I I I I I I III I I I I 

Db 2459 

TTCAAATATGGAAAGAAAAATCAAACTTTCAGTATTTTCGTGACTATCTTGAGTAGTTCA 2 4 0 0 

Qy 75 CCACAGGTTTGACA 

CAGAG— 93 

III I I I I I I 

Db 2399 TTTTCCAACCTCTTAGACAGCTGACAGACACCACTAACCTGATCACCTATTTCAGAGCTT 2340



Qy 9 4 ACCTTGGTG ATGTAGG 

CTA 112 

I I I I I I I I I I I I 

Db 2339 CAAAGATCCTTAATTTCCCCAAAGACTTCCTGAGCAAAAGGACTACATGTGGGAAATCAA 2280

Qy 113 TGAAC AAATTTAAA TGGCAAC- 

TTCATTGCTGCCACTG 149 

I I I I I I I I I I I I III III I I I I I I I I I 

Db 2279 

TGAAAACAGGAATGAGAAGAGAAAGTTAAAAATTGGTTGCTTTTGTTCCTGTCATTGGGG 2 2 2 0 

Qy 150 AACCAATCC-TGAATTTGGGCTCAACAGGTG — 

AAAAGTAACAATA 192 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2219 ACTTTTGGTCTTCTAAACATTCCTTGAGTTC 

CTCCACATGAGTTTTTAATATCA — A 2165 

Qy 193 

T C AAAC G AAT AC T AAAC AGC AT AAC AAAAAG AT T T TC AG AC T C T T GG T CAT AAAG AC — C 2 5 0 

I I I I I I I I I I I I I I I I I I I I I I I I I II II II 

I 

Db 2164 

TCTAGCAAAAATTAAAACAAACAAAAAAAGGCCTCTGAAAGTCAGGATACTGACCACCTC 2105 

Qy 251 GTAATCGTTCACATT GAAT — CAATGACTAAACATTT — 

TTGATTAC 293 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2104 GGCATCTCTCCCCTTCCCCACTGCTTGCGAATGCCAGTG- 

CTGGCCTGTTCCTTAAACGC 2 0 46 

Qy 29 4 -CCAGCT — ACCTCCA-AGCAAACTGAAAACTGT-CTAGTGG 

ATCCTGAAG 339 

I I I I I I I I I I I I I I I III I I I I I I I I I I I 

I 

Db 2045 

ACCAGCTCATCCCCCAGAGCAGCCTGTGTGCTCTGCCAGCGACTGCTACTTCTCCTCCGG 19 86 



Qy 3 40 TC CATAGTGCCT CTAGCCGGGTCTTT 

CAAGTGTTGCA 3 76 

I II III I I I I I I I I I I I I I I I I 

I I I 

Db 19 85 CCTCAGCTCAACATCCCAGCTATGCTACCCTGGGCTATGTGAGCTTGGGCAAGT- 

TAGCA 192 7 

Qy 3 77 CC ACAG GGTGATG 

ATTGA 39 4 

II I I I I I I I I I I I II 

I I 

Db 1926 

TACATCTCTAGGCCTCTCTTTCCATGGATACAGAGAAGATGGTGATGCCTACTTCATAGA 186 7 

Qy 395 TGGTAAAAA CAG 

GGATCAACCC T 417 

I I I I I I II I I I I I I I 



Db 1866 

GTTCTTATAAGAATTCCCTGAGGCTAGGCATGGTGGCTCACACCTGTAATCCCAGCACTT 18 0 7 



Qy 418 TGTAGATC — GGTGGTAAG 

TATGGAAACC 444 

I I I I I I I I I I I I I I I I I I II 

Db 1806 

TGGAGGTCAAGGTGGGAAGATCAATTGAGCCCAGGAGTTCAAGACCAGCCTGGGCAACAT 17 4 7 

Qy 4 45 CTCTAAGAACAGTG 

CAGCGTAT 466 

I I I I I I I I I I I III 

Db 1746 

AGCGAGAGTGCATCTCTACCAAAAGTGATTTTTTTGATTTTTTGGGACAGAGTCTCGCTC 16 8 7 

Qy 46 7 GTGGT 

ATTCAGACT G 481 



Db 1686 

TGTCACCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACTGCAACTTCTGCCTCCCAG 162 7 

Qy 482 GTTGCATACAGCATTCAAAAC-CAG TGCTGGAATAGC — TTGCC — 

CCAA 526 



Db 1626 GTT- 

CAAGCAGCTCTCATGTCTCAGCCTCCCGAGTAGCTGGGATTACAGATGCCTGCCAC 156 8 

Qy 52 7 AG T GG T AG AG T T AT AAAAGG AT AT AC AT T GACG TTTCT- 

TAAAAGCA 572 

I I I I II I I I I II III I I I I I I 

Db 1567 

CATGCCAGGCAAATTTTGGGTTGTTTTTTTTGAGATGGAGTCTTGCTCTATCAGCAGGCT 150 8 

Qy 573 — TGTGTAAT 580 

I I I I I 

Db 1507 GGAGTGCAGT 1498



RESULT 14 

US-10-478-914-60/C 

; Sequence 60, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 60
; LENGTH: 2022
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-60 



Query Match 41.5%; Score 1345.5; DB 1; Length 2022; 

Best Local Similarity 39.3%; Pred. No. 6.9e-80; 

Matches 370; Conservative 0; Mismatches 201; Indels 371; 
Gaps 50; 

Qy 1 CTTTTTTTTTTTTTTTTT 

T AAAG T C T T TAG TATATTTATTTG T AT AAAGAGT AA 5 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2000 CTTTTTTTTTTTTTTTTTAGATGGAGT- 
TTTGCTCTTTTGCCCAGGCTGGAGTGCAATGG 1942 

Qy 55 AC AAAG T — GCATATAGAGT 

GGC 75 

I II I III III 

I I 

Db 1941 

CGCGATCTCGGCTCACTGCAACTTCCGCCTCCTGGGTTCAAGCTATTCTCCTGCCTCAGC 18 82 

Qy 76 CAC AGG T T T G AC AC - AG AG 

ACCTTGGTGATGT 106 

II I I I I I I I I I I I I I I I I I I I 

Db 1881 

CTCCTGAGTAGCTGAGGTGAGAGGACTGACGCGAGAGGATTGCTTGAGCCTAGGAGGTTG 182 2 

Qy 107 AG GCTATGAACAAATTTAAATGGCAACT TCATTGCT 

GCC 145 

II I I I I I I III I I I I I II III I 

I 

Db 1821 AGGTTGCAGTGAGCTGTGTTCAC — 

GCCACTGGACTCCAGGCTGGGTGACAAAGCAAGAC 1764 

Qy 146 ACTG AACC 

AAT 156 

III I I I I 

I I 

Db 1763 

C C T G T C T C AAAAAC AAAAAC AAAC AAAC AAAC AAAC AAAAAAAT AAC C AAC AAAAAAAAC 17 0 4 



Qy 157 CCTGAATTT 

GGGC 169 

I I I I I I I I III 

Db 1703 

C C T GAG T T T AAG T T T GAT T T T AAG AC C TGT AGGGAC AGC AGAT TAG T T C T AC T T T AAT GG 16 4 4 

Qy 170 TCAACAGG TGAAAAGTA ACAATATCA — 

AAC G AAT 2 02 

I I I I I I I I I I I I I III III 

Db 1643 

GTATGCATCATCTGGAAAGCAAGGGTCATAAAAAGAAGATACATCTTCACTGGCACCATC 15 8 4 

Qy 2 03 AC T AAAC AG - C AT AAC AAAAAGA 

TTTTCAGACTCTTGGTCATAAAGACCGT- 2 52 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 15 83 AC AT C AG T C AT C AG T C C T TAG AT AC AG AAC C T T T AC T G AC T C C C — 

TCAAAACTTCCATG 1526 



Qy 2 53 AATC GTTC ACATT- 

GAATCAATGACTAAACATTTTTGAT — 29 0 

III I I I I I I I I I I I I I I III II 

Db 1525 

CTTCCAAACTAACCAAGCAGTTCTTCTACATTGGGATCATCGAATTTCCCTTCATGTTCC 1466 

Qy 291 TACCCAGCTACCTCC- 

AAGCAAACTGAAAACTGTCTAGT 32 8 



Db 1465 

AACTCTCCTCTAGCTTCAATATACACATAGACAACCTTAGCCATCTGAAAACCTACTTTT 14 06 

Qy 329 GGA TCCTGA AGTCCA — TAGTGCCTCTAGCCGGGT 

AAGTGT 3 72 

II I I I I I I I I I II I I I I I I I I I I 

Db 1405 AGAATATTCCCGAGTGGTTCCACGTAAGTCCTCT- 

GCCAGATTCACATACCCATAAATTC 13 47 

Qy 3 73 TGCACCACAGG GTGATGATTGATGGTA AAAACAGGGATCAA 

CCCTT 418 



Db 1346 

T AC T T C AT AAGC C CAT AC T GT TC T T AAAAAAGAAAT AGAAAAAAT AT ATC AAT AT AAC T T 12 8 7 

Qy 419 GTAGATCGG TGGTAAGTATGGAAACC — 

CTCTAAGAACAGTG 458 

I I I I I I I I I I I I I III II 

Db 1286 

TAAAATAAGACACAAACCTTCATGAAAATATAGTCATCAAATTTATTTTCATTGGGATGC 122 7 

Qy 459 CAGCGTATGTGGTATTC AGACTG — GTT GCATAC 

AGCATTCAAAA 501 



Db 1226 

CATTTTTTGAAGAATTCCTAAGACTAATGTTTCTTGACATGCAAGAGTTAGCATTAATAG 116 7 

Qy 502 C CAGTGCTGGAATAGCT — TGC 

C C C AAAG T GG T AG AG T T AT AAAAG 5 45 



Db 1166 

C T T AC G T T AC T AT AAAT AC TGCTGCTT GGAAGC AGT AC AAC T G T T T TAG AG T T T T AAG AC 110 7 

Qy 5 46 GATATACATTGACGTTTCTTAAA AGCATGTGTAA 5 79 

I I I I I I I I I I I I I I III I I I I I 

Db 1106 T AC AG AC TTT — CAT T AC TC AAATC T T AT TC AGT AAAT G T AA 106 7 



RESULT 15 
US-10-478-914-63 

; Sequence 63, Application US/10478914 
; Patent No. 7335755 
; GENERAL INFORMATION: 
; APPLICANT: NAKAGAWARA, AKIRA 
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 



CTTTC 

III I 



CURRENT APPLICATION NUMBER: US/10/478,914
CURRENT FILING DATE: 2003-11-26 
PRIOR APPLICATION NUMBER: PCT/JP02/05294 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-162775 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-255226 
PRIOR FILING DATE: 2001-08-24 
NUMBER OF SEQ ID NOS : 417 
SOFTWARE: PatentIn version 3.2
SEQ ID NO 63 

LENGTH: 1813 

TYPE: DNA

ORGANISM: Homo sapiens 
US-10-478-914-63 

Query Match 41.5%; Score 1345; DB 1; Length 1813; 

Best Local Similarity 44.8%; Pred. No. 4.2e-80; 

Matches 370; Conservative 0; Mismatches 180; Indels 276; 
Gaps 54; 

Qy 1 CTTTTTTTTTTTTTTTTTTAAAGTCTTTAGTATATTTATT 

TGTAT- 45 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 570 CTTTTAGTTTTTATATTTTC 

TCTGTTGTACATTTGTTGCAGCTCCTGCCAATGTCTC 626 

Qy 46 AAAG AG T AAAC A AAGTGCATATAG AGTGGC-CAC 

AG 8 0 

III II II I I I I I I I I I I I I I 

I I 

Db 627 

TTTCCCTATATTGTTCATTTTATAATAGCAGAAAACATTAGTAGCGCACTCTGTGGTGAG 6 86 

Qy 81 GTTTG ACACAG 

AGACCTTGGTGATGTAGGCTAT G AAC AAAT T TAAATGGCA 131 

III I III I I I I I I I I I I I I I I I I I I I I I I 

Db 687 ATATGGAAAATCAGTAGAATGAACCTTGG AGTTGGCACT — 

TCATATTTTATTGATG 741 

Qy 132 ACTTCATTGCTG C-CACTGAACC-AATCCTGAATT 

TGGGCTCAAC 174 

III I I I I I I I I I I I I I I I I I I I I II I I 

Db 742 

ACTGTAATGCAGTGTGAGTTCGCATTGCACATAGTCTTGATTTAAATGATTACCTCTTTT 8 01 

Qy 175 AGGTGAAAAG-TAACAATATCAAACGAATACTAAAC 

AGCATAACAAAAAG— 2 23 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 802 

ATGAAAGCAAAATGTTTGTAAGAACGGCCATGTTGTAGACAGGGAGCATAAAATAAAGGA 861 

Qy 224 

ATTTTCAGACTCTTGGTCATAAAGACCGTAATCGTTCA- 261 

I I I I I I I I I I I I I III I 

I I I I I 

Db 862 GAGGAAGTAGGACAAGTAACATTTCTCTGTC-CTTGGT-TTCTTGAAC 

AATGTTCAG 916 



Qy 262 CATTGAATCAATG-ACT 

AAACATTT 2 85 

I I I I I I I I I I I I I I I I I I I I I 

Db 917 

CATTGACTCAAGGCACTGTCATTTCAGAAGTACAGTATAGAAACAATTGAGCATCAAGCA 9 76 

Qy 2 86 TTGATTACCCAGCTACCTCC 

AAGCA 310 

II I I I I I I I I I I I I I I I I I 

Db 9 77 TAGATGAAACATGGAGGCTTAGTTTCCCA- 

CCCCCTCCAAAAGAAGCACTTTACTTATGT 1035 

Qy 311 AACT-GAAAACTGTCTAGTGGATCC- 

TGA 33 7 

I I I I I I I I I I I I I I I I 

I I 

Db 1036 

ACTCACATATTTTCGGTTTCTGTGGCTGTGTTACTGGGTAGCTGTCCAGTTGAGACGAGA 1095 

Qy 33 8 AGTCCA-TAGTGCCTCT AGCCGGGTC TTTCAAGTGTT 

GCACCA 3 79 

II II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1096 

TGTTTACTTAAGCCACACCCAGCAGAGTCAGCTTTTAATGTGTTGGGGGCACTATTTTAA 1155 

Qy 380 — CAGGGTGATGATTGATGGTAAAAACA 

GGGATCAACCCTTGTAGATCGGTGGTA 432 

I I I I I I I I I II I III I I I I I I I I I I I I I II 

I 

Db 1156 TCCT — GTGCTGATT — TGTAACCATCATCTATGGGAT — GCCTTTG-AGTT — 

GTAAAA 1206 

Qy 433 AGTATGGAAACCCTCTAAGAACAG — TGCAGCGTA-TGTGGTATTCAGACTGGTTGC- 

AT 488 

II I I I I II I I I I I I I I I I I I I I I I I I I III III 

I 

Db 12 0 7 AGAGGGAAAATACTAGATTCACAGGTTGCAGCTTACTATGTTTTT — 

G AAT AT T T GAG T T 126 4 

Qy 489 ACAGCATTCAAAAC CAGTGCTGGAAT — AGCT-TGCCCCA 

AAGTGGTAGAG 536 

I I I I I I I I I I I I I I I I I I I I I II III 

I I I I 

Db 1265 

GCTGTATTAATAGCACAGAAGAGCAGTATTTAAGTTATGCAGCATTTATCTATGGCAGAG 132 4 

Qy 53 7 TTATA-AAAGGATATACAT-TGACGTTTCTTAAAAGCATGTGTAAT 58 0 

I I I I I I I I I III I I I I I I I I I I I I I I I 
Db 132 5 AG AG AT AG AG AAT AT G T G T ATG — GTTCCATTA CAAGTGTAAT 1365 



Search completed: March 9, 2010, 15:52:08 
Job time : 3.08146 secs

GenCore version 6.3 
Copyright (c) 1993 - 2010 Biocceleration Ltd. 
Run on: March 9, 2010, 15:51:16 ; Search time 1 Seconds
(without alignments)
updates/sec
Title: US-10-572-905-22
Perfect score: 6518 

Sequence: 1 gaattccctgaggaggcgaa...taagagaacattataattgc 1044

Scoring table: BLOSUM62 

Gapop 10.0, Gapext 0.5

Searched: 417 seqs, 79705 residues 

Total number of hits satisfying chosen parameters: 834 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100%
Listing first 45 summaries 

Database : 7335755.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
c 42   1939    29.7    816  1  US-10-478-914-38    Sequence 38, Appl
  43   1930    29.6    816  1  US-10-478-914-38    Sequence 38, Appl
c 44   1914    29.4    738  1  US-10-478-914-32    Sequence 32, Appl
c 45   1911.5  29.3    717  1  US-10-478-914-42    Sequence 42, Appl
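The summary rows above can be read mechanically. The following is a minimal Python sketch, not part of the search program's output. The column names (result number, score, query match percentage, length, database index, sequence ID, description) are assumptions inferred from the values, since the column header line is not present in this listing, and the leading "c" is assumed to mark a hit on the complementary strand (compare the "/C" suffix on the corresponding RESULT entries). `SummaryRow` and `parse_summary_row` are hypothetical names.

```python
import re
from typing import NamedTuple


class SummaryRow(NamedTuple):
    complement: bool        # assumed meaning of the leading "c" flag
    result_no: int
    score: float
    query_match_pct: float
    length: int
    db: int
    seq_id: str
    description: str


# Optional "c" flag, then six whitespace-separated fields, then free text.
ROW_RE = re.compile(
    r"^\s*(c)?\s*(\d+)\s+([\d.]+)\s+([\d.]+)\s+(\d+)\s+(\d+)\s+(\S+)\s+(.*)$"
)


def parse_summary_row(line: str) -> SummaryRow:
    """Parse one summary row; raise ValueError if it does not match."""
    m = ROW_RE.match(line)
    if m is None:
        raise ValueError(f"unrecognised summary row: {line!r}")
    flag, no, score, qm, length, db, seq_id, desc = m.groups()
    return SummaryRow(flag is not None, int(no), float(score), float(qm),
                      int(length), int(db), seq_id, desc.strip())


row = parse_summary_row(
    "c 45 1911.5 29.3 717 1 US-10-478-914-42 Sequence 42, Appl")
print(row)
```

The regular expression tolerates any amount of whitespace between fields, so it works on both the aligned and the flattened form of a row.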

ALIGNMENTS 



RESULT 1 

US-10-478-914-64/C 

Sequence 64, Application US/10478914 
Patent No. 7335755 
GENERAL INFORMATION: 
APPLICANT: NAKAGAWARA, AKIRA 

TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
FILE REFERENCE: 7388-80899 

CURRENT APPLICATION NUMBER: US/10/478,914
CURRENT FILING DATE: 2003-11-26
PRIOR APPLICATION NUMBER: PCT/JP02/05294
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-162775 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-255226 
PRIOR FILING DATE: 2001-08-24 
NUMBER OF SEQ ID NOS : 417 
SOFTWARE: PatentIn version 3.2
SEQ ID NO 64 
LENGTH: 2120 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-478-914-64 

Query Match 45.5%; Score 2967; DB 1; Length 2120; 

Best Local Similarity 38.7%; Pred. No. 5.6e-188; 
Matches 716; Conservative 0; Mismatches 315; Indels 820; 
Gaps 109; 
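
The percentages in the statistics block above appear to follow from the other reported figures. The following is a minimal Python sketch under two assumptions inferred from the numbers in this listing, not from the search program's documentation: Query Match is taken as Score divided by the Perfect score (6518 for this query), and Best Local Similarity as Matches divided by the sum of Matches, Mismatches, and Indels. The function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect score (assumed definition)."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Matches as a percentage of all aligned columns (assumed definition)."""
    return 100.0 * matches / (matches + mismatches + indels)


# Figures reported for RESULT 1 (US-10-478-914-64/C) in this listing.
qm = query_match_pct(2967, 6518)
bls = best_local_similarity_pct(716, 315, 820)
print(f"Query Match {qm:.1f}%; Best Local Similarity {bls:.1f}%")
```

Under these assumptions the sketch reproduces the 45.5% and 38.7% reported for this result, and the same check can be applied to the other results to catch digits damaged in transcription.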

Qy 4 TTCC C TGAGGAGGCGAATCCGG — CGGGTATCAGAGCCA 

TCAG 4 4 

I I I I I I I I I I I I I I III I I I I I I I I I 

I I I 

Db 1852 TTCCGTCTCTAGGGTGAGGAGG- 

GGTGCAGGCACAGGCACCCAAGACAGCTGCGGTCCAG 1794 

Qy 45 AACCGCCACCATGACGGTGGGC- 

AAGAGCAGCAAGATGCTGCAGCATATTGATTACAGGA 103 

I I I I I I I I I I I I I III II II II I I I I I 

I 

Db 1793 CCCCGGCCCCA — CCTGTGGTCTCAGTGACGCCCCA- 

GAGGCCCCATCTTCCCCACATAA 173 7 

Qy 104 TGAGG TGCATCCTGCAGGACGGCCGGATC TTCATTGG 

CACCTT 146 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 



Db 1736 

TGAGGCTGCTCCATCCTCCTCAAAGCCCAGACCTATTTCATAAGCCCCAGACCCCACCTT 16 7 7 

Qy 147 CAAGGCTTTTGACA AGCAC 

ATGAATT 172 



Db 1676 

CACCCAGGGCCCCAAGAGAACAGAGCTGGAGACACTTCTACTCCTAGCACTGGATGCCCT 1617 

Qy 173 TGATCCT-CTGTGA— CTGT GATGAGTTCA 

GAAAGATC— A 208 

I I I I I I I I I I I I I I III II II 

Db 1616 

CTCCCCTCCTGTGAAGCTGTAGGTGGGGGGTGGAGGACACCCCTTTAAGGCAGGCTCGGG 155 7 

Qy 2 09 AGCCAAAGAACTCCAA-AC AAG - C AG AAAG G G AAG AG AAG C G AG T C - 

CTCGGTC 259 



Db 1556 

GGGCTATGAACTCCAAGACTCCTGGAAGCCAGTAAGAAGTGGGAAGGTAGGCACACCATT 149 7 

Qy 26 0 TGG TGCT — GCTGCGAGGGGAGAATC 

TGG 286 

III II III I I I I I I I I I I III 

Db 1496 

TGGGATTTTTATTTTTCACTGAGCTAAGAGAGGAGATTCCAGCTATGGAATCCAGAGAGC 143 7 

Qy 287 — TCT-CAATGACAGTAGA GGGACCTCCTCCCAAAGAT — 

ACTGG 326 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1436 

GTTCTACAGGAACAGAACAGAAATTGGCTGGGCCCTGGGCCCAGAGGAGGCCTGGAGCCG 13 7 7 

Qy 327 — TATTGCTCGAGTTCCA CTTG CTGGAGCTGC- 

CGG 359 



Db 1376 CTTACTGTTC- 

AGTGCCAGCTTCATCCTTGGCACTCTGCCTCTCACACAGCATCGCTCCG 1318 

Qy 36 0 GGGCCCAGGGATC GGCAG GGCTGCTG 

GCAGAGGAATCCCAGC 401 

I I I I I I I I I I I I I II I I I I I I I I I 

Db 1317 

GCTCCCAGAGCCCACAGTGGCAGCCGCTGGAGGCTGTGGCAGCAGCTGCTGCCCTGCCCA 12 5 8 

Qy 402 TGGG GTTC CCA 

TGC 415 

I I I I I I I I III II 

Db 1257 

AGGAGGGATGTGGGCTTGGCTCTGACAGTTCAAGCCAGAAAGAGCCTTAGAAGTCATTCA 119 8 

Qy 416 CCC AGGCTCC TGCA- 

GGACTT 435 

III I I I I I I I I I I I I 

Db 1197 

GGACATTCCCATTTTACAGATGATGAAATAAGGCTCAGAGAGAGCAGGGGCTTTCAAGAA 113 8 



Qy 436 GCTGGG CCAGTCCG- 

TGGGGTTGGCGGGCCATCCCA 470 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1137 

CAATAAAGGTGGAGACTGGAACTCAGGCCTCCTGTCTGCTGAGGGTGTCTGCCCATGCTC 10 7 8 

Qy 471 AC AGGTGATGAC 

CCCACAAGGAAGAGGTAC- 50 0 

I I I I I I I I I I I I I I I I I I 

I 

Db 1077 

TCTCTCAGAGTTTCTGAGTGGTGGTGTCTGGAGGGGCTGGACACGCAAGGGAGCTGGCCT 1018 

Qy 501 TGTTGC AGCCG CTGCAGCTGCTGCCAC 

AGCCA— G 533 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1017 

CGTGTGTGTCTGTCTGTGCCGGGATGTGTCTGTGGCTTCTGTCTCTGAACGTGCCAGGGG 958 

Qy 53 4 TATTG CC GGGGCTCC 

AAC 551 

I I I I II I I I I I I I III 

Db 957 

TTTTGTGTCCTGTGTGTGTCTCTCTGTGGGTGCTCCTGTGTGCAGGAACTTGGGGCCATG 89 8 

Qy 552 -CCAGT ACCCA— CCT— 

GGCCGTGGGGGTCCTCCCCC 584 

I I I I III III I I I I I I I I I I I I 

Db 897 

GCCAGAGAGGGATGGGCCTATTCCTACAATCTTGGGAGGCCTCCACAGGCTGAGCTGGTG 83 8 

Qy 585 ACCT-ATGGGCCGA GGAGC 

ACCCCC T 609 

I I I I I I I II II I I I I I I 

Db 837 

GGGAGGGTCCT G AAG G G TAG AG T G AAG AT GGGG AC T T T AC TCCCAGCGGAGAG G AAG AAA 7 7 8 

Qy 610 CCA GGCAT 

GATGGG 623 

III I I I I I I I I I I I 

Db 777 

CCAGAAGATAGGCTTGGGGCATGGATGGGAAAGGTCTTGGAACTAGGATGGGGCAAGGGA 718 

Qy 624 CCCACCT CCTGG 

TAT GAG AC CTCCTATG 651 

I I I I I I I I I I I I I I I I I I I 

I 

Db 717 

AACAAAGCCCCAAATGGAGTATGGGTCCCTGGAAGTTGACACACTCTTAGCCCTCATGGG 65 8 

Qy 652 GGTCC — CCCAATGGGGATCCC 

CCCTGGAAGAGGGACT 687 

I I I I I I I I I I I I I I I I II I I I I III 

Db 657 

GGCCCAGCCCAGTGGGTAACCCCGGGGAAGTGGTCACATGACGTGACACTTCTCAGCTGA 59 8 

Qy 688 CCAATG-GGCATG CC CCCT 

CC 707 

III II III I II III 



Db 597 

TTGCTTGAGATCCCCCAGTGTGGCCTGGGCCTTGTCCTTGAGTGAGCCCAGGTTCAAGCC 53 8 

Qy 70 8 GGGAATGC-GGCCTCCTCCCCCTGGGATGCGAGGCCTTCTT 

TGACC 752 

I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I 

Db 53 7 AGGAAGGCTGGCCAGCTGCCCAAGGACT — 

GAGGCCTTCTCCTCCTCCTCCTCTGTGTCC 480 

Qy 753 CTTGGCCA 

CAG 763 

I I I I I I I I III 

Db 479 

TCCTCGATCATCTTGGCCAGCTCCCGGGGCAGCTCCACGTCTCCACCTGCCATCTGGATC 42 0 



Qy 76 4 AGTAT 

GGAAGTAGCTCCGCAGA 785 

I I I I I I I I I I 

I I I I I I I I I 

Db 419 

TGGCTCTCATCTGTCTCGTTCTTGGGTAGCCGGTATTTGTCTCGGAAGTGGCTCCGCAGT 36 0 

Qy 786 GGCGTGGGCTCG— ATTCCTCAGGG CCACGTTAC CACAGAC- 

CT 826 

II I I I I I I I I I I I I I I I I I II I 

I 

Db 359 

GTGGCCCGCTCTGCCTTCCTCTGTGTGAACTGTGCATCCCGCTCCATCTTCTCTTCCACG 3 0 0 

Qy 82 7 GTTTGTTTCTTATGCT-GTTGT — TCGTGGAGTCTCATG 

GGA 865 

I I I I I I I I I I I III III I I I I I I I I 

Db 299 

AGTTGCTTCTGATACTCCTCGTACTCCTCCCGGCTCATGCCCTGAGCTTCGGCTGCCGAC 2 4 0 

Qy 866 TTGT-CTGGTTTCCCTTA-CAGGGCCCCCTCC 

CCCGGGAATG 9 05 

I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 239 

TTGTCCCCATCTCCCTTATCCTCGCCGCCTCCCAGGCTCCCAGTGAGGTTCTTCAGCTGG 18 0 

Qy 9 06 -CGCCCACCAAGG CCCTAGACTC ATCTTGGCCCTCCTCAGCTCC 

CTG 951 

I I I I I I I I I I I III III I I I I I I I II I I I I I 

I 

Db 179 

CCGCCCACCATGGTCTTCACCATGAACGCCATGGTCTTCGCCGGCCGGGGCACCGGGC-G 121 

Qy 952 CCTGTTTCC CGTAAGGCT — G T AC AT AG T C C 

TTTTATCTCCTTGTGGC 997 

I I I I I I I II I I I I I II I I I I I I I I I 

I I I 

Db 12 0 CCTGCTACCACGCGCCAGGCACCGGGCA- 

CGTCCGCGCCTACTTCAGCACTCCAGTAGGC 62 

Qy 99 8 CTATGAAACT GG T T T AT AAT AAAC TC T T A- AG AG AAC AT T AT AAT T GC 10 44 

II II I I I I I I I I I I I I I I I I I I I 

Db 61 CAACAGTGCTCGAGGAAT-TCCGCAGCTTTTAGAGCAGAAGTAACACTTCC 12 



RESULT 2 

US-10-478-914-64 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

CURRENT FILING DATE: 2003-11-26 

PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64

LENGTH: 2120
TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-478-914-64 

Query Match 45.5%; Score 2963.5; DB 1; Length 2120; 

Best Local Similarity 41.6%; Pred. No. 9.8e-188; 
Matches 672; Conservative 0; Mismatches 338; Indels 607; 
Gaps 96; 

Qy 1 GAATTCCCTG— AGGAGGC GAATC 

CGGCGGGTA 31 



Db 162 

GAAGACCATGGTGGGCGGCCAGCTGAAGAACCTCACTGGGAGCCTGGGAGGCGGCGAGGA 2 21 

Qy 32 TCAGAGCCAT CAGAACCGCCACCATGACGGTGGGCAAGAGCAG 

CAA 77 



Db 222 

TAAGGGAGATGGGGACAAGTCGGCAGCCGAAGCTCAGGGCATGAGCCGGGAGGAGTACGA 2 81 

Qy 7 8 GATG CTGCAGCATAT TGAT — TACAGGATG — 

AGGTGCATCCT 116 

II I I I I I I I III I II I I I II 

Db 282 

GGAGTATCAGAAGCAACTCGTGGAAGAGAAGATGGAGCGGGATGCACAGTTCACACAGAG 3 41 

Qy 117 GCAGGACGGCCGGATCTTCATTGGCACCTTCAAGGCTTTTG 

ACAAGCA 16 4 



Db 342 

GAAGGCAGAGCGGGCCACACTGCGGAGCCACTTCCGAGACAAATACCGGCTACCCAAGAA 4 01 

Qy 165 CATGA ATTTGATCCT CTG-TGACT GTGATGAG 

T T C AG AAAG AT C A 208 



Db 402 

CGAGACAGATGAGAGCCAGATCCAGATGGCAGGTGGAGACGTGGAGCTGCCCCGGGAGCT 461 

Qy 2 09 AGCCAAAGAACTCCA — AAC AAGC AG AAAGGG AAG AG AAG CGAGTCCTCGGT- 

CTGGTGC 265 



Db 462 GGCCAAGATGATCGAGGAGGACACAGAGGAGGAGGAGGAG- 

AAGGCCTCAGTCCTTGGGC 52 0 

Qy 266 TGCTG CGAGGGGAGAATCTGGTCTCAATGACAGTAGAGG 

GACCTC— 310 

I I I I I II I I I I I I I I I I I I I I III 

Db 521 

AGCTGGCCAGCCTTCCTGGCTTGAACCTGGGCTCACTCAAGGACAAGGCCCAGGCCACAC 5 8 0 

Qy 311 CTCCCAAAGATACTG GTATTGC-TCGAGT- 

TCCACTTGCTGGAGCTG 355 

I I I I I I II III III II I I I I I I I 

Db 581 

TGGGGGATCTCAAGCAATCAGCTGAGAAGTGTCACGTCATGTGACCACTTCC 632 

Qy 356 CCGGGG — GCCCA — GGGATCGGC AGGGCT 

GCTGGCAGAGG 392 



Db 633 CCGGGGTTACCCACTGGGCTGGGCCCCCATGAGGGCTAAGAGTGTGTCAACTTCCAG- 

GG 691 

Qy 393 AATC CCAGCTGGGG TTCCCATGCCCCAGGCT 

CCTG 427 

I I III I I I I I I I I I I I I I I I I I II III 

Db 692 

ACCCATACTCCATTTGGGGCTTTGTTTCCCTTGCCCCATCCTAGTTCCAAGACCTTTCCC 751 

Qy 42 8 CAGGACT TGCTGG 

GCCAGTCCGTGGGG 454 

I I I I I I I I I I II I I I I III 

Db 752 

ATCCATGCCCCAAGCCTATCTTCTGGTTTCTTCCTCTCCGCTGGGAGTAAAGTCCCCATC 811 

Qy 455 TTGGC GGGCCATCCCAA CAG 

G T GAT G AC C C C AC AAGG AAG 49 4 



Db 812 

TTCACTCTACCCTTCAGGACCCTCCCCACCAGCTCAGCCTGTGGAGGCCTCCCAAGATTG 8 71 

Qy 495 -AGGTACTGTTGCAGCCGCTGCAGCTGCTGCCACAGCCAGTATTGCC 

GGGGC 5 45 



Db 8 72 TAGGAATAG 

GCCCATCCCTCTCTGGCCATGGCCCCAAGTTCCTGCACACAGGAGC 92 6 

Qy 5 46 TCC AACCCAGTACCCA CCTGGC-CGTG 

GGGGTCCTC 580 

II I I I I I I I I I I I I I I I I I I I II 



Db 927 

ACCCACAGAGAGACACACACAGGACACAAAACCCCTGGCACGTTCAGAGACAGAAGCCAC 9 86 

Qy 581 CCC — CACCTATGGGC CGAGG — AGCAC 

CCCCT 609 

III III I II I I I I I I I I I 

I I I I I 

Db 987 

AGACACATCCCGGCACAGACAGACACACACGAGGCCAGCTCCCTTGCGTGTCCAGCCCCT 10 46 

Qy 610 CCAGGCA TG ATGGGC CCACCTC— 

CTGGTAT 63 8 

I I I I I I II I I I I I I I I I I I I I 

Db 1047 

C C AG AC AC C AC C AC T C AG AAAC T C T GAG AGAGAGC ATGGGC AGAC AC C C T C AGC AG AC AG 1106 

Qy 639 GAGACC TCCTATGGGTCCCCCAATGGGGATC 

CCCC 673 

I I I I I III I I I I I I I III I I I I 

Db 1107 GAGGCCTGAGTTCCAGT — 

CTCCACCTTTATTGTTCTTGAAAGCCCCTGCTCTCTCTGAG 116 4 

Qy 6 74 CTGGAA GAGGGACTCCAATG 

GGC 696 

I I I I I II I I I I I I I 

I I I 

Db 1165 

CCTTATTTCATCATCTGTAAAATGGGAATGTCCTGAATGACTTCTAAGGCTCTTTCTGGC 122 4 

Qy 69 7 ATG CC CCCTCC GGGAA 

TGCGGCCTCCT 723 

II II I I I I I I I I I I I I I I I I 

Db 1225 

TTGAACTGTCAGAGCCAAGCCCACATCCCTCCTTGGGCAGGGCAGCAGCTGCTGCCACAG 12 8 4 

Qy 72 4 CCCCCTG-GGATGC GAGGCCTTCTTTGACCCTTGGCCA 

CAGA 76 4 

I I I I I I I I I I I I I I I I I I I I I III 

I I I I 

Db 12 85 CCTCCAGCGGCTGCCACTGTGGGC- 

TCTGGGAGCCGGAGCGATGCTGTGTGAGAGGCAGA 1343 

Qy 765 GT ATGGAAGTAGCT-CCGCAGAGGCGTG GGCTC 

GATTCCTCAGGGCCACGT 814 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I 

Db 1344 

GTGCCAAGGATGAAGCTGGCACTGAACAGTAAGCGGCTCCAGGCCTCCTCTGGGCCCAGG 14 03 

Qy 815 TACCACAGACCTGTTTGT-TTCTTATGCTGTTGTTCG 

TGGAGTCTCATGGGATTG 86 8 

III II I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 140 4 GCCCA GCCAATTTCTGTTCTGTTCCTGTAGAACGCTCTCTGGATTC- 

CATAG 145 4 

Qy 869 TCTGGTTTC-CCTTACAGGGCCC CCTCCCCCG-GGAATGC— 

GCCCA 911 

I I I I I I I I I I III I I I I I I I I I I 



Db 1455 - 

CTGGAATCTCCTCTCTTAGCTCAGTGAAAAATAAAAATCCCAAATGGTGTGCCTACCTT 1513 



Qy 912 CCAA GGC — CCTAGACTC 

ATCTTGGCCC 937 

III III II I I I I I I I I I I I 

Db 1514 

CCCACTTCTTACTGGCTTCCAGGAGTCTTGGAGTTCATAGCCCCCCGAGCCTGCCTTAAA 15 73 

Qy 93 8 TCCTCAGCTCCCTGCCT GTTTCCCGTAAGG 

CTGTAC 9 73 

I I I I I I I I I III I I I I I III I I I I 

Db 1574 

GGGGTGTCCTCCACCCCCCACCTACAGCTTCACAGGAGGGGAGAGGGCATCCAGTGCTAG 1633 

Qy 9 74 ATAGTC CTTTTATCT CCT 

TGTGGC 997 

I I I I I I I I I I I III II 

I I I 

Db 1634 

GAGTAGAAGTGTCTCCAGCTCTGTTCTCTTGGGGCCCTGGGTGAAGGTGGGGTCTGGGGC 1693 

Qy 99 8 C T AT G AAAC T G G T T T AT AAT AAAC T C T T AAG AG AA CATTATAATTG 

1043 

I I I I I I I III I II I I I I I I I I I I I I I 

Db 169 4 TTATGAAATAGGT CTGGGCTTTGAGGAGGATGGAGCAGCCTCATTATGTGGG 

1745 



RESULT 3 

US-10-478-914-4/C 

Sequence 4, Application US/10478914 
Patent No. 7335755 
GENERAL INFORMATION: 
APPLICANT: NAKAGAWARA, AKIRA 

TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
FILE REFERENCE: 7388-80899 

CURRENT APPLICATION NUMBER: US/10/478,914
CURRENT FILING DATE: 2003-11-26
PRIOR APPLICATION NUMBER: PCT/JP02/05294
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-162775 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-255226 
PRIOR FILING DATE: 2001-08-24 
NUMBER OF SEQ ID NOS : 417 
SOFTWARE: PatentIn version 3.2
SEQ ID NO 4 

LENGTH: 3210 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-478-914-4 



Query Match 45.3%; Score 2953; DB 1; Length 3210;
Best Local Similarity 38.6%; Pred. No. 1.1e-185;
Matches 727; Conservative 0; Mismatches 288; Indels 870;
Gaps 118;



Qy 

AATCC- 



1 GAATTCCCTGAGG— AGGCG- 
23 



Db 1856 

GAATTCCCTGAGGCTAGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGAGGTCAA 179 7 

Qy 2 4 GGCGGGT — ATCA GAGCC ATCAGAACC-GC CACCATGACG- 

GTGGG 65 



Db 1796 

GGTGGGAAGATCAATTGAGCCCAGGAGTTCAAGACCAGCCTGGGCAACATAGCGAGAGTG 173 7 

Qy 66 CAAGAGCAGCAAGATGCTGCAGCATATTGATT ACAGGAT GAGGTG- 

CATC 114 



Db 1736 CAT-CTCTACCAAAAG-TG- 

ATTTTTTTGATTTTTTGGGACAGAGTCTCGCTCTGTCACC 16 8 0 

Qy 115 CT-GCAGGACGGCCG GATCT TCATTGGCA 

CCTTCAAGGCTTTT 156 

I I I I I I III I I I I I I I I I I I I I I I I I 

Db 16 79 CAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACT- 

GCAACTTCTGCCTCCCAGGTTCAA 1621 

Qy 157 G AC AAG C AC AT G AAT T T GAT C C T C T G T G AC T G T GAT GAG T T C AG AAAGAT - 

CA 208 



Db 162 0 GCAGCTCTCATGTCTCAG — CCTCCCGAGTAGCTGGGAT 

TACAGATGCCTGCCACCA 1566 

Qy 2 09 AGCCA AAGAACTC 

CAAACAAGCAGAAAG 236 



Db 1565 TGCCAGGCAAATTTTGGGTTGTTTTTTTTGAGATGGAGTCTTGCTCTATC- 

AGCAGGCTG 150 7 

Qy 23 7 GGAAG-AGAAGCGAGTCCTCGG TCTGGTGCTGCTG CGAG 

GGGAGAAT 2 82 



Db 1506 

GAGTGCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGACTCCCGGGTTCAAGCGATTCT 14 4 7 

Qy 2 83 CTGGTCTCAATGAC AGTAG-AGGGAC CTCCTCC 

CAAA — 318 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1446 

CTTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGCGCGCACCACCATGCCTGGCTAATT 13 8 7 

Qy 319 GATACTGGTATTGCTCGAGTTCCAC TTG— CTGGAG 

CTGCC 357 



Db 1386 

TTTGTGTTTTTAGTAGAGATGGGGTTTCACCCTGTTGGCCAAGAGGACCCCCAGCCTTCC 132 7 

Qy 358 GGGGGCCCAGGGATC GGCAGGGC 

TGCTGGCAGAGGAATCCCAG 40 0 



II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 I 1 1 1 1 

Db 132 6 CAAGCCATAGGGGGCTGGTGGGCAGGGCGGCACTTAGCTGCCTGGGGCA- 

CCCAGTACCT 126 8 

Qy 401 CTGGGG TTCCCATGCC CCAGG 

CTCC 425 

I I I I I I I I I I I I I I I I I I I 

Db 1267 

GTCATGAGGCACTTGCCCTGGCCAAAGGTCACTGCCAAGGCACAGCCACTCCAGAGAGTC 12 0 8 

Qy 426 TGCAG-GACT TGCTGGG 

CCA- 444 

I I I I I I I I I I I I I I 

I I I 

Db 1207 

AAGAGGAGCAGAGACTCCAGAGCAGATGGAAGGAGAGTGGGCTTCTGGGGATTGCACCAA 114 8 

Qy 445 GTCCGTGGGGTT — 

GG 458 

I III I I I I I I 

I I 

Db 1147 

GGAGGAAGCAGAAGGGATGTGCAGCAGCAGAAAAGGAGGAGGACCGGGGGGTTCTGAGGG 10 8 8 

Qy 459 C GGGCCA TCCCAACAGG 

TGATGAC 482 

I I I I I I I I I I I I I I I I I I I I I 

Db 1087 

CTTGGGGTGATGAGGCCAGAGTTCCAAACAGGGTCTGGGTGCAGGATGAGGACAGAAGAA 102 8 

Qy 483 CCCACAAGG 

AAGAGGT 498 

I I I I I I I I I I I I I I I 

Db 1027 

AACAGAGAACCCAAAAGGGGCAAGAGGTTGGTCCCTTACTTTCACTCTCCTGGGAGGAGG 96 8 

Qy 499 ACTGT TGCA GCC— 

GCTGCAGCTGCTGCCACA 52 8 

I I I I I I I I III I I I I I I I I I I I 

Db 967 

ATGGGTACACCACACAGTTGGATGGTGCATCTTTGCCTGTCTGCACCTGCAGCTCCATGG 9 0 8 

Qy 529 -GCCA GTATTGCCGGGGCTCCAAC- 

CCAG 555 

III II I I I I I I I I I I I I I I I 

Db 907 

CACCATGGACCCTGTGCTCCGAGTGGTGGTGCCTGATCTCCACCACCAGGCAGCAGTAGA 8 4 8 

Qy 556 TACCCACCTG 

GCCGTGGGGGTC — 577 

I I I I I I I I I I I I I 

Db 847 

GGCCGCTATCCAGCAGGGTCAGGTTGCGCATGGTGATGGAGAAGTTGCCATGGTGGTCGG 7 8 8 

Qy 578 CTCC— CCCAC — CTATGGGC CGAGG 

AGCACCC C 607 

I I I I III I I I I I I I I I I I I I I 

I 

Db 787 

AGGCCGACTCCAGCCCGTGGCGCTGAGCCAGGTCGTGGCTGGTGTTGGCAGCCTGGTGGC 72 8 



Qy 608 CTCCAGG CAT GATGGGCCCAC- 

CTC C 632 



Db 727 

CTCCATGGTGCAGGTGAAGGTCCTGGAACGTGAGGTTGCGGATGGGCCGGCGCTCTGAGC 66 8 

Qy 633 TGGTATGAGACCTCCTATGGGTCCCCCAA— TGGGGATCCCC- 

CCTGGAAGAGGGAC 6 86 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I 

Db 66 7 AGGTCTGC-ACCTC 

GCCCCTCGAGCTGCGGTACCACGTCTTGTAGAAGGTCACA 615 



Qy 687 TCCAATGG GCATGC 

CCCCT— 705 

I I I I I I I I I I I I I I I I 

Db 614 

TCGTGCCCTTTGTCCACAGGGCCCAAGAGCCTGCAGGTGAGGGTGACGTTCTGCCCCTCG 555 

Qy 706 CCGGGAATGCGGC CT-CCTCC— 

CCCT 729 

I I I I I I I I I I I I I I I I I III 

Db 554 

GGACAGACATACAGGGAATACGGCGTGGCGACCTTGAAGGCTGCCACCGGACCTGCTCAG 495 

Qy 73 0 -GGGATGCGAGGCCT-TCT-TTGAC CCTTG — 

GCCACAGAGTATG 769 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 494 

AGAGAGGAGAGCCCTGTCACCTGACTGATCCAGTGCTACTCCTGGCAACCCCAGAGCGGG 435 

Qy 770 GAAGTAGCT CCGCAG AGGCGTGGGCTCGATTCC — TCAG — 

GGCCACG 813 

I I I I I I I I I I I III III I I I I I I III 

I 

Db 434 

GAAGAACCTTAGCCTCTGCTACTTCCCAGCCACAGGCCAGGGGCATGTCAGCTGACCTCC 3 75 



Qy 814 TTACCACAGAC— CTG— TTTGTT 

TCTTATGCTGTTGTTCGTGG 853 

II I I I I III I I I I I I I I I I I I I I I I I I I 

I I 

Db 374 

CTAAGACTGCCTTCTGCATTTGTTCACAAACCCCACTCCATCCTAGGCTGTTGTAGGGGG 315 

Qy 854 AGTCTCA TGGGATTG — TCTGGTTTC 

CC 879 

I I I I I I I I I I I I I I I I I 

I 

Db 314 

AGTCACAGAAGCTTCCCCAACTAGGGGTGCTCCTGGATGCTGGACAGGGCAGGGAGAGAC 2 55 

Qy 880 TTACAGGGCC CCCTC— 

CCC 897 

I I I I I I I I I I I I I I I 

Db 254 

TTGGAGGGCAGTGATGCAGTCGGGTGTGTCCCTCTTCCCAGTAGAGTATATGCAAAACCT 195 



Qy 89 8 CGGGAATGC GCC- 

CACCAAGGCCC 92 0 

I I I I I I I II III I 

I I 

Db 194 

GTTGGGTGGGAGGCAGGTTGGGGAGCAGGAAGGCTGTTTCGAACAGCAGCTCCTTGTCCT 135 

Qy 921 TAGACTCATCT TGGCC 

CTCCTCAGCTCCCTGCCTGTTTCCCGTAAGGCTGT 971 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 13 4 TGGCCGC-TCTGTAGGGGCCATCAACACCTCTGC-CCCTTCC 

TACTCCTAGGACTGA 8 0 

Qy 9 72 ACAT — AGTCCTTTTATCTCCTTGTGGC-CTATGA AACTG-GTTTATAA 

TAA 1019 

I I I I III I I II I I I I I I I I I I I I I I I II 

Db 79 

ACATGCCTTCCCATCCTTGCCAGGGAGCACTAAGAGAGCACCTGCGTGTGCAAGGCCTGG 2 0 

Qy 102 0 AC T C T T AAG AG AAC AT T AT AAT TGC 10 44 

I I I I I I I I I I I 

Db 19 GCCAGTAGGCCAAC AGTGC 1 



RESULT 4 
US-10-478-914-4 

Sequence 4, Application US/10478914 
Patent No. 7335755 
GENERAL INFORMATION: 
APPLICANT: NAKAGAWARA, AKIRA

TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
FILE REFERENCE: 7388-80899 

CURRENT APPLICATION NUMBER: US/10/478,914 
CURRENT FILING DATE: 2003-11-26 
PRIOR APPLICATION NUMBER: PCT/JP02/05294
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-162775 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-255226 
PRIOR FILING DATE: 2001-08-24 
NUMBER OF SEQ ID NOS : 417 
SOFTWARE: PatentIn version 3.2
SEQ ID NO 4 
LENGTH: 3210 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-478-914-4 

Query Match 44.5%; Score 2898; DB 1; Length 3210; 

Best Local Similarity 36.7%; Pred. No. 8.7e-182; 
Matches 737; Conservative 0; Mismatches 290; Indels 980; 
Gaps 121; 

Qy 1 GAATTCCCTGAGGAGGC 

GAATCCGGCGGGTATCAG 35 

I I I I I I I II I I I I I I I I I III 

Db 444 

GGGTTGCCAGGAGTAGCACTGGATCAGTCAGGTGACAGGGCTCTCCTCTCTCTGAGCAGG 5 03 



Qy 36 AGCCATCAGAACCGCCA CCATG 

ACGGTGGGCA 6 7 

I I I I I I I I I I I I I I I I II 

I I I I I 

Db 504 

TCCGGTGGCAGCCTTCAAGGTCGCCACGCCGTATTCCCTGTATGTCTGTCCCGAGGGGCA 563 

Qy 6 8 AGA GCA GCAAGATG 

CTG 84 

I III I I I I I I I 

I I 

Db 564 

GAACGTCACCCTCACCTGCAGGCTCTTGGGCCCTGTGGACAAAGGGCACGATGTGACCTT 623 

Qy 85 CAGCATAT — TGATTAC AGGATGAGGTGCA-TCCTGC AG- 

GACGGCC 12 7 

I II III I II I I I I I I I I I I I I I III 

I I I I I 

Db 624 

CTACAAGACGTGGTACCGCAGCTCGAGGGGCGAGGTGCAGACCTGCTCAGAGCGCCGGCC 6 83 

Qy 12 8 GGATCTTCATTGGCACCTT 

CAAGGCT 153 

I I I I I I I I I I I I I I I I I I I I 

Db 684 

CATCCGCAACCTCACGTTCCAGGACCTTCACCTGCACCATGGAGGCCACCAGGCTGCCAA 7 43 

Qy 154 

TTTGACAAGCA 16 4 

I I I I I I I 

Db 744 

CACCAGCCACGACCTGGCTCAGCGCCACGGGCTGGAGTCGGCCTCCGACCACCATGGCAA 8 03 

Qy 165 CATG AATTTGATCCT CTG-TGACT- 

GT 189 

I I I I I I I I I I I I I I I I I I I 

I I 

Db 804 

CTTCTCCATCACCATGCGCAACCTGACCCTGCTGGATAGCGGCCTCTACTGCTGCCTGGT 863 

Qy 19 0 GAT - GAG T T C AG AAAGATCAA GCCAAAGAACT- 

C 221 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 864 

GGTGGAGATCAGGCACCACCACTCGGAGCACAGGGTCCATGGTGCCATGGAGCTGCAGGT 923 



Qy 222 -CAAACAAGCAGA 

AAGGGAAG 2 41 

II III III I 

I I 

Db 924 

GCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGTACCCATCCTCCTCCCAGGAGAG 9 83 

Qy 2 42 AGAA GC GAGTCCTCGGT 

CTGGTG 264 

III II I I I I I I I I 

I 

Db 984 

TGAAAGTAAGGGACCAACCTCTTGCCCCTTTTGGGTTCTCTGTTTTCTTCTGTCCTCATC 10 43 



Qy 265 CTGCTGCGAGG GGAGAATCTGGTCTCA 

AT G AC AG TAG AG 3 03 

I I I I III I I I I I I I I I I I I I III I 

Db 10 44 CTGCACCCAGACCCTGTTTGGA- 

ACTCTGGCCTCATCACCCCAAGCCCTCAGAACCCCCC 1102 

Qy 304 GGACCTCCTCC CAAA GATAC 

TGGT 327 

I I I I I I I I I I III III I I I I 

Db 1103 

GGTCCTCCTCCTTTTCTGCTGCTGCACATCCCTTCTGCTTCCTCCTTGGTGCAATCCCCA 1162 

Qy 32 8 ATTGCTC — GAGT TCCACTTG 

CTGGAGCTG 355 



Db 1163 

GAAGCCCACTCTCCTTCCATCTGCTCTGGAGTCTCTGCTCCTCTTGACTCTCTGGAGTGG 12 2 2 

Qy 356 CCGGGGGCC CAGGGATC GGCAGGGC TGC TGGCAG 

AGGAAT 395 

I I I I I I I I I I I I I I I I I I III I I I I I 

Db 1223 CT— 

GTGCCTTGGCAGTGACCTTTGGCCAGGGCAAGTGCCTCATGACAGGTACTGGGTGC 12 8 0 

Qy 396 CC CAGCTGGGGTTC-CCATGCC — CCAGGCTCCT 

GCAGG 431 

II I I I I I I I I I I I I I I I I I I I I I 

Db 1281 

CCCAGGCAGCTAAGTGCCGCCCTGCCCACCAGCCCCCTATGGCTTGGGAAGGCTGGGGGT 13 4 0 

Qy 432 — ACTTG CTGGG CCAGTC 

CGT 450 



Db 1341 

CCTCTTGGCCAACAGGGTGAAACCCCATCTCTACTAAAAACACAAAAATTAGCCAGGCAT 14 0 0 

Qy 451 GGGGTTGGCGGGCC ATCCCAAC AGG-TGA 

TGAC 482 



Db 1401 GGTGGT- 

GCGCGCCTGTAGTCCCAGCTACTCAAGAGGCTGAGGCAAGAGAATCGCTTGAA 1459 

Qy 4 83 CCCACAAGGAAGAGG-TACTGT-TGCAGC 

CGCTGCAGCTGCTGCCACAGCCAGT — A 535 



Db 146 0 CCCGGGAGTCAGAGGTTGCAGTGAGCCGAGATCGC-GCCACTGC- 

ACTCCAGCCTGCTGA 1517 

Qy 536 TTGC-CGGGGCTCCAACCCAG TACCCA CCTGGCC— 

GTGGGGGTCC 578 



Db 1518 

TAGAGCAAGACTCCATCTCAAAAAAAACAACCCAAAATTTGCCTGGCATGGTGGCAGGCA 15 7 7 

Qy 579 TC 



Db 1578 

TCTGTAATCCCAGCTACTCGGGAGGCTGAGACATGAGAGCTGCTTGAACCTGGGAGGCAG 163 7 

Qy 58 7 CTATGGGCCGAGGAGCACCCC 

CTCCAGGCATGATG 621 

I I I I I I I I I I I I I t I I I I I I I I III 

Db 163 8 AAGTTGCAGTGAGCCGA- 

GATCACACCACTGCACTCCAGCCTGGGTGACAGAGCGAGACT 1696 

Qy 622 — GGCCCA CCTCCTGGT AT GAG AC C T C - 

CTATG 651 

I I I I I I I I I I I III I I I I I I I 

Db 1697 

CTGTCCCAAAAAATCAAAAAAATCACTTTTGGTAGAGATGCACTCTCGCTATGTTGCCCA 1756 

Qy 652 GGTC 

CCCCAATGGGGATCCCCCCTGGAAGAGGGACTCCAA 691 

I I I I I I I I I I I I I I I I I I I 

Db 1757 GGCTGGTCTTGAACTCCTGGGCTCAATTGATCTTCCCACCTTGA 

CCTCCAAAGT 1810 

Qy 692 — TGGG CATGCCC— 

CCTCCGGGAATGC 715 

I I I I I I I I I I I I I I I I I I I I I 

Db 1811 

GCTGGGATTACAGGTGTGAGCCACCATGCCTAGCCTCAGGGAATTCTTATAAGAACTCTA 1870 



Qy 716 GGCCTC CTC CCCCTGGGATGCGAGGCCTT 

CT 746 

I I I I I III I I I I I I I I I I I I I I 

I I 

Db 1871 

TGAAGTAGGCATCACCATCTTCTCTGTATCCATGGAAAGAGAGGCCTAGAGATGTATGCT 193 0 

Qy 747 TTGACC CTTGGCCACAGAGTA TGGAAGT — AGCT-- 

CCGC 782 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1931 AACTTGCCCAAGCTCACATAGCC- 

CAGGGTAGCATAGCTGGGATGTTGAGCTGAGGCCGG 1989 

Qy 783 AG AGGCG-TGG GCTC GATTCCTCAGG GC- 

CACGT 814 

II I I I I I I I III I I I I I I I II 

I I I 

Db 1990 

AGGAGAAGTAGCAGTCGCTGGCAGAGCACACAGGCTGCTCTGGGGGATGAGCTGGTGCGT 2 0 49 

Qy 815 TAC CACAGACCTG-TTTG — TTTCTTATGCTGT-TGTTCGTGGAGT- 

CTCATGGGAT 866 

I I I I I I I I II II I I I I I I I I I I I I I 

I 

Db 2050 

TTAAGGAACAGGCCAGCACTGGCATTCGCAAGCAGTGGGGAAGGGGAGAGATGCCGAGGT 2109 
Qy 867 TGTCTGGTTTCC 

CTTACAGGGCCC 89 0 

III I I I I I I I I I I I I I I 



Db 2110 GGTC- 

AGTATCCTGACTTTCAGAGGCCTTTTTTTGTTTGTTTTAATTTTTGCTAGATTGA 216 8 



Qy 891 CCTC CCCCGGGAATG CGCCCA- 

CCAAGGC 918 



Db 2169 

TATTAAAAACTCATGTGGAGGAACTCAAGGAATGTTTAGAAGACCAAAAGTCCCCAATGA 2 2 2 8 
Qy 919 C CTA 

GACTCATCTTGGCCCTCCTCAGCTCCCTGCCTGTTTCCCGTAAGGCT 969 

I II II I I I I I I I I I III I I I I I I I I I 

Db 2229 CAGGAACAAAAGCAACCAATTTTTAACTTTCTCTTCTCATT- 

CCTGTTTTCATTGATTTC 2287 

Qy 970 GTACA — TAGTCCTTTT-ATC TCCTTGTGGC 

CTATGAAACT 1007 

III I I I I I I I I I I II I I I I I I I III 

Db 2288 

CCACATGTAGTCCTTTTGCTCAGGAAGTCTTTGGGGAAATTAAGGATCTTTGAAGCTCTG 23 4 7 

Qy 1008 GGTTTAT 

AATAAACT-C 1023 



Db 2348 

AAATAGGTGATCAGGTTAGTGGTGTCTGTCAGCTGTCTAAGAGGTTGGAAAATGAACTAC 2 4 0 7 

Qy 102 4 TTAAGA G AAC AT T AT AAT T G 1043 

I I I I I I I I I I II I 

Db 2 40 8 T C AAG AT AG T C ACGAAAAT AC TGAAAG 2434 



RESULT 5 

US-10-478-914-61/C 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 
; PRIOR APPLICATION NUMBER: PCT/JP02/05294 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 

PRIOR FILING DATE: 2001-05-30 

PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61 

LENGTH: 2437
TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-478-914-61 



Query Match 43.2%; Score 2818.5; DB 1; Length 2437;

Best Local Similarity 36.1%; Pred. No. 4.9e-177; 



Matches 710; Conservative 0; Mismatches 318; Indels 937; 
Gaps 116; 

Qy 1 GAATTC CCTGAGGAGG 



CGAATCCGGCGGGTATCAGAGCCATCAGAA- 46 

I I I I I I I I I II II I I I I I I I I I I I I 

I 

Db 19 72 GAGATCAGTGAGGCCTGAAGACATGTGTTCAATATCAAC- 

GGAATCTGGCCTCTCGGGAT 1914 

Qy 4 7 CCGCCACC ATGAC- 

GGTGGG 65 

I I I I I I I I I I I I I I I I 

Db 1913 

GAAATATTTCCTCCACAACTTGCTTCAACAATGGCTTATAAGATGCCTGGTGGGTATGAT 185 4 

Qy 66 CAAGAG 

CAGCAAGATGCT 83 

I I I I I I I I I I I I I 

I 

Db 1853 

TTCCAGGAACATATACAGACTTAAACTGTTTCAGGAGACTCCGAGCTCCAGCAATATCCC 179 4 

Qy 8 4 G CAGCATATT 

GATTACAGG 102 

I I I I I I I I I I I I 

I 

Db 1793 

GAAGTGAAGTAAAGAGTTCATCCACGGCAATTTTGGATTTGTGAAATGTCAGATTAATTG 173 4 

Qy 103 ATGAGGTGCA-TCC TGCAG GACGG-CCGGA- 

TCTTCATTGGC-A 142 

I I I I I I I I I I I I I I III I I I I I I I I I I 

I 

Db 1733 

AAGAGTCCCAGTCCGTAATTTTTTGCAGCAAAGGTGACAATCCAGATTCCTCACAGAAGA 16 7 4 

Qy 143 CCT TCAAGGCTT — TTGAC 

AAGCACATGAATTTGATCCTCTGTG 184 

III I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 16 73 CCTGAGCCAATGCTTTCTTGACTTTTTCTTCTGCTTCACACAGGTCTTTG- 

TCTACCGTG 1615 

Qy 185 A CTGTGA-TGAGT T C AG AAAG AT C A — AG — 

CCAAAGAAC 219 

I I I I I I III I I I I I I I I I I II II II I 

Db 1614 

AGCTCTCCAGTGACAGAATAAATATATATGAGAAGGATCAGCAGTTCCTCAGGGCTGTAG 1555 

Qy 22 0 TCC AAACA AGCAG 

AAAG 236 

III I I I I I I I I I II 

I 

Db 1554 

TCCTCGTTGGTTCTCTGGGTTACAGGCTTAATCATGGGCAGCAGCTGATTTAACACAACG 1495 

Qy 237 G GAAG-AGAAGC GA

GTCC 253 

I I I I I I I I I I II 



Db 1494 

GACATTGCTGACTCCCCAATGCTCTGAAGAAGGAGCCTTTCAAAAGCCAGAAAGTTGTCC 1435 

Qy 254 T CGGTCT GGTGCTGCTGCG AGGGGAGAATC 

TGG 286 



Db 1434 

CACTTGGCAGTCTGTGGGTGTTTCAACGTTTGAGCTGTGGCCAGTCCAAGCTGGAGGAGG 1375

Qy 287 TCTCAATGA — CAGTAGAG 

GGAC 307 

I I I I I I I I I I I I I I I I I I 

Db 1374 

CCACAATGATTCATTAGAGCTTTGAGGTTGTTCTTGAAGAGCTGAATATAGGACATGAGC 1315 

Qy 308 CTCCTCCCAAA GAT— ACTGGTAT TGCT 

CGAG 337

III Mill III I I I I I I I I I 

Db 1314 

TGTCCCGGTGTGACTCTCCCCATACTCATCTTGATTGGCAGGTTTTCTCTGCTTGCCGC- 1256

Qy 338 TTCCACTTGCTG GAGCTGCCG

GGGGCCCAGGGATCGGCAGGGC 380 



Db 1255 

TTCCACTAGATGTCTCCGAACTTCCATCACTGCCTCTTTGTGCTTAGTGTTCAGTAAAGC 1196 

Qy 381 TGC TGGCAGAGGAATCCCAG 

CTGGGGTTCCCAT 413 



Db 1195 

TTCCCATAGGGCTTTGGCTGTGGTGTCACTGAATTGTGAAAGACAGCCTGGTGCAACCAC 1136 

Qy 414 G CCCCAG GCT 

CCTGCAGGAC 433 



Db 1135 

ATTATAATTTTCCTCCTCAGTATGGAGTGCAGTGAGCGCTATCATGTTAACCATCACATC 1076

Qy 434 -- TTGCTGGGCCAGTCCG-TGGGGTTG-GCGGGC CATCCCAAC--

AG 474 

III I I I I I I I I I I I I I I I I I I I I I II 

Db 1075 

ATTTGTGTGGCCTGGGAGCTGGGGAAGTGCTGAAATGATCTTCTCTACTAAGTTGTCTCC 1016 

Qy 475 GTGATGACCC CACAAGGAAGA GGTACTGT TGCAGCCGCTGCAG --

CT 519 



Db 1015 

ATGATGTCCAACTGCTCCTGTGAGATCCAGGGTTCTGTCCACAAAAACCACTGATGCCCT 956 

Qy 520 G-CTGCCACAG CCAGT -- ATTGCCGGGGCTCCAAC -- CCAGTACC

C— 560 

I I I I I III I I I I I I I I I I I I I I I I I I II I 



Db 955 

GCCTGCAGCAGTCTTCTTCCTGTTCTTTGCAGGGGCATAATTGGCCAGATCCGCAGCGAT 896 

Qy 561 -ACCTGGCCGTGGGGG TCCTCCC— 

CCACCTA 589 

I I I I I I II I I I I I I I I I I I I 

Db 895 

GACCTGACTTAAGGAACCTACAGCAAAACACTCCTCCCGTACTCCTAAATGTTCACACAG 836 

Qy 590 TG-GGCC 

GA 597

II I I I I 

I 

Db 835 

AGAACTGAGGCCTGACACTAGGCATCTGATCTGCAGCAGCAGCTCTGGGGTTAGCGTAGT 776

Qy 598 GGAG CACCC CCTCCAGGCATGATGGGC-

C 625 

I I I I I I I I I I I I I II I I I I I I 

Db 775 

GGAGTCCACATCACCCAGGCTTCCCAGCTTCCTCTTGTCCGGTCGGGCGCTATTAAGGAG 716 

Qy 626 CACCTCCTGGTAT GAGACCTCCTATGG-GTCC 

CCCAATG 663 

I I I I I I I I I I III I I I I I I II 

I I 

Db 715 

GTGCACATCCTGGGGTAGCAGTGGGAAAAGGGATGCAAAAGCTGGAGTCAAGGCAAAGTG 656 

Qy 664 GGGATCCCC

CCTGGAAGAGGGACTCCAATGGGC 696 

I I I I I I I I I I I I I I I I I I I I 

Db 655 GGGAGCAACAGGGGCAAGCAATAACGGGACATGGAACACC — 

TCGGCCGTGTAGTTCATG 598

Qy 697 ATGCCC CCTCCGG 

GAATGCGGCCTCCTC 724

I I I I I I I I I I I I I I I I I I 

I I I I 

Db 597 TTGCCCATCCATTCACACAGCTTCTCCTCCAGCTGCTCGAACACCGGCTGCTGCC-



Qy 725 C CC-CTGGGA 

TGCGAGGCCTTC 745 

I I I I I I I I I II 

I 

Db 538 

CATCTCGGCCGCTGCCGCCGCTGGGACATGATTAGCTGTGAGGTGGACAGCGTGGCTCAC 479

Qy 746 TTTGACCCTTGGCCACAGAGTA-TGGAAGTAGCT-CCGCAGAGG

CGTGGGC 794

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 478 GGTTGTGACC 

ACCACACAATACTGGAAGTGACTGCGGCAGATGATGTCCCGTAGGA 423 

Qy 795 TCGATTCCTCAG GGCC AC G T T AC C AC AG AC - - 

CTGTT 829 

II I I I I I I I I I I I I I I I I 



Db 422 TC 

TCCACGGTCCGGCCTTTCAGCAGGCAGCTCAGCACAAACACTGCCTTGGGCTGCT 366 



Qy 830 TGTTTCTTATGCTGTTGTTCGTGGAG

TCTCATGGGATTGTCTGGTTTCC — 878 

I I I I I I I I I I I I I I I I I I I I I I 

Db 365 TGGCTCCACCACCAAT- 

TGCGTCGGGCTCGAACTCTCGCAGGTGACAGTCAGGGCCCCCC 307

Qy 879 CTTACAGGG CCCCCTCCCC CGG-

GAATGCG 907 



Db 306 

ACCGCCTCCAGGAGACGGGTGGATCCGCAGCCCCAGTGCAGGCTCTCGGCGCAGGCGGCG 247
Qy 908 CCC 

ACCAAGGCCCTAGACTCATCTTGGCCCTCCTCAGCTCCCTGCCT G 955 



Db 246 TCCAGGTAAACCACAGCCC --

GTTTCACTTTGGCCAGCACCTGCTCCCATCCTTGCTGGG 189 

Qy 956 TTT CCCGTAAGGCT 

GTACATAGTC 979

I I I I I III I I I I I 

Db 188 

TAAAGGACAGTACGCCCGAGGCGCTCATGGTTGGGGATTCGCAGACTTGGGAAACTACGG 129

Qy 980 CTTTTATC TCC 

TTGTGGCCTATGAAACT G 1008 

I I I I I I III I I I I I II 

Db 128 

TGCAGGAACTTCTTTCAGAACTCACCGCTTCCGGAAATTGGGCTCCGGGAGACTTTGACA 69

Qy 1009 GTTT-- ATAATAAACTCT-TAAGAGAACATTAT AATTGC 1044

III I I I I I I I III I I I I I I I I I I 

Db 68 GTCTCCACAGTACACCCAGTAGGCCAACAGTGCTCGAGGAATTCC 24



RESULT 6 

US-10-478-914-61 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61 

; LENGTH: 2437
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-61 

Query Match 43.2%; Score 2816; DB 1; Length 2437;

Best Local Similarity 36.2%; Pred. No. 7.4e-177; 
Matches 706; Conservative 0; Mismatches 322; Indels 924; 
Gaps 110; 

Qy 2 AATTCCCTGAGGAGGCGA ATCCGGC 

GGGTATCAGAGCCATCA 43 

I I I I I I I I I I I I I I I I I I II I III 

I 

Db 90

AATTTCCGGAAGCGGTGAGTTCTGAAAGAAGTTCCTGCACCGTAGTTTCCCAAGTCTGCG 149 

Qy 44 GAACCGCCACCATGA CGGTGGGCAAGAG CAGCAA

GATGC 82 

I I I I I I I I I I I I I I I I I I I I I I III 

Db 150 

AATCCCCAACCATGAGCGCCTCGGGCGTACTGTCCTTTACCCAGCAAGGATGGGAGCAGG 209

Qy 83 T GC AG — CATATT GAT T AC AGGATGAGG T GCATCCTGC 

AGGAC 123 

I I I I I I I I I I I I I I I I I I I II I I I I I II I 

Db 210 TGCTGGCCAAAGTGA- 

AACGGGCTGTGGTTTACCTGGACGCCGCCTGCGCCGAGAGCCTG 268

Qy 124 GGC -- CGGATCTTC

ATTGGCACCTT 146 

I I I I I I I I I I III 

I I I I I 

Db 269 

CACTGGGGCTGCGGATCCACCCGTCTCCTGGAGGCGGTGGGGGGCCCTGACTGTCACCTG 328

Qy 147 CAAGGCTTTTGACAAGCAC-ATGAATTTGAT CCT 

CTGTGAC 186 

I I I I I I I I I I I I I I I I I II I I I I 

Db 329 CGAGAGTT 

CGAGCCCGACGCAATTGGTGGTGGAGCCAAGCAGCCCAAGGCAGTGTT 384

Qy 187 TGTGATGAGTT CAGAAAGATCAAGCCAAAGAACTCCAA ACA 

AG 229 

I I I I I I I I I I I I I I I I II II I I I I III 



Db 385 

TGTGCTGAGCTGCCTGCTGAAAGGCCGGACCGTGGAGATCCTACGGGACATCATCTGCCG 444

Qy 230 CAGAAA GGGAA GAGAAGCG-

AGTCCTC 255 

I I I I III III I I I I I I I 

Db 445 

CAGTCACTTCCAGTATTGTGTGGTGGTCACAACCGTGAGCCACGCTGTCCACCTCACAGC 504

Qy 256 -GGTCTGGT GCTGCTGC GAGGGG- 

AGAATCTGGT 287 

II II I I I I I I I I I I I I I I I I I I I 

Db 505 

TAATCATGTCCCAGCGGCGGCAGCGGCCGAGATGGAGGGGCAGCAGCCGGTGTTCGAGCA 564



Qy 288 -CT CAATGACAGTAGAGGGAC CTCCTC— 

CCAAAG 319 

II I I I I I I I I I I I I I I I I I I 

Db 565 

GCTGGAGGAGAAGCTGTGTGAATGGATGGGCAACATGAACTACACGGCCGAGGTGTTCCA 624

Qy 320 -ATACTGGTATTGCT CGAGTT CCACTT

GCTGGAGCTGC 356 

I I I I I I I I I I I I I I I I I I I I III II 

I 

Db 625 

TGTCCCGTTATTGCTTGCCCCTGTTGCTCCCCACTTTGCCTTGACTCCAGCTTTTGCATC 684

Qy 357 CGGGGGCCCAGGGAT 

CGGCAGGGCTGCTG 385

II I I I I I I I I I I I I I 

I 

Db 685 

CCTTTTCCCACTGCTACCCCAGGATGTGCACCTCCTTAATAGCGCCCGACCGGACAAGAG 744

Qy 386 GCAG-- AGGAATCCCAGCTG GGGTTCC CATGCCCCAG

GCTCCTGCA 429 

III I I I I I I III II III I I I I I I I III 

I I I I I 

Db 745 

GAAGCTGGGAAGCCTGGGTGATGTGGACTCCACTACGCTAACCCCAGAGCTGCTGCTGCA 804

Qy 430 GGAC -- TTGC TG GGCC

AGTCCGTGGG — 453 

I I I I I I I I I I I I I I I I I I 

Db 805 

GATCAGATGCCTAGTGTCAGGCCTCAGTTCTCTGTGTGAACATTTAGGAGTACGGGAGGA 864

Qy 454 GTTGG CGGGCCATCCCAACAG 

GTGATGACCC 484

I I I I I I I I I I I I I I I I I I 

I I I 

Db 865 

GTGTTTTGCTGTAGGTTCCTTAAGTCAGGTCATCGCTGCGGATCTGGCCAATTATGCCCC 924

Qy 485 CACAAGGAAGAGGT ACTGTTGCAGCC GCTGCAG 

CTGCTGCCACAGC 530

I I I I I I I I I I I I I I I I I I I II III II I I I I 

I 

Db 925 

TGCAAAGAACAGGAAGAAGACTGCTGCAGGCAGGGCATCAGTGGTTTTTGTGGACAGAAC 984

Qy 531 C AGTATTGCCGGGGC TC 

CAACCCAGTA 557

I I I I I I I I II I I I I I I I I 

Db 985 

CCTGGATCTCACAGGAGCAGTTGGACATCATGGAGACAACTTAGTAGAGAAGATCATTTC 1044

Qy 558 CCCA CCTGGCC GTGGGGGT CCT 

CCCCCAC 586 

I I I I I I I I I I I I I I I I II II 

I I I 

Db 1045 

AGCACTTCCCCAGCTCCCAGGCCACACAAATGATGTGATGGTTAACATGATAGCGCTCAC 1104



Qy 587 

CTATGGGCCGAGGAG 601 

II I I I I I I I 

Db 1105 

TGCACTCCATACTGAGGAGGAAAATTATAATGTGGTTGCACCAGGCTGTCTTTCACAATT 1164

Qy 602 CACCCCCTCC-AGGCATGATGG GCCCAC

CTC 631 

I I I I I I I I I I I I I I I I I I II 

Db 1165 

CAGTGACACCACAGCCAAAGCCCTATGGGAAGCTTTACTGAACACTAAGCACAAAGAGGC 1224

Qy 632 CTGG TATGAGACCTCCTATGG GTC CC— 

CCA 660

III I I I I I I I I III II I I I I I 

Db 1225 

AGTGATGGAAGTTCGGAGACATCTAGTGGAAGCGGCAAGCAGAGAAAACCTGCCAATCAA 1284

Qy 661 ATGGGGA TCCCCCCTGG 

AAGAG 682

I I I I I I I II I II II 

I I I I 

Db 1285 

GATGAGTATGGGGAGAGTCACACCGGGACAGCTCATGTCCTATATTCAGCTCTTCAAGAA 1344

Qy 683 GGA CTCCAATGGG-CAT GCCCCCTCCG

GGAATGCGGCCTC 721 

I I I I I I I I III I I I I I I I I I I I I I I I I I 

Db 1345 CAACCTCAAAGCTCTAATGAATCATTGTGGCCTCCTCCAGCTTGGACT --

GGCCACAGCT 1402 

Qy 722 CTCCCCC TGGGA 

TGCGAGG — C 741 

I I I I I I I I I I I I I I I 

I 

Db 1403 

CAAACGTTGAAACACCCACAGACTGCCAAGTGGGACAACTTTCTGGCTTTTGAAAGGCTC 1462 

Qy 742 CTTCTT TGACCCTTG 

GCCACAG 763 



Db 1463 

CTTCTTCAGAGCATTGGGGAGTCAGCAATGTCCGTTGTGTTAAATCAGCTGCTGCCCATG 1522

Qy 764 AGTATGGAAGTAGC TCCGCAGAGG

CGTGGG 793 

I I I I I I I I II I I I I I I I I 

Db 1523 

ATTAAGCCTGTAACCCAGAGAACCAACGAGGACTACAGCCCTGAGGAACTGCTGATCCTT 1582

Qy 794 CTCG ATTC CTCAGG GC-CACGTTA-

CCACAGACCTGTTTGTTTCT 836 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1583 

CTCATATATATTTATTCTGTCACTGGAGAGCTCACGGTAGACAAAGACCTGTGTGAAGCA 1642

Qy 837 TATG CTGTTGTTC 

GTGGAGTCTCATGGGATTGTC 870



Db 1643 GAAGAAAAAGTCAAGAAAGCATTGGCTCAGGTCTTCTGTGAGGAATCT

GGATTGTC 1698

Qy 871 TGGTTTCC 

CTTACAGGGCCCCCTC C 895 



Db 1699 

ACCTTTGCTGCAAAAAATTACGGACTGGGACTCTTCAATTAATCTGACATTTCACAAATC 1758

Qy 896 C CCGGGAATGCGCCCACCAAGGCCCT 

AGACT 926 



Db 1759 

CAAAATTGCCGTGGATGAACTCTTTACTTCACTTCGGGATATTGCTGGAGCTCGGAGTCT 1818 

Qy 927 C ATCTTGG CCCTCCT — 

CAGCT 946

I I I I I I I I I I I I 

Db 1819 

CCTGAAACAGTTTAAGTCTGTATATGTTCCTGGAAATCATACCCACCAGGCATCTTATAA 1878

Qy 947 -CCCT GCCTGTT 

TCCCGTAAGGCTGTACATAGTCCT 981 



Db 1879 GCCATTGTTGAAGCAAGTTGTGGAGGAAATATTTCATCCCGAGAGGC

CAGATTCCG 1934

Qy 982 TTTAT CTC CTTGTGGCCT ATGAAACTGG-

TTTATAATAA 1019 

I I I I II I I I I I I I I I I I I I I I I I 

Db 1935 

TTGATATTGAACACATGTCTTCAGGCCTCACTGATCTCCTTAAAACTGGATTTAGCATGT 1994

Qy 1020 ACTCTTAAGAGA ACATTATAATTGC 1044

I I I I I I I I I I I I 

Db 1995 TCATGAAGGTGAGCCGGCCTCATCCTAGTGAC 2026
Qy 1 GAATTCCCTGAGGAGGCGAATCCGGCGGGT --

ATCAGAGCCATCAGAACCGCCACCATG- 57 



Db 1885 

GTGGGCCCTGAGGAAGCACCACCAGAGGGAGGAGCTCCACCACCAGGAAATCCCCCAGGC 182 6 
Qy 5 8 ACGGTGGGCAAG — 

AGCAGCAAGATGCTGCAGCATATTGAT 96 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1825 

ATTCCTCCTGGCATGCCTCCTGCACTCTGGTACAGCTTGGTGATGATGGGGTTGCAAACT 1766 

Qy 9 7 TACAG GATGAGGTGCA TCCTGCAGGACGGCCGGA- 

TCTT 134 

I I I I I I I I I I I I Mill I I I I I 

Db 1765 

TTCTCCAGCTCTTTCTGTTGATGTTCAAATTCTTCCTTCTCAGCAGTCTGATTCTTATCA 17 06 
Qy 135 — CA- 

TTGGCACCTTCAAGGCTTTTGACAAGCA 16 4 

I I I I I I I I I I I I I I I I I I 

Db 1705 

AGCCAGTTGATAATTTCATTACACTTGTCCAGAATCTTCTGTTTGTCCTCATCGTTAATC 16 46 

Qy 165 

CATGAATTTGATCCTCTGTGACTGTGATGAGTTCAGAAAGATCAAGCCAAAGAACT 220



Db 1645 TTGCCTTGAAGTTTCTCATCTTCAACAGT — TGCTTTC 

ATGTTGAAGGCATAGGACT 1591 

Qy 221 C C AAAC AAGC AG AAAGGGAAGAGAAGCG AG T-CCTCGG — 

TCTGGTGCTGCTGCGAGGGG 2 77 

I I II I I I I I II I I I I I I I III I I I I 

Db 1590 C AAGTGAATTCTTGGATGA-CACCTTGTCCCTCTGCTTCTCAT- 

CTTCAGCTTTGTA 1536 

Qy 2 78 AGAATCTGGTCTCAATGAC AGT AGAGGG AC C T C C T 

CCC 315 

I I I I I I III I I I I I I I I I II 

Db 1535 CTTCTCAGCT- 

TCCTGGACCATACGTTCAATGTCTTCCTTGCTCAAACGGCCCTTGTCAT 14 7 7 

Qy 316 -AAAGATACTGGTATTGCTCGAGTTCCACTTGCTGGAG-CTGC — 

CGGGGGC 363 

I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 1476 



TAGTGATAGTAATCTTGTTCTCTTTTCCCGTACTCTTGTCCACAGCAGAGACATTGAGTA 1417 

Qy 36 4 — CCA— GGGATC GGCAG GGCTGCTGGCAGAGGAATCCC 

AGCTG 403 

III I I I I I III I I I I I I I I I I I II 

Db 1416 

TACCATTGGCATCAATGTCAAAAGTGACTTCAATCTGAGGAACACCTCGGGGTGCAGGAG 135 7 

Qy 40 4 GGGTTC CCATGC CC 

CAGGCT 423 

I I I I I II I I I II II 

Db 1356 

GTATGCCTGTGAGTTCAAACTTGCCAAGCAGGTTGTTATCCTTTGTCATGGCACGCTCGC 12 9 7 

Qy 42 4 CCT GCAGGACTTGCTGG — GCCA 

GTCCGTG 451 

III III I I I I I I III 

I I I 

Db 1296 

CTTCATAAACCTGAATAAGCACACCAGGCTGGTTGTCAGAATAGGTAGTGAAGGTCTGTG 123 7 

Qy 452 GGGTTG GCGGGCCATC 

CCAACAG — G 4 75 

I I I I I I I I II III 

I 

Db 1236 

TCTGCTTGGTAGGAATGGTGGTATTACGCTTGATGAGGACAGTCATGACTCCACCAGCAG 117 7 

Qy 4 76 TGATGACCCCACAAGGAAGAGG 

TACTGTTGCAGCC 510 

I I III I I I I I I I II I I I I I 

Db 1176 

TTTCAATACCAAGGGAAAGAGGAGTGACATCCAAGAGCAGCAAATCTTGAACATTCTCAG 1117 

Qy 511 GCTG CAGCTGCTGCCACAGC — 

CAGTATTGCCGG 542 

I I I I I I I I I I I I III III 

I 

Db 1116 

ACTTGTCTCCAGACAAGATGGCTGCCTGGACAGCTGCACCATAAGCAACAGCTTCATCAG 105 7 

Qy 5 43 G GCTCCAACCCAGTAC CCACCTG 

GCCGTGGGGGTCCTCCCCCACC 587 

I I I I I I I I I I I III II I I I I I I II 

I 

Db 1056 GGTTGATGCTCTTATTCAGTTCTTTTCCA- 

TTGAAGAAGTCTTGGAGAAGCTTCTGAATC 998 

Qy 58 8 TATGGG — CCGAGGAGCACC-CCCTCCAGG 

CATGA 619 

I III I I I I II I I I I I I I I I I I I I I I 

Db 997 

TTGGGGATACGAGTAGAACCACCAACCAGGACAATATCATGAATCTGTGACTTGTCTAGT 93 8 

Qy 62 0 TGGGCCCA CCTC 

CTGGTATG 639 

I I I I I I I III I I I I 

Db 937 

TTGGCATCTCGAAGGGCTTTCTCTACTGGGTCCAGGGTGCCACGGAACAGGTCAGCATTC 8 7 8 



Qy 6 40 AGACCTCCTAT GGG 

TCCCCCAATGGG 665 

I I I I I I III I I I I I I I 

Db 877 

AGTTCTTCAAATCGGGCACGGGTAATGGAGGTATAGAAGTCGATTCCTTCATAGAGAGAA 818 

Qy 666 — GATCCC CC CTGGAAGAGGGACTCCAATGGGCATGCCCCCTC- 

CG 708 

I I I I I II I I I I I I I I I I II I I I I I II 

I 

Db 817 

TCGATCTCAATACTGGCCTGGGTGCTGGAAGAGAGGGTACGCTTAGCACGTTCACAAGCA 75 8 

Qy 709 GGAATGCGGCCTCCTCC CCCTGGGATGC GAGGCCTTCTT 

TGACCCTTG 756 

II I I I I I I I I III I I I I I I I I I I I 

I I I 

Db 757 

GTACGGAGGCGTCTTACAGCTCTCTTGTTCTCACTGATGTCCTTCTTATGCTTGCGCTTA 69 8 

Qy 75 7 GCCACAG AGTATGG 

AAGTAG 7 76 

I I I I I I I I I I I I I I 

Db 697 

AACTCAGCAATAAAATGGTTGACCATTCGGTTGTCAAAATCTTCTCCACCCAAGTGGGTG 63 8 

Qy 777 -CTCC-GCAG-AGGCGTGGGCTC 

GATTCCT 803 

I I I I I I I I I I I I III I I I I I I 

Db 637 

TCTCCAGCTGTAGACTTGACCTCAAAGATTCCATCCTCAATAGTGAGGATTGACACATCA 57 8 

Qy 804 CAGGGC CACGTTAC 

CACAGACCTGTTTGTTT 83 4 

I I I I I I I I I I I I II I I I I I I I I I 

I 

Db 577 

AAAGTGCCACCTCCCAGGTCAAAGATGAGCACGTTTCTTTCTGCTCCAACCTTTTTGTCT 518 



Qy 835 CTTATGCTGTT 

GTTCGTGGAGTC— 857 

I I I I I I I I I I I I I I 

Db 517 

AAGCCGTAAGCAATAGCAGCAGCAGTTGGCTCATTAATAATTCTAAGTACATTGAGACCA 45 8 

Qy 858 TCATGGGATTGTCTG — GTTTC CCTTACAG 

GGCCC 89 0 

I I I I I I I I I I II I I I I I I III 

Db 45 7 GCAATAGTTCCAGCATCTTTGG- 

T AGC C T G AC GC T GAG AG T C AT T AAAGT AAGC TGGC AC 399 

Qy 891 CCTC CCCCGGGAATGC 

GC 908 

III I I I I I I I I II 

Db 398 

TGTGACCACAGCATTGGTAACAGTCTTCCCAAGGTAGGCTTCTGCAATTTCCTTCATCTT 339 

Qy 9 09 CCACCAAGG CCCTAG- 

ACTC 92 7 



Db 338 

TGTCAGAACCATAGAAGACACCTCCTCTGGATAGAAGCTTTTGGTCTCTCCCTTGTATTC 2 79 

Qy 928 ATCTTGGCCCT CCT — CAGC TCCCTGCC 

TGTTTCCCGTA 964 



Db 278 

TACTTGGACCTTGGGCCTGCCAGCATCATTCACCACCATAAAGGGCCAATGTTTCATATC 219 

Qy 965 AGGCTG TACATAGTCCTTTTATC TCCT 

TGTGGC 997 

I I I I I III III III III I I I I I 

Db 218 

AGACTGGACAACAGCATCATCAAATCTGCGTCCAATCAGACGTTTGGCATCAAAAACTGT 159 

Qy 99 8 CTATGAAACTGGTTTAT 

AAT AAAC T C T T AAG AG AA 1032 

I I I I I I I I I I I I I I I I 

Db 158 

GTTGGTGGGGTTCATTGCAACTTGATTCTTTGCGGCATCACCGATCAACCGTTCAGTGTC 9 9 

Qy 1033 CATTA TAATTGC 1044 

III I I I I 

Db 9 8 CGTAAAGGCGACATAGC 82 
Qy 1 GAATTCC CTGAGGAGGCGAAT CCGG CGGGTATCAGAGCCA-

TCAG 4 4 



Db 21 

GTTTTCCAGCACGGAAAAGTCGAGATAATTGCCAATGATCAGGGAAACCGAACCACTCCA 8 0 

Qy 45 AAC CGCC AC CAT G A CGGTGG 

GCAAGAGCAGCAAGA 79 



Db 81 AGCTATGTCGCCTTTACGGACACTGAACGGTTGATCGGTGATGCCGCAA- 

AGAATCAAGT 139 

Qy 8 0 TGC — TG CAGCA TATTGATTACAG 

GATGAGGTGC 111 

I I I I I I I I I I I I I I I I I I I I I 

Db 140 

TGCAATGAACCCCACCAACACAGTTTTTGATGCCAAACGTCTGATTGGACGCAGATTTGA 199 

Qy 112 — ATCCTGCAGGACGGCCGGATCT TCATTGGCACCTTCAAG 

GCTTTTG 157 



Db 200 TGATGCTGTTGTCCAGTCTGATATGAAACATTGGC- 

CCTTTATGGTGGTGAATGATGCTG 258 

Qy 158 ACAAGCACATGAATTTGATCCTCTGTG 

AC T G T GAT GAG T T C AG AAAG ATC AAGCC AA 214 



Db 259 GCAGGCCCA 

AGGTCCAAGTAGAATACAAGGGAGAGACCAAAAGCTTCTATCCAG 312 

Qy 215 AGAA CTCCAA ACAAGCAGAAAGGGAAG 

AGAAGCGAGTCCTCG — 256 

III I I I I I I I I I I I I I I I I I I I I I 

Db 313 AGGAGGTGTCTTCTATGGTTCTGACAAAGATGAAGGAAATTGCAGAAGC- 

CTACCTTGGG 371 

Qy 257 — GTCTGGTGCTGCTGCGAGGGGAGAATCTGGTCTCAATGACAG 

TAGAGGGAC-C 308 



Db 3 72 AAGACTGTTACCAATGC 

TGTGGTCACAGTGCCAGCTTACTTTAATGACTC 421 

Qy 3 09 TC CTCCCAAAGATACTGG TATTGC 

TCGAGTT — 339 

II I I I I I I I I I I I I I I I I I I I I 

Db 422 

TCAGCGTCAGGCTACCAAAGATGCTGGAACTATTGCTGGTCTCAATGTACTTAGAATTAT 4 81 

Qy 3 40 CCACTTGCTGGAGC 

TGCCGGGGGCCCAGGGATCGGCAGGGCTGCTGGCAG 389 



Db 482 TAATGAGCCAACTGCTGCTGCTATTGCTTACGGCTTAG 

GAGCAG 53 7 

Qy 39 0 AGGAATCCCAGCTGGGGTTCCCATGCCCCAGGCTCCTGCAGGAC — 

TTGCTGGGCCAGTC 447 

I I I I I I I I I I I I I II I I I I I I I I I I 



I I I I I I 
ACAAAAAGGTTG- 



1 1 

Db 53 8 AAAGAAACGTGCT CATCTTTGACCTGGGAGGTGGC 

ACTTTTGATGTGTCAATC 59 0 

Qy 448 C GTGGGGTTGGC GGGCCATCCCAACAGGTGATGAC-CCCAC- 

AAGGA 492 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 591 

CTCACTATTGAGGATGGAATCTTTGAGGTCAAGTCTACAGCTGGAGACACCCACTTGGGT 65 0 

Qy 493 AGAGGTACTGTTG-CAGCCGCTGCAGCTGCTGCCACAGCCA — 

GTATTGCCGGG G 544 

III I I I I I I I I I III I I I I I I I I I I I I I 

I 

Db 651 G G AG AAG AT T T T G AC AAC C G AATGGT 

CAACCATTTTATTGCTGAGTTTAAG 701 

Qy 545 CTCCAACC CAGTACCCACCTG GCCGTGGGGGTCCTCC — 

CCCACC 58 7 

I I I I I I I I III I I I I I I I I I I I 

I 

Db 702 

CGCAAGCATAAGAAGGACATCAGTGAGAACAAGAGAGCTGTAAGACGCCTCCGTACTGCT 761 

Qy 58 8 TATGGGC CGAGGAGCACCCCCT 

CCAGGC 615 

III I I I I I I I I I I I I I I I I I 

Db 762 

TGTGAACGTGCTAAGCGTACCCTCTCTTCCAGCACCCAGGCCAGTATTGAGATCGATTCT 821 

Qy 616 ATGATGGG CCCACCT 

CCTGGTATGA 6 40 

I I I I I I I I I I I I I I I I I 

Db 822 

CTCTATGAAGGAATCGACTTCTATACCTCCATTACCCGTGCCCGATTTGAAGAACTGAAT 8 81 

Qy 641 GACC — TCCTATGGGTCC 

CCCAATGGGGATCCCCCCTGGAAGAGGGACTCCA 690 

I I I I II III II I I I I I I I I I I I I I I II 

I I I 

Db 8 82 GCTGACCTGTTCCGTGGCACCCTGGACCCAGTAGAGAAAGCCCTTCGA 

GATGCCA 936 

Qy 691 A— TGGGCATGCCCCC TCCGGGAATGCGG-CCTCCTC 

CCCCT 729 

I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 937 

AACTAGACAAGTCACAGATTCATGATATTGTCCTGGTTGGTGGTTCTACTCGTATCCCCA 996 

Qy 73 0 GGGATGCGAGGCCT-TCTTTGAC CCTTGGC CACAGAGTATG- 

GAAGTAGCT 778 

I I I I I I I I I III I III I I I I I I I I I I 

I 

Db 997 

AGATTCAGAAGCTTCTCCAAGACTTCTTCAATGGAAAAGAACTGAATAAGAGCATCAACC 1056 

Qy 779 CCGCAGAGGCGTGGGCT — CGATTC — CT CAGG — GCCA- 

C G T T AC C AC AG AC C T G T 82 8 



Db 1057 

CTGATGAAGCTGTTGCTTATGGTGCAGCTGTCCAGGCAGCCATCTTGTCTGGAGACAAGT 1116 

Qy 829 TTGT TTCTTATGCTGTTGTTCGTGGA 

GTCTCATGGGATTGT — 869 

II III I I I I I I I I I I I I I I I 

Db 1117 

CTGAGAATGTTCAAGATTTGCTGCTCTTGGATGTCACTCCTCTTTCCCTTGGTATTGAAA 1176 

Qy 870 CTGGT TTC 

CCTTACAG- 8 85 

I I I I I III 

I I I 

Db 1177 

CTGCTGGTGGAGTCATGACTGTCCTCATCAAGCGTAATACCACCATTCCTACCAAGCAGA 1236 

Qy 886 GGCC— CCCTCCC CCGGGAATGCGCCCACC 

AAG 916 



Db 1237 

CACAGACCTTCACTACCTATTCTGACAACCAGCCTGGTGTGCTTATTCAGGTTTATGAAG 12 96 

Qy 917 GC CCTAGACT CATCTTGGC CCTCCTCAGC- 

TCC 948 



Db 1297 

GCGAGCGTGCCATGACAAAGGATAACAACCTGCTTGGCAAGTTTGAACTCACAGGCATAC 1356 

Qy 9 49 CTGCCTGTTTCCCGTAAGGCTGTACAT AGTC- 

CTTTTATCTCCTTGTGGCCT 999 



Db 1357 CT-CCTGCACCCCG — AGG-TGTTCCTCAGATTGAAGTCACTTTTG — 

ACATTGATGCCA 1410 

Qy 1000 AT G AAA CTGGTTTATAATAAACTC TTAAGAGAAC 

ATTATAA 10 40 



Db 1411 

ATGGTATACTCAATGTCTCTGCTGTGGACAAGAGTACGGGAAAAGAGAACAAGATTACTA 14 7 0 

Qy 1041 TTGC 1044 

I I 

Db 1471 TCAC 1474 
Qy 2 AATTCCCTGAGGAGGCGAATCCGGC-

TCAGAGCCATCA 43 



Db 280 ATTACCATGATGAATC — 

AT C C AAC T G AT GC C T GG AAAGAAGGGC AAT TC AAAGAT AT C A 33 7 

Qy 4 4 GAACC GCCACCATG — 

ACGGTGGGCAAGAGCAGCAAGATGCTGC 85 

III I I I I II II I 

Db 338 

TTACCAAGGTTGCCAATGTGGAACTATACTACAGAGCAATACAGTTCTACTTAGAATTCA 39 7 

Qy 86 AGC — AT AT T GAT T AC AG GATGA- 

GGTGCATCCTGCAGGACGGCCGGATCTTCATT 13 8 



Db 398 AGCCTCTGTTGTTAAATGATTTGCTGATGGTGCTGTCTCC 

ACACT 453 

Qy 139 GGCACCTTCAAGGCTTTTGACAAG CAC 

ATGAATTT 173 

I I I I I I III I I I I III 

Db 454 

CGTGCAGTCAATTATTTCAGCAAGGTTAAACAGCTACCACTGGTGAAACCGTATTTGCGT 513 

Qy 174 GATCC TCTGTGACTG TGA 

TGAGTTCA 199 



Db 514 

TCAGTTCAGAACCATAACAACAAATCTGTGAATGAATCATTGAACAATCTTTTTATTACA 5 73 

Qy 200 GA — AAG AT C AAGC - C AAAGAAC T C C AA 

ACAA 22 8 

II III I I I I I I I I I I I III 

Db 574 

GAAGAAGATTATCAGGCTCTGCGAACATCAATAGATGCTTATGACAACTTTGACAATATC 633 



-GGGTA — 
III I I I I I I 



I I I I I I I I I II 
ACGGTTGGATC- 



Qy 



2 2 9 G C AG AAAG G G AAG AG AAGC GAG T C C T C GGTCTGGTGC — 



Db 634 

TCGCTTGCTCAGCGTTTGGAAAAACATGAACTCATTGAGTTCAGGAGAATTGCTGCTTAT 693 

Qy 2 72 GAGGGGAGAAT — CTGGTCTCAATG ACAGT 

AGAGGGACCTCCT- 312 

III I I I I I I I I II I III III III 

I I I 

Db 694 

CTCTTCAAAGGCAACAATCGCTGGAAACAGAGTGTAGAGCTGTGCAAGAAAGACAGCCTT 753 

Qy 313

CCCAAAGATACTGGTATTGCTCGAGTTCCA 342 

I I I I I I I I I I I I I I I II 

Db 754 

TACAAGGATGCAATGCAGTATGCTTCTGAATCTAAAGATACTGAATTGGCTGAAGAACTC 813 

Qy 3 43 CT TGC — TGGAGC 

TGCCGGG 36 0 



Db 814 

CTGCAGTGGTTTTTGCAGGAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTACCTGT 8 73 

Qy 361 GGC CCAGGGATCGGCAGGG CTGC- 

TGGCAGAGGAATCCCAGCTGG 40 4 

I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 8 74 TACGATCTTTTAAGGCCAGATGTCGTCCTAGAAACTGCATGGAGGCACAATATCA-- 

TGG 931 

Qy 405 GGTTC- 

CCATGCCCCAGGCTCCTGCAGGACTTGCTGGGCCAGTCCGTGGGGTTGGCGG — 461 

II I I I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 932 ATTTTGCCATGCCCTA-TTTCATCCAGGTCATGAAGG 

AGTACTTGACAAAGGTGGAT 987 

Qy 462 GC CATCCC AACAGGTGATGACC 

CCACAAGGAAGAGGTAC 50 0 



Db 988 

AAAT T AG AT GC T T C AG AAT C AC T G AG AAAAGAAGAAGAAC AAGC T AC AG AG AC AC AAC C C 10 4 7 

Qy 501 — TGTT GCAGCCGCTGCAGCTGCTGCCA-CAG 

CCAGTATTGCCGGGGCTCCA 549 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 10 48 ATTGTTTATGGTCAGCCCCAGTTGATGCTGACAGCAGGACCCAGTGTTGCC- 

GTCCCT 1104 

Qy 550 ACCCAGTACCCACCTGGCCGTGGGGGTCCTCCCCCACC- 

TATGGGCCGAGGAGCACCCCC 608 

I I I I I II III III II I I I I I I I I I I I I I 

I 

Db 1105 CCCCAGGCACCTTTTGGTTATGGTTATACCGCACCACCGTATGGACAG 

CCACA 1157 



6 09 TCCAGGCAT GATG-GGCCCACCTCCTG- 



1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 1 1 1 

1 1 

Db 1158 

GCCTGGCTTTGGGTACAGCATGTGAGATGAAGCGCTGATCCTGTAGTCACCTATTTTCGT 1217 

Qy 63 7 A-TGAGACCTC — CTATGGGTCCCCC 

AATGGGG 666 

I I I I I I I I III I I I I I I I I I I I 

Db 1218 ACTGAAACATCGTCTTT 

ACCCACTTCTCAGTTTATAATGGGGGAAAACAGGCAACGT 12 7 4 

Qy 66 7 ATCCCC CCTGGAAGA 

GGGAC 6 86 

I II I I I I I I 

I 

Db 1275 

GTTCTTGTAACCTTTATTTCATGAAGGACTTCTTTTTGTTTCTAACTATAAACTTGGATC 133 4 

Qy 687 TCCAATGG GCATGCCCCCTCC 

GGGA 711 

I I I I I I I I I I I I I 

I I I I 

Db 1335 

ACCTATGTTAAAACCTTATTTCACATTCCACATCATTTTAGAATTTATTTTCGAAGGGGA 139 4 

Qy 712 AT GC GGC CTCCTCCCCCT 

GGGAT 73 4 

II I III I I I I I I I I I 

I 

Db 1395 

ATAGTTTCAATGTTTTATTCACTTGGGCTTTTTTTCTTCCCCCTCTTTCTTTAAAGAACT 1454 

Qy 735 GC GAGG — CCTTCTTTGACC 

CT 754 

II I I I I I I I I I I I 



Db 1455 

GCTCAATATTCAATCTGTTGTGAAGAACCTGATTTGCACTCTGTAGTGTTTAAAGAAACA 1514 

Qy 755 TGGCCAC 

AGAGTATGGAAGTAGCTCCGCAGAGGCGT 79 0 

III I I I I I I I I I I I I I I 

I I I 

Db 1515 AAGAAACTCTAATATTGAATCTCTTAAATTTAGTGTATGTAAACAGCT- 

TACAAATACGT 15 73 

Qy 791 G — GGCTCGATTC 

CTCAGGG 808 

III III 

I I 

Db 1574 

ATTGTCTAAATGCATTTAAATCTGTTTTATTCAAAGAAAAGCTAAAGCAAAAACACTGG- 1632 

Qy 8 09 CCACGTTACCA — CAGACCTGTTTGTTTCT 

TATGCTGTTGTTCGTGG 853 

I I I I I I I II I I I I I I I I I I I 

Db 1633 - 

CATATGACCATGCAAGACTGTCAGTGCCAACAAAGACAACACTAATCAGCACATCGT — 16 89 



Qy 854 AGTCTCATGGGATTGTCTGGTTTCCC TTACAGG- 

GCC— 889 

II I I I I I I I I I I I I I I I I I I 

Db 1690 

ACACTGGATTGCAGTGCTTCCCAGATTATTGAAAAATGTTACAGACAACTTGCCTG 17 45 

Qy 89 0 CCCTC 

CCCC G 899 



Db 1746 

ATTTTTAAATGAGCGTAAAAGGCCCTCTAACCTATGCAGGTTTCCCCATTATGCATATAG 18 05 

Qy 900 GGAATGC GCCCAC CAAG-GCCCT 

AGACT 926 



Db 1806 

AAAATGCTAGTATGTTTTGCTCACTTCATATGTAACAGGTGCCCTTATGTTGTGCTGTAT 1865 

Qy 927 CAT CTT GGCCC — TCC--TC AGCTCCCTG — 

I I III I I I I I I I I I I I I I I I I 

Db 1866 

CCTGTGCTTTTTCTGTGGGACCATTCCATTCAGGAGCAAAGAGCACCATGATTCCAATCT 192 5 

Qy 954 TGT TTCCC 

GTAAG 966 

III I I I I I I I I I 

Db 1926 

TGTGTGTGTTTACTAACCCTTCCCTGAGGTTTGTGTATGTTGGATATTGTGGTGTTTTAG 19 85 

Qy 967 GC TGT AC 

ATAGTCCTTTTATCTCC 990 

I I I I I I I I I I I I I I I I 

Db 19 86 AT C AC T GAG T G T AC AG AAG AG AG AAAT T C AAAC AAAAT AT TGCTGT — 

TCTTCAGTTTTG 2043 

Qy 991 -TTGTGGCCTATGAAA CTGGTTTATAATAA 

ACTCTTAAGAGAACATT 1036 



Db 2044 

T T T G T GG AAT T T G AAAT T AC T C AAAT T T AAAAT AAAT T AC T GG AC T G T GG AAAT AAC A- T 2102 

Qy 103 7 ATAATTG 10 43 

I I I I I I 

Db 2103 AGAATTG 2109 



RESULT 10 
US-10-478-914-62 

; Sequence 62, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 



; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 62
; LENGTH: 1946
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-62 

Query Match 40.3%; Score 2623.5; DB 1; Length 1946; 

Best Local Similarity 41.8%; Pred. No. 5.9e-164; 
Matches 629; Conservative 0; Mismatches 362; Indels 515; 
Gaps 92; 

Qy 1 GAATTCCCTGAGGA GGCGAATCCGGCGGG 

TAT C AG AGC C AT C AG AAC C GC C 51 

I I I I I I I I I I I III I I I I I I I I I I I I I I I I 

I 

Db 39 GAATTCCTCGAGCACTGTTGGCCTA-CTGGCTGAGGCTTATCTGA — CATCT- 



Qy 52 ACC 

ATGACGGTG 63 

I I I 

I 

Db 95 

CCCTGGTGTGTGTGTGTGTGTGTGGTGTTTTTGTGTGTGTGGGGTGTGAGTGTGGCAGAG 154 

Qy 6 4 GGCAA GAGCAGCAAGATGCTGCAG CATATTGATTAC- 

AG 101 

III I I I I I I I I I I I I I I I I I I I I 

I 

Db 155 

AGGAAAGGGGAGCTGTTTGATTCTGTAGTTCCTTTTCTGCCTTCTTCCTTTCTCTGTAAA 214 



Qy 102 GATGAGGTGC — ATCC TGCAGGACGGCCG 

GATCT T 134 

I III I I I I II I I I I I I I II 

I 

Db 215 CTTTGGATACTTATCCAAATTACT- 

AACGGCAGATTGAGCCCTATGCAGATGGCATGTGT 2 73 

Qy 135 CATTGGCACCTTCAAGGCTTTTGACAAGCACATGAATTTGAT — CCT-- 

CTGTGA 185 

I I I I I I I I I I I I III I I I I III III 

I 

Db 2 74 CTGTGACAACCTCT — GCTCTCCACAT- 

CTCTTGGGCCTGTTTACCTGCGCTCCCAGAGC 330 



Qy 186 C T G T G -AT GAG T T C AG AAAG AT C AAGC C AAAG AAC T C CAAACAAGCAG — 

AAA 235 

II I I II II I I I I I I III I I I I I III 

Db 331 CTCCGCCAGCATCCCAGAATCTCCATCCCCATCTCTCACTTATACACACAT- 

CAGTCATC 3 89 



Qy 236 GGGAAG— AGAAGCGAGTCCTCGGTC 

TGGTGCTGCTGCGAGGGGAGAATCTGG — 286 

III I I I I I I I I I I II II I I 

Db 390 

GGTTATCCATTAGCTAAACCGCCTTCCTTAATAGCTTTACACTGTTTGCTTTCTCTGGAA 4 49 



Qy 287 TCTCAATGACAGTAGAGGGACCTCCTCCCA 

AAGATACTG 325 

II I I I I I I I I I I I I I I I I I I 

I 

Db 45 0 CATTTTTAGTTAAAATTTCATAATG-CAGTTG 

C AC AC AAAT G AAG AC AC AG 499 

Qy 326 GTATTGCT CG — AGTTCCAC TTGCTGG- 

AGCTGCCGGGGGCCCAGGGA 3 70 

I I I I I II I I I I I I I I I I I I I I I I II 

I I I I I I 

Db 50 0 — ATGGCTGCATCCTCCGTCTCTTCCCCTCGTTTACAGGAAGCTGC 

GGATCAGGGA 553 



Qy 3 71 TCGG CAGGGCTGCT GGCAGAGGAATCCCAGCTGGGGTTCCCA- 

TGC 415 

II I I I I I I I I I I I I I II I I I I I I I I 

Db 55 4 GGGGTGTTAGGGTTACCCACATGGTAAGGGCAGAG 

ACAAGAGGGGACCCCAGTTT 608 



Qy 416 CCCAGGCTCCTGCAGGAC- 

TTGCTGGGCCAGTCCGTGGGGTTGGCGGGCCATCCCAACAG 4 7 4 

I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 6 09 TCCATGCTGCACATGGTCATTGCTGGG GAC — 

TGAGGTTTGCACATCACCCTGCCCT 663 



Qy 4 75 GTGATGACCCCACAAGGAAGAGGTAC TGTTGC- 

AGCCGCTGCAGCTGCTGCCACA 52 8 

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I 

Db 664 

GTTCTCCCTCCGCTGGGGAGAAAGTCAGGGATGGAGCAAGCTGCAGCATCTTCTGAAAAA 723 



Qy 529 GCCA GTATTG 

CCGGGGCTCCAACCCAGTACCCACCTGGCCGTGGGGGTCCTC 5 8 0 

I I I I II I I I I I I I I I I I I I I I I I I I III 

I I 

Db 72 4 GAAAAGGTGGCCTTGTCTCCAGGTCTCCCCTCAAGT-CCCACCTTCCCAT- 

AGTTTTCTG 781 

Qy 581 CCCCACCTATGGG CCGAGGAGCAC— CCCCTCCAGGCATGATGGGC- 

CCA 627 

I I I I I I I II I I I I I I I I II I I I 

I I I 

Db 782 CCACTTCTCTTGATTTTCC TCTCTGCCACTTCT — 

CTTGATTTTCCTCTCTGCCA 834 



Qy 62 8 C — CTCCTGGTATGAGACCTC CTATGGG 

TCCCCCAATGGGGA 66 7 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 835 CTTCTCTTGATTT 

TCCTCTCTGCAGCTGCTTTGAGGTGGGTTTTCTCCAGATGCACA 891 



Qy 66 8 — TCCCC CCTGGAAGAGGGAC — 

TCCAATGGGCAT 69 8 

I I I I I I I I I I I I I I I I I I I I III 

Db 892 

CTTTCCCCTGCTTTGCGTCCTTATTCTGGTAGAAGCACAATCTAAAGCTCATTAAGGGAA 951 

Qy 699 GCCCCCTCCGGGAATGCGGC CTC CTCC-CCCTGGGA 

TGC 736 

I I I I I I I I II III I I I I I I I I 

I I 

Db 952 

CTAATCAATTCTGTGCATGGCGCTAGCTCAGCAGATCACCACACAGGCAGCACTATTAGC 1011 

Qy 73 7 GAGGCCTTCTTTGACCCTTGGCC — ACAGAGTATGGAAGTAGCTCC GCAG- 

AGG 787 

III I I I I I I I I I I I I I I I I II II III 

I I I 

Db 1012 

AAGTCGGTGCTTAACACATGGCACTTCCATGAATCGATATGGAGCCCGTGTAGAACAAGG 10 71 

Qy 78 8 CGTGGG CTCGATTCC TCAGGGCCAC-GTTACCA 

CAGA 823 

I I I I I I I I I I I III I I I I I I I I 

Db 1072 

CATGGGTTTTTTTCTCTTCCCATTAAGAAAAACTGATGCCAAAAATAACTTCTCAGATAT 1131 

Qy 824 CCTGTTTGTTT 

CTTATG- 8 40 

I I I I I I I I I I 

I 

Db 1132 

TTTCAAGTATGACTTTTATGAAGGGAAAAAGCATTTTTGTTTGCAAAATCATGCTTCAGT 1191 

Qy 841 CTGTTGT TCGTGGAG TCTCATGGGATTGT — CT 

GGTT 875 

I I I I I I I I I I I I I I I I I I I III II 

I I I I 

Db 1192 

GCAGGCCAGTTGTGAATTGTGATGGCTTTTATTTCTCCTGGGGCTGTAACTTTAAGGGTT 12 51 

Qy 8 76 T CC CTTA- 

CAGGGCCCCCTCCCCCGGGAATGCGCCCACCA 914 

I II I I I I I I I I I I I I II 

Db 12 52 TAGAATTTGGAACCACAGCCTAGCTAATCATGACACACACACAC 

AC AC AC AC AC 13 05 



Qy 915 AGGCCCT AGACTCAT CTTGGCCCTCCTCAGCTCCCTGC 

CTGT 956 

I I I I I I I I I I I II I I I I I I I I 

I 

Db 1306 

ATGCACACACATACACATACACAAAGCATCACGAAGAACCATACAAATTGTACATTATTT 1365 

Qy 957 TTCCCGT-AAGGCT GT AC AT AG — TCC TTTTATCT 

CCTTGTGGC 997 

I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 1366 

TACACATGGAGGCTCACTCTAAAATAGATACCATTTTAAATATTAACTAAAACTTGTGCT 142 5 
Qy 99 8 CTATGAA ACTGGTTTATAATAAAC 



T C T T AAG AG AAC AT TAT 103 8 

I I I I I I I I I I I I I I I I I I I I I 

Db 1426 

CATTGTATGTTCATTCTATATGTACTGATTTGTTATTCACATTTCTTTCAAAATCGTTCA 14 85 

Qy 1039 AATTGC 1044 

I I I I I 

Db 1486 AATTTC 1491 



RESULT 11 

US-10-478-914-20/C 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20
; LENGTH: 2495
; TYPE: DNA
US-10-478-914-20

Query Match 39.2%; Score 2555.5; DB 1; Length 2495; 

Best Local Similarity 37.3%; Pred. No. 2.3e-158; 
Matches 665; Conservative 0; Mismatches 370; Indels 749; 
Gaps 102; 

Qy 1 GAATTCCCTG AGGAGGC GAATCCGGCGGG 

TATCA 3 4 



Db 1791 

GGGAAACCTGCATAGGTTAGAGGGCCTTTTACGCTCATTTAAAAATCAGGCAAGTTGTCT 1732 

Qy 35 GAGCCAT CAGAACCGCCACCATG 

ACGGTG 63 

I III I I I I I I I I I I I I I I 

Db 1731 

GTAACATTTTTCAATAATCTGGGAAGCACTGCAATCCAGTGTACGATGTGCTGATTAGTG 16 72 

Qy 6 4 GGCAAGAGCA GCAAG ATGC TGC — 

AGCAT 9 0 



Db 1671 

TTGTCTTTGTTGGCACTGACAGTCTTGCATGGTCATATGCCAGTGTTTTTGCTTTAGCTT 1612 



Qy 91 ATTGATT ACAGGATGAGGTGCATCCTG-CAGGACG-

GCCG 128

I I I I I I I I I II I I I I I III III II 

Db 1611 

TTCTTTGAATAAAACAGATTTAAATGCATTTAGACAATACGTATTTGTAAGCTGTTTACA 1552 

Qy 129 GATCTTCATTG 

GCAC 143 

II I I I I I I III 

Db 1551 

TACACTAAATTTAAGAGATTCAATATTAGAGTTTCTTTGTTTCTTTAAACACTACAGAGT 1492 

Qy 144 CTTCAAGGC 

TTTTG 157 

I I I I I I I I I I 

Db 1491 

GCAAATCAGGTTCTTCACAACAGATTGAATATTGAGCAGTTCTTTAAAGAAAGAGGGGGA 1432 

Qy 158 ACAAGCACA— TGAAT TTGA TCCTCTGTGA 

CT— 187 

I I I I I I I I I I I I I I I I I I I I I II 

Db 1431 

AG AAAAAAAGC C CAAG T G AAT AAAAC AT T GAAAC TATTCCCCTTC G AAAAT AAAT TCTAA 1372 

Qy 188 - G T GAT GAG T T C AG AAAG ATC AAG CCAAAGAACTCCAA 

ACAAGC 23 0 

I I II I I I I I I I I I II I I I I I 

I 

Db 1371 

AATGATGTGGAATGTGAAATAAGGTTTTAACATAGGTGATCCAAGTTTATAGTTAGAAAC 1312 

Qy 231 AGAAAGG GAAGAGAAG CGAGTCCTCG — 

GTCTGGT 263 

I I I I I III III I I I I I I I I I I I 

Db 1311 

AAAAAGAAGTCCTTCATGAAATAAAGGTTACAAGAACACGTTGCCTGTTTTCCCCCATTA 12 52 

Qy 26 4 GCTGCTGCGAGGGGAGAATC-TGGTCTCAAT — GACAGTAG 

AGGGAC — 307 

III I I I I I I I I I I I I I I I I I I I I 

Db 1251 

TAAACTGAGAAGTGGGTAAAGACGATGTTTCAGTACGAAAATAGGTGACTACAGGATCAG 1192 

Qy 308 CTC CT CCCAAAGATACTGG- 

TATTGCTCGAGTTCCACTTGCTGG 350 

III II I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1191 CGCTTCATCTCACATGCTGTACCCAAAG — CCAGGCTGTGGCT 

GTCCATACGGTGG 113 8 

Qy 351 AGCTG CC GGGGGCCCAGGGA TCGGCA GGG — 

CTGCTGG 386 

III II I I I I I I I I I I I I I I III 

I I I I I I 

Db 1137 

TGCGGTATAACCATAACCAAAAGGTGCCTGGGGAGGGACGGCAACACTGGGTCCTGCTGT 10 7 8 

Qy 387 CAGAG 

GAATCCCAG 40 0 

III I I I I I 

Db 1077 



CAGCATCAACTGGGGCTGACCATAAACAATGGGTTGTGTCTCTGTAGCTTGTTCTTCTTC 1018 

Qy 401 CTGGGGTTCCCATGC CC CAGG— 

CTCCTGCAGGAC 433 

I I I I I I III II III I I I I I I I 

I I I 

Db 1017 

TTTTCTCAGTGATTCTGAAGCATCTAATTTATCCACCTTTGTCAAGTACTCCTTCATGAC 95 8 

Qy 43 4 TT GCTGGGCCAG TCCGTGGGGTT 

GGC 459 

I I I I I I I I I I I I I I I I 

I 

Db 957 

CTGGATGAAATAGGGCATGGCAAAATCCATGATATTGTGCCTCCATGCAGTTTCTAGGAC 89 8 

Qy 46 0 G GGCC ATCCCAACAGGTGA — TGACC — 

CCACAAGGAA 493 

I I I I I III I I I I I I I I III I I I I I I 

Db 897 

G AC AT C T GGC C T T AAAAG AT C G T AAC AGG T AAAC AGAC AAGC T C C AAAGC AC TCTCTTTT 838 

Qy 49 4 GAGGTACTGTTGCAGCCGCTGCAG 

CTGCTGCCA 526 

I I I I I I I I I I I I I I I I I I I I 

Db 837 

TTCTTCCTGCAAAAACCACTGCAGGAGTTCTTCAGCCAATTCAGTATCTTTAGATTCAGA 7 7 8 

Qy 52 7 — CAGCCAGTATTGC CGGGGCTCCA- 

ACCCAGTAC 558 

I I I I I I I I I I I I I I I I I I I I 

Db 777 

AGCATACTGCATTGCATCCTTGTAAAGGCTGTCTTTCTTGCACAGCTCTACACTCTGTTT 718 

Qy 559 CCACC TGGCCGTGGGGG TCCTCCCCCAC- 

CTATGGG 593 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 717 

CCAGCGATTGTTGCCTTTGAAGAGATAAGCAGCAATTCTCCTGAACTCAATGAGTTCATG 65 8 

Qy 59 4 CCGAGGAGCACCC CC TCC-AGGCA 

TGATGG 622 

I I I I I I I I I I I I I I I 

I I I I I 

Db 657 

TTTTTCCAAACGCTGAGCAAGCGAGATATTGTCAAAGTTGTCATAAGCATCTATTGATGT 59 8 

Qy 623 GCCCACCTCCTGGTATGAGACCTC 

CTATGGGTCCCCCAATGG 66 4 

III I I I I I I III I I I I I I II 

I 

Db 597 

TCGCAGAGCCTGATAATCTTCTTCTGTAATAAAAAGATTGTTCAATGATTCATTCACAGA 53 8 

Qy 665 GGATCCCCCCTGGAAG AGGG — ACTCCAATGG — GC — 

ATGCCC 702 

I I I I I I I I I III I I I I I I I II I 

I 

Db 537 

TTTGTTGTTATGGTTCTGAACTGAACGCAAATACGGTTTCACCAGTGGTAGCTGTTTAAC 4 7 8 



Qy 703 CCTCCGGGAATGC-GGCCTCCTC CC — 

CCTGGGATGCGAGGCCTTCTTTGA 750 



Db 477 

CTTGCTGAAATAATTGACTGCACGAGTGTGATCCAACCGTGGAGACAGCACCATCAGCAA 418 

Qy 751 — CCCTTGGCCACAGAGTATGGAAGTAGCTCCGCAGAGGCGTG-GGCTC- 

GAT 799 



Db 417 AT CAT T T AAC AAC AGAGGC T TGAAT T — 

CTAAGTAGAACTGTATTGCTCTGTAGTATAGT 360 

Qy 800 TCCTCA-GGGCCACGTTACCACAGACCTGTTTGT TTCTT 

ATGCTGTTGTT 848 



Db 359 

TCCACATTGGCAACCTTGGTAATGATATCTTTGAATTGCCCTTCTTTCCAGGCATCAGTT 3 0 0 

Qy 8 49 CGTGGAGTC-TCATGGG ATTGTCTGGTT TCC 

CTTACAGGGCC 8 89 



Db 299 

GGATGATTCATCATGGTAATTATGGCATTATCATATTCTTCATACTTGTCATACAAAAAC 240 

Qy 89 0 CCC— TCC-CCCGGGAATGCGC C — CACCAAGGCCCTAG 

ACTCATC 93 0 

II II III I I I I I I II II I I I I 

Db 239 

ACCAGTTCTGCCCAAAGATGAGCTTGTTCTGCAGCTCTTAGCACCTTGGGAATATTCACT 18 0 

Qy 931 TTGGCCCTCCTCAGCTCC CTGCCTGTTTCCCGTAAGGCT 

GTACA- 9 74 



Db 179 CTAGACCAGAACAGCTCCAGGTGCTCCCTCATTTTC- 

T G AGGC T T AAAT T TAG AG TAT AG 121 

Qy 9 75 TAGTCCTTTTATCT CCTTGTGG 

CCTATGAAACT G 1008 

I I I I I I I I I I I I I III 

Db 120 

AATAGCTAATTCAGTAAACATTCCCATGTGAGCTCGCTCAAGTCCCAGTGCTGCTTCCAA 6 1 

Qy 10 09 G T T T AT AAT AAAC T C T T AAGA GAACATTATAATTGC 10 44 

I I I I I I I I I I I I I I I I I I I I I I 

Db 6 0 CATGGTGATCAGCTCTTCAAAATAGCCACGATCCTGATAGTAGC 17 



RESULT 12 

US-10-478-914-57/C 

; Sequence 57, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 



; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30
; SEQ ID NO 57
; TYPE: DNA
US-10-478-914-57

Query Match 38.4%; Score 2502; DB 1; Length 2336; 

Best Local Similarity 36.2%; Pred. No. 8.7e-155; 
Matches 626; Conservative 0; Mismatches 366; Indels 736; 
Gaps 95; 

Qy 3 ATTCCCTGAGGAGGCGAATCCGGCGGGTATC AGAGCCATC 

AGAAC 4 7 

I I I I I I I I I I II I I III I I I I II I II 

Db 1689 

AACCCATGAAGGGAAAAATCCTCCTTTTTTCCCCCAAGTGCATTCTACTTATTACTTTGC 163 0 

Qy 4 8 CGCC ACCATGACGGTG GGCAAGAGCAGCAAG ATGC — 

TGC 85 



Db 1629 

ATTTTTCTCCCAAAGTCCAAATTTATGCAAAGAAAATAGAAACAAGTTCAAATGCAATGC 157 0 

Qy 86 AGCATATTGATTACAGGATGAGGTGCATCCT — GC 

AGGA 122 

I I I I III II I I I I I I I 

Db 1569 

ATTAACCAAATAAAACAAGTCTCCTTCAAATTAGGAACCAACCTAAGCATTTGTAAAGTG 1510 

Qy 123 CGGCCGGATC TTC ATTGGCACC 

TTCAAGG- 151 

I I I I I I III I I I I I I I I I I I 

Db 1509 

TAGCAGAATCAGAATTCTTTTAAAAATTAGATTTGGAACCTGAACTATATAATTCATAAT 145 0 

Qy 152 CTTTTGACAAGCACATGAATTTGATCCTCTGTGACTGTGATGA- 

GTTCA- 199 



Db 1449 

TCTCATTTTTCTGTGGAAAATTATTTTATCTTTCTCCTGTATACCTGAAAAAATGTCCAT 139 0 

Qy 200 GAAAGATCAAGCCAAAGAACTCCAAACA AGCAG — 

AAAG G G AAG AG AAG C G 2 48 



Db 1389 

AG G C T T AAAG GGTCATGCTTT T AC AT TCCTTCCATAT C AC AGG T AC TAT G AAG T AAG GAG 133 0 



Qy 2 49 A GTC 

CTCGGT 258 

I III I I I I 

Db 1329 

ACTTTTAGGTTTCTTTTTGTCTTAAACTCAGACAGCTTTGTAAGCAGTAGTGTGTAGATT 12 7 0 

Qy 2 59 CTGGTGCTGCTGCGA 

GGGGAGAATCTGG 2 86 

I I I I I I I I I I I I I I I I I I I I I 

Db 1269 

ACAAGAGTTAGACAAAAGCAGGCGCGACTGAGAAGAGTTGGTGGGGGAGAAGCTTGGGGC 1210 

Qy 287 TCTCA AT G AC AG TAGAGGG 

ACCT 3 09 

I I I I II III III 

I I 

Db 1209 

ACTTCCTGTCACTCAACACATTCCAGATCACTAAAAAATTTCCACACCCTCTGCATTCCC 115 0 
Qy 310 CCT— CCCA— 

AAGATACTGGTATTGCTCGAGTTCCA 3 42 

III I I I I I I I I I I I I I I II I I I I I 

Db 1149 CCTTGCCCACTCCAGTTCCCGGTATT- 

TTCTGATTCCATATGTTGTGGTATTTACCATAC 1091 

Qy 3 43 CTTGCTGGA — GCTGCCGGGGG 

CCCAGGGATCG 3 73 

I I I I II I I I I I I I I II III 

I 

Db 1090 

TTCTCTCCCTCACTAGGCTCTGGCAAGACTGCTTCAGAGGGGATGCATTCCTTTAGATTG 1031 

Qy 3 74 GCAGGGCT GCTGGCAG AGGAATC 

CCAGCTG 403 

I I I I I I I I I I I I I I I I I I I III 

Db 1030 

CACAAAGCGGAGCTGGGAAAATGGCTGGCAGTTTCAGAATCTAGTCACGATCGCACGCAT 9 71 
Qy 404 GGGTTCC-CATGCC- 

CCAGGCTCCTGCAGGACTTGCTGGGCCAGTCCGTGGGGTTGGCGG 461 

II I I I I I I I I I I I I I II II I I I I I I I I I I I I 

I 

Db 970 

GAGCACCTCACACATCCATGTCCCTACCCGCCCCCCCGCTCCCGCCCCTGCAGCTGGCTG 911 

Qy 462 GCC-ATCCCAACAGGTGATGACC CCACAAGGAAG 

AGGT 49 8 

II I I I I I I I I I I I I I I I I I III 

Db 910 

ACCTGTCTCACCCACTGCTGGCCTATCGAACGGCCAGGACTGTCTGGTTTTGGCTCGTGC 851 

Qy 499 ACTGT TGC- 

AGCCGCTGCAGCTGCTGCCAC 52 7 

I I I I III I I I I I I I I I I I 

I 

Db 850 

CTTTGTCCATGTCTGGCTTAGTTCCTCTCTGTCTATGCTTGCCTCTACCCCCACCGCCCC 791 

Qy 52 8 AG CCAGTATT — GCC GGGG 



CTCCAA 550 



Db 790 

AGGCGGCACAAGTGTTTGGCCACACAAAACTAGAGATAGAAAAGGTGGTAAAAACTTCAA 731 

Qy 551 CC CAGTAC-CCACCTG GCCGTGGGGG TCCT 

CCCC 583 

I I I I I I I I I I I I I I I I I I 

I I 

Db 730 

ACTTTTCTAAATTCTCCAACAGTTTATTTCTTGTGAATTTCTTCCTTCTTTAAATACTCC 6 71 

Qy 58 4 C ACCTATGGGCCGAG 

GAGCACC 6 05 

I I I I III II I I I I I 

Db 670 

AT T T T AAG AAAAC AAAAAAAT T AAT T AT C T AAAGGC AAAGAATGGAAAGC AAC C T T T G T G 611 

Qy 6 06 CCCTCCAG — GCATGATGGG 

CCCA 62 7 

I I I I I I I I I I I I I I I 

I 

Db 610 

TTCCTTATAATAACTGACTTCATAACTCTCTCCAGCTGCGTTATGGGATGTGTATAAAAA 551 

Qy 62 8 CCTCCTGGTATGAGACCTCCTATGGGTCCC C — CAATGGG 

GATCCCCCCTGG 677 

I I I I I I I I I I I I I I I II I I I I I I I II 

I I I 

Db 550 GCTTCTGTTCTGAGA 

AC AAAG GAGCACGT G C AG AAAT GAG AC G AAAAAAT C C AC T G A 494 

Qy 6 78 AAGAGGGACTCCA ATGGGCATGCCCC- 

CTCCGGG— 710 

II I I I I I I I I I I I I I I I I I 
Db 493 CAGT 

ATTCCATTACACAAATTACTTAAAAGATTTTAGTCAAGCCCCTCAACAGATTC 43 7 

Qy 711 AATGCGGCCT 

CCTCCC 726 

I I I I I I I I I I 

Db 436 

AATTTTAAAATGGCTTTTAGTTAAAAAAAAAAAATTGAAAGTGCTTACCCAGTAAAAGAA 3 7 7 

Qy 72 7 CCTGG GATGCG-AGGCCTTCTTT 

GACCCTTGGCCACAGAGT 766 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 376 

CCGAAGTAGTCCTGAACTGTTACGTAAGACTTTTTACAGTTGGATCTTTGTCAAAAGGGG 317 

Qy 76 7 ATGG AAGTAGCTCCG CAGAGGCGT — GGGCT- 

CGATTCCTCA 8 05 

I I I I I I I I I I I I III I I I I I I I I I I 

Db 316 

ATGGGGGTGATGGGAGAAAGCAGCAACGACAATCAAAAAAGTTCGAGCTGCTGTGGCTAG 2 5 7 

Qy 8 06 GGGCCACGTTACCACAG ACCTGTTT- 

GTTTCTTATGCTGTTGTTCGTGGA 854 



Db 2 56 AGGACA ACTTCTGTGTTTCCAGATAGGATTCT— TGCTGTAG- 

AAATGGAACTTCC 204 

Qy 855 GTCTC 

ATGGGATTGTCTGGTTTCCC 879 

III I I I I I I I I I I I 

Db 203 

AGCCAGCACAGCATCCTGTCCCAGTAGAGAAATGAGTTTGTCAGTTAAAACAAAAAAAAA 14 4 

Qy 880 TTACAGGGCCCC 

CTCCCCCGGGAATGCGCCCACCAAGGCCCTAGACTCATCT 931 



Db 143 ATTAGATACTGGAACCCAGGCTAGACGAGGTATTGAACCGC 

TTTCCT 91 

Qy 932 TG— GCCC— TC— 

CTCAGCTCCCTGCCTGTTTCCCGTAAGGCTGTACATAGTCCTTTTA 9 85 



Db 9 0 TGCAGCCCTGTCTGCTCAGCTCGC ATTCCAGTAGGCC — 

AACAGTGCTCGAGGA 39 

Qy 986 TCTCCTTGTGGCCTATGAAACTGGTTTATAATAAACTCTTAAGAGAAC 1033 

III I I I I I III II I I I I I I III 

Db 3 8 AT T C C — GC AGC T T T T AGAGC AG AAGTAACACTTC— CGTAC 1 



III I I I I 
GCC— AGA- 



RESULT 13 
US-10-478-914-57 

; Sequence 57, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 57

; LENGTH: 2336
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-57 

Query Match 38.3%; Score 2498; DB 1; Length 2336; 

Best Local Similarity 33.1%; Pred. No. 1.7e-154; 
Matches 681; Conservative 0; Mismatches 309; Indels 1070; 
Gaps 112; 



Qy 2 AATTCCCTG AGGAGGCGAATCCGGCGGGTA — TCAGAGCCATCAG-

AAC 47

I I I I I I I I I I I I I I I I I I I II I III I

I 

Db 171 

ATTTCTCTACTGGGACAGGATGCTGTGCTGGCTGGAAGTTCCATTTCTACAGCAAGAATC 23 0 

Qy 4 8 C GCCACCATGACGGTG 

GGCAAGAGCAGCAAGA 79 

I III I I I I I I I I I I I I I I I I 

Db 231 

CTATCTGGAAACACAGAAGTTGTCCTCTAGCCACAGCAGCTCGAACTTTTTTGATTGTCG 2 9 0 

Qy 8 0 -TGCTGCAG CATATTGA 

TTACAGGATGAGGTGC 111 

I I I I I I I I I I I I I I I I I I I 

I 

Db 291 

TTGCTGCTTTCTCCCATCACCCCCATCCCCTTTTGACAAAGATCCAACTGTAAAAAGT-C 3 49 

Qy 112 ATCC TGCAGGACGGC— CGGATCTT C ATTGGCACCTTCAA 

GG 151 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 350 

TTACGTAACAGTTCAGGACTACTTCGGTTCTTTTACTGGGTAAGCACTTTCAATTTTTTT 4 09 

Qy 152 CTTTTGAC AAGCACA TGAATTTGATCCT 

CTGTGAC 186 

I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 410 TTTTTAACTAAAAGC-CATTTTAAAATTGAATCTGTTGAGGGGCT- 

TGACTAAAATCTTT 46 7 

Qy 187 TGTGAT-GAGTTCAGAAA 

GATCAAGC 211 

I I I I I I I I I I I II 

I I 

Db 468 

TAAGTAATTTGTGTAATGGAATACTGTCAGTGGATTTTTTCGTCTCATTTCTGCACGTGC 52 7 

Qy 212 CAAAGAAC TCCAAACAAGCA 

GAAAGGGAA 2 40 

I I I I I I I I I I I I I I I I I I I 

Db 528 

TCCTTTGTTCTCAGAACAGAAGCTTTTTATACACATCCCATAACGCAGCTGGAGAGAGTT 5 8 7 

Qy 2 41 G AG AAG CGAGTCCT CGG TCTGGTGCTGC 

TGCGAGGGGAGAA- 281 

I I I I I I I I II I I I I I III II 

I I 

Db 588 

ATGAAGTCAGTTATTATAAGGAACACAAAGGTTGCTTTCCATTCTTTGCCTTTAGATAAT 6 4 7 

Qy 2 82 TCTGGTCTC AATG ACAGTAGAGGGACCTCCTCCC- 

AAAGAT 321 

I I I I I I I I I I I I I I I II III III 

I 

Db 648 

T AAT TTTTTTGTTTTCT T AAAAT GG AG T AT T T AAAGAAGGAAGAAAT TC AC AAG AAAT AA 7 0 7 

Qy 322 ACTG GTATT 

GCTCGAGTT 339 

I I I I I I I I I I I I I I I 



Db 708 

ACTGTTGGAGAATTTAGAAAAGTTTGAAGTTTTTACCACCTTTTCTATCTCTAGTTTTGT 76 7 



Qy 3 40 CCACTTGCTGGAGCTGCCGGGGGCCCAGGGATCGGCAG-GGC-TGC — 

TGGCAGAG 391 

III I I I I I I I I I I I I I I II I I I I I I I II I 

I I I I I 

Db 76 8 GTGGCCAAACACTTGTGCCGCCTGGGGCGGTGGG 

GGTAGAGGCAAGCATAGACAGAG 82 4 

Qy 392 -GAATC CCAG CTGGGGTTCCCATGCC CCAGGC— TCCT- 

GCAGGA 432 

III I I I I I II II III I I I I I I I I I I I I 

Db 825 AGGAACTAAGCCAGACATGGACAAAGG — 

CACGAGCCAAAACCAGACAGTCCTGGCCGTT 8 82 

Qy 433 CTTGCTGGGCCAGTCCGTGGGGTTGGCGGGCCATCC — 

CAACAGGTGATGACCC 48 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 8 83 C— GATAGGCCAG- 

CAGTGGGTGAGACAGGTCAGCCAGCTGCAGGGGCGGGAGCGGGGGG 939 

Qy 485 -C ACAAGGA AGAGGTACTGTTGC-AGC CGCTGC- 

AGCTGCTGCCA 526 

I I I I I I I I I I I I I I III II II I I I I I I I 

I 

Db 940 

GCGGGTAGGGACATGGATGTGTGAGGTGCTCATGCGTGCGATCGTGACTAGATTCTGAAA 999 

Qy 52 7 CAGCCAG TATTGCCGGGGCTC CAACCCA 

GTACCCAC 562 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1000 CTGCCAGCCATTTTCCC — 

AGCTCCGCTTTGTGCAATCTAAAGGAATGCATCCCCTCTGA 105 7 

Qy 563 CTGGCC GTGGGGG 

TCCTCCCCCACCTATGG— 592 

I I I I I I I I I I I I I I I I I I I I I 

Db 1058 

AGCAGTCTTGCCAGAGCCTAGTGAGGGAGAGAAGTATGGTAAATACCACAACATATGGAA 1117 

Qy 593 

GCCGAGG 599 

II I I I I 

Db 1118 

TCAGAAAATACCGGGAACTGGAGTGGGCAAGGGGGAATGCAGAGGGTGTGGAAATTTTTT 117 7 

Qy 600 AGCACCCC 

CTCCAGGCATGATGGGC 62 4 

II I I I I I I I I 

I 

Db 1178 

AGTGATCTGGAATGTGTTGAGTGACAGGAAGTGCCCCAAGCTTCTCC C 12 2 5 

Qy 625 CCACCTCCTGGTATGAGAC-CTCCT — 

ATGGGTCCCCCAATGGGGATCCCCCCTGGAAGA 681 

I I I I I II I I I I I I I I I I III I II III II 

Db 1226 

CCACCAACTCTTCTCAGTCGCGCCTGCTTTTGTCTAACTCTTGTAATCTACAC 12 7 8 



Qy 6 82 GGGACTCCAATGGGCATGCC CCCTCCGGGAATGCGGC 

CT 720 

I I I I I I I I I I I I I I II 

I I 

Db 12 79 ACTACT 

G C T T AC AAAG CTGTCTGAGTT T AAG AC AAAAAG AAAC C TAAAAG T C T 1331 

Qy 721 CCTCCC CCTGGGAT 

GCGAGGCCTTCTTTGACCCT 754 

I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1332 CCTTACTTCATAGTACCTGTGATATGGAAGGAATGTAAAAGCATGACC — 

CTTTAAGCCT 1389 

Qy 755 -TGGCCA CAGAGTAT GGAAGTAGC 

TCCGCAGAGGCGTGGG 793 

I I I I I I I I I I I I III II I I I I I I I I 

I 

Db 1390 ATGGACATTTTTTCAG- 

G T AT AC AGG AG AAAG AT AAAAT AAT T T T C C AC AGAAAAATG AG 14 4 8 



Qy 794 C TC-GATTCC 

TCAGGG 808 

I I I I I 

Db 1449 

AATTATGAATTATATAGTTCAGGTTCCAAATCTAATTTTTAAAAGAATTCTGATTCTGCT 15 0 8 



Qy 8 09 CCACGTTACCAC- 



AGACCTGT 82 8 

I I I I I I I I I I I I I I I 

Db 1509 

ACACTTTACAAATGCTTAGGTTGGTTCCTAATTTGAAGGAGACTTGTTTTATTTGGTTAA 156 8 

Qy 829 TTGTTTCT 

TATGC 8 41 

I I I I I I I I I I I I 

Db 1569 

TGCATTGCATTTGAACTTGTTTCTATTTTCTTTGCATAAATTTGGACTTTGGGAGAAAAA 162 8 

Qy 8 42 TG TTGTTCGTGGAG TC 

TCATGGGA 865 

II I I I I I I I II 

I I I I I I I 

Db 1629 

TGCAAAGTAATAAGTAGAATGCACTTGGGGGAAAAAAGGAGGATTTTTCCCTTCATGGGT 16 8 8 

Qy 866 TTG TCTGG 

TTTCCCTTAC 883 

II I I I I I I I I I I I I 

Db 1689 

TGGAAAGTATTTTAAAGGGTTGTTTTCTTGAAAAAACAAGCTCTCTCTTACTTTCTGCAT 17 4 8 

Qy 884 

AGG 8 86 

I I I 

Db 1749 

CTATGCTATAAAGATAACTATGCTATAATAAATGTAAGGTAGAAAACTTTAAAGAGAAAA 18 0 8 

Qy 887 GC CCC— 

CTCCCCC 898 



Db 1809 

TAACAGTGTTCTAAGTGAAAAGCTACTTAGCATTTTCCCAAACTCACACATTATCAACAG 186 8 



Qy 899 GGGAATGCGCCCACC 

AAGGCCCTAGACTCAT 929 

I I I I I I I I I I II I I I I I I 

Db 1869 

AAACACTAAAAATTAAGAGAAAAGCGGCCAACTAATTAAATAGCTAGCCTTATTTTGGGG 192 8 



Qy 930 CTTGG 

CCCTC 939 

I I I I I I I I I I 

Db 1929 

GTATGGGATAGAAAATTAAGTGTGAATAAAATGATACTTGGGAATGTTTCTCCCTCGTAC 19 8 8 

Qy 940 — C TCAGC TCCCTGCCTGTTTC 

CCGT 963 

I I I I I I I I I I I I I I I I 

I 

Db 1989 

CACAAAGGATGTTAGTGGTCAGCCTACGAGTTAATCCTTCCTAGCATGGCTCTGAGCCTT 2 0 4 8 

Qy 96 4 AAGGCTGTAC ATAGTCCTTT 

TATCTCCTTGT 994 

I I I I I I I I I I I I I I I I I I I 

Db 2049 

CATGCCGAGCAGACGTTATTCACATGACGATTCGAAAAGTCCATTCATATATCTC 2103 

Qy 995 GGCCTATGAAACTGGTTTATAATA AACTCTTAAG AG— 

AACAT 1035 

II I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 2104 -GC 

TACCTGGATTTGAATAGAAACCAGACAGCAATTCTTTAGTTCCAGCCACCAT 215 7 



-ATAATTGC 1044 
Qy 1 GAATTC— CCTGAGGAGGCGAATCCGGCGGGTATCAGAGCCA 

TCAGAACCGCCAC- 53 



Db 182 7 GAAATCCACCCGTCGCTG- 

GGATCCACAGGGGAGTAACGCCAAAACACGGCACCGCTACT 1769 

Qy 5 4 — CATGACGG TGGGCAAGAGCAGCAAGATGCT- 

GC AGC AT AT TGAT T AC A 100 



Db 1768 

GTCTTCGCAGCTTTTAGGATGAGGAGCCTATTTCTCTCCCATCACTGTAAAATAATTTTA 17 09 

Qy 101 GGATGAGGTGCATCCTGCAGGACG GCCGGATCTT 

CATTGGCACCTTC 147 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 170 8 TG-TATGTTG — TCCAGTAGAACAATGATGTTGAGAGTTCTTTCTACAATTTG — 

CCTCA 1654 

Qy 148 AAGGCTTT TGAC 

AAGCACATGAATTTGATCCTCTGTGACTG — 18 8 

I I I I I I I I I I III I I I I I I I I I I III 

Db 1653 G AAG C T T T T TAG T AT G T G AC AT TAG T T T T AAG AT C AT G 

T GAT AC T TAG T T AC T G AC 159 8 

Qy 189 

T GAT GAG T T C AG AAAG A 2 05 

I I I I I III II 

Db 1597 

AAAGTATGAGCAGTATTGATAACACCAGTGGGAAAAATAGATTTGAATTTATTTTGTGCG 153 8 

Qy 2 06 -TCAAGCC AAAG AAC T C C AAA CAAGCA 

GAAAG— 236 

I I I I I I I I I I I II I I I I II 

Db 1537 

CTCATGTTTG TAG AAAG T AC TAT T AAT AT T T AAT TGATGTGGGC T AGAAAT T T G AAC GAT 14 7 8 

Qy 23 7 GGAAGAGAAGCGAGTCCTCGGTCTG 

GTGCTGCTGCGAGGGG 2 77 

I I I I I I I I I I III II I I I I I 

Db 1477 

T T T GAAAG AAAT G T G AAT AAC AAAT C AGT AC AT AT AGAATGAAC AT AC AAT G AGC AC AAG 1418 

Qy 2 78 AG AATCTGGTCTCAATGACAGTAGAGGGA- 

CCTCC 311 

II II I I I I I I I I I I I I I I I I I I I I 

Db 1417 T T T T AG T T AAT AT T T AAAAT GG T AT C T AT 

TTTAGAGTGAGCCTCCATGTGTAAAATA 1361 



Qy 312 

T C C C AAAG AT AC T G 325 

II I I I I I 

Db 1360 

ATGTACAATTTGTATGGTTCTTCGTGATGCTTTGTGTATGTGTATGTGTGTGCATGTGTG 13 01 

Qy 326 GTATTGCTCG AGTTCCA 

CTTGCTG 349 



Db 1300 

TGTGTGTGTGTGTGTGTCATGATTAGCTAGGCTGTGGTTCCAAATTCTAAACCCTTAAAG 12 41 

Qy 350 GAGCTGCC GGGG GCC 

CAGGGATCGGCAGGGCTGC 3 83 

I I I I III III II I I I I I I I I 

Db 1240 

TTACAGCCCCAGGAGAAATAAAAGCCATCACAATTCACAACTGGCCTGCACTGAAGCATG 1181 

Qy 384 TGGCAGAGGAA TCCC 

AGCTGGG — 405 

I I I I I II III 

Db 1180 

ATTTTGCAAACAAAAATGCTTTTTCCCTTCATAAAAGTCATACTTGAAAATATCTGAGAA 1121 

Qy 406 GTT 

CCCATGC C 416 



Db 1120 

GTTATTTTTGGCATCAGTTTTTCTTAATGGGAAGAGAAAAAAACCCATGCCTTGTTCTAC 1061 

Qy 417 CCAGGCTCC TGC AG GACTTGC 

TGGGCC 443 



Db 1060 

ACGGGCTCCATATCGATTCATGGAAGTGCCATGTGTTAAGCACCGACTTGCTAATAGTGC 10 01 

Qy 444 AGTCCGTGGGGT TGGCGGGCCATC-CCA ACAGG— TGATGA 

CCC 484 

I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 1000 

TGCCTGTGTGGTGATCTGCTGAGCTAGCGCCATGCACAGAATTGATTAGTTCCCTTAATG 9 41 

Qy 485 CACAAGGAAGAGGTA 

CTGT 503 



Db 940 

AGCTTTAGATTGTGCTTCTACCAGAATAAGGACGCAAAGCAGGGGAAAGTGTGCATCTGG 8 81 

Qy 50 4 TG CAGCCGC — TGCAGCTGCTG CCA 

CAGCCAGTATT 53 7 

I I I I I I I I I I I I I I II I 

Db 880 

AGAAAACCCACCTCAAAGCAGCTGCAGAGAGGAAAATCAAGAGAAGTGGCAGAGAGGAAA 821 

Qy 53 8 GCC GGGGC TCCAACCCAGTACC— 

CACCT G 565 



I I I I I III I I I I III 

Db 820 

ATCAAGAGAAGTGGCAGAGAGGAAAATCAAGAGAAGTGGCAGAAAACTATGGGAAGGTGG 761 

Qy 566 GCCGT GGGGGTCCT CCC — CCACCTAT 

GGGCCGAGG — A 600 

III I I I I I I I I I I I I I I I III 

Db 760 

GACTTGAGGGGAGACCTGGAGACAAGGCCACCTTTTCTTTTTCAGAAGATGCTGCAGCTT 7 01 

Qy 601 GCACC — CCCT CCAGGCATGATGGGCCCACCTCCTGGT- 

ATGAGACCTCCT 6 48 

I I I I I I I I II II II II I I I I I I I I 

I I I 

Db 700 

GCTCCATCCCTGACTTTCTCCCCAGCGGAGGGAGAACAGGGCAGGGTGATGTGCAAACCT 6 41 

Qy 6 49 ATGGGTCCCC— CAATG 

GGGATCCCCCCTGGAAGAGGG 684 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 640 C— 

AGTCCCCAGCAATGACCATGTGCAGCATGGAAAACTGGGGTCCCCTCTTGTCTCTGC 5 83 

Qy 6 85 ACT — CCA-ATGGGCATGC CCCCTCCGGGA-ATGCGGCCTCCT 

CCCCCTGG 731 

II III I I I I I I I I I I I I I II I I I I I I I I I 

I I 

Db 582 

CCTTACCATGTGGGTAACCCTAACACCCCTCCCTGATCCGCAGCTTCCTGTAAACGAGGG 523 

Qy 732 GATGCGA GGCCTTCTTTGAC — 

CCTTGG 757 

I I I I I I I I I I I I I I I I I I 

Db 522 

GAAGAGACGGAGGATGCAGCCATCTGTGTCTTCATTTGTGTGCAACTGCATTATGAAATT 463 

Qy 75 8 CC ACAGAG TATGGAAG 

TAG 776 



Db 462 

TTAACTAAAAATGTTCCAGAGAAAGCAAACAGTGTAAAGCTATTAAGGAAGGCGGTTTAG 4 03 
Qy 777 CT 

CC 780 

II II 

Db 402 

CTAATGGATAACCGATGACTGATGTGTGTATAAGTGAGAGATGGGGATGGAGATTCTGGG 3 43 
Qy 781 GCAGAGGC 

GTGGGCTCGATTCCTCAGGGCCAC G 813 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 342 

ATGCTGGCGGAGGCTCTGGGAGCGCAGGTAAACAGGCCCAAGAGATGTGGAGAGCAGAGG 2 83 

Qy 814 T T AC C AC AG AC CTGTTTGTTTCTTATGCTGTTGTTCG TGG- 

AGT 856 



Db 282 

TTGTCACAGACACATGCCATCTGCATAGGGCTCAATCTGCCGTTAGTAATTTGGATAAGT 2 23 

Qy 857 CTCATGGGATT GTCTGGT TTCC- 

CTTACAGGGCC 8 89 



Db 222 

AT C CAAAG T T T AC AG AG AAAGG AAGAAGGC AGAAAAGGAAC T AC AGAATC AAAC AGC T C C 
Qy 890 

CCCTCCCCCGGGAATGCGCCCACCAAGGCCCTAGACTCATCTTGGCCCTCCTCAGCTCCC 9 49 



Db 162 CCTTTCCTC TCTGC-CACACTCACACCCCACACACACAAAAACACCACACA- 

CACAC 108 

Qy 950 TGCCTGTTTCCCGTAAGGCTGTACATAGTCCTTTTA-TCTC— CTTGT- 

GGCCTATGAAA 10 05 

I II III III I I I I I I I I I I I I I 

Db 107 ACAC 

ACACCAGGGGACAATGAGATGTCAGATAAGCCTCAGCCAGTAGGCCAACAGTG 5 1 

Qy 10 06 CT G G T T T AT AAT AAAC T C T T A- AG AG AAC AT T AT AAT T G C 10 44 

II I I I I I I I I I I I I I I I I I I I 

Db 50 CTCGAGGAAT-TCCGCAGCTTTTAGAGCAGAAGTAACACTTCC 9 



I I I I I 

163 



RESULT 15 
US-10-478-914-3/C 

; Sequence 3, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 3

; LENGTH: 2083
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-3 

Query Match 38.1%; Score 2483.5; DB 1; Length 2083; 

Best Local Similarity 35.6%; Pred. No. 7.7e-154; 
Matches 654; Conservative 0; Mismatches 365; Indels 817; 
Gaps 103; 



Qy 7 CCTGAGGAGGCGAATCCGGCGGGTATCAGAGC-

CATCAGAAC 47

I III I I I I I I I I I I I I I I I I I I I I
Db 2083 CTAGCGGCCGCGA — CCTGCAG — CTC-

GAGCACATGTGGCCTTTTTTTTTTTTTTTTTA 2029

Qy 4 8 CGCCACCATG — ACGGTGGGCAAGAGC AGCAAGATG- 

CT 83 



Db 2028 

TTTTGCCTTCTTGCTCTGGTGGTTAGTCTCCTACTATAAACATGATTTTCCTTCTTCCCT 1969 

Qy 8 4 GCAG CAT AT T GAT - T AC AGG A 

TGAGGT 109 



Db 1968 

T C AG AG TCCTGTCAATCTTTACTTTTTTTCT CAT T AAT AT AC AG AAG T TAT AC T AC AG AT 19 09 

Qy 110 GCATCCTG CAGGACGGC-CGGATCTTCAT TGGCACCTTCAAGGCTTTTGAC- 

AA 161 



Db 19 0 8 ACATTCAAGCTCATGACTGCTAAAAACTTCCTGAAATGAGCCTTAA 

TTCATACGAA 1853 

Qy 162 GCACATGAATTTGATC CTCTGT-GACTGTGAT- 

GAG 195 



Db 1852 

TCATTTCTATTTGTTCTTTTTTTTTTGAGACAGACTCTTACTCTGTCGCCCAGGCTGGAG 1793 

Qy 196 TTCA G AAAG AT C AAGC C AAAG AAC T C C AAAC - AAG C AG AAAG G G — 

AAG AG AAG C G A 2 49 



Db 1792 TGCACTGGCATGATCTCGGCT 

CACTGCAACCTCTGCCTCACGAGTTAGGTCAAGCAA 1736 

Qy 250 GT CCTCGGTCT GGTGCTG 

CTGC 271 

I I I I I I I I I I I I I I I I I 

Db 1735 

TTCTTCTGCCTCAGCCTCCCGAGGAGCTAGGATTACAGGTCCCTGCCACCATGCCTGAGT 16 76 

Qy 2 72 GAGGGG AG— 

AATCTGGTCTCAA 292 

I I I I I II I I I I I I I I I 

Db 1675 

AATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAAGCTGGTCTCGAATT 1616 

Qy 2 93 — TGACAGTAGAGGG ACCT CCTCCCAAAGATACTGGTATTGC — 

TCGAGT 33 8 



Db 1615 CCTGACCTCAGGTGATCCGCCCACCTCAGCCTCCCAAAG- 

TGCTGGGATTACGGGCGTGA 1557 

Qy 339 TCCAC— TTGCTGG 



AGCTGCCGGGGGCCCAGGGATC 372 

I I I I I I I I I I I I I I I I I I I I I 

Db 1556 

GCCACGGTGCCTGGCCTCTATTTTTTATTTTAAATACCTTTTGCCCTGAAATCTATGTCC 149 7 

Qy 3 73 GGCAGGGCTGCTGG-CAG AGG 

AATCCC 39 8 

II I I I I I III II 

I I I I I I 

Db 1496 

TCTGATTATCAACACTGCTTACCAGTTTTATTTTTGTTGGTATTTATTTCATTCAATCCC 143 7 

Qy 399 AGC TGG GGTTC — 

CC 411 

III III I I I I 

I I 

Db 1436 

AGCTCTACAACTTACCAGTATAAAAATTTGGATCAGTTACTTAATCTTGCAGTTTCTTCC 13 7 7 

Qy 412 ATGCCC 

CAG 42 0 

I I I I I I I 

Db 1376 

TTTTAAAAATGGAATAATATTATCTATCATATAAAATGCCCAGTTAACAAATATGAACTA 1317 

Qy 421 GCTCCTG CAGG — ACTTGCTGGG 

CCAGTCCGT 450 

I I I I I I I I I I I I I I I I I I I 

I 

Db 1316 

ACACCTACAATGTGCCAGGCACCATTCTAGGGAATAGAATGTAAATAAGTTCAAGTTCTT 1257 

Qy 451 G 

GGGTTGGCGG 461 

I I I I I I 

Db 1256 

G AC C T GAATAAG TTTACATTTTAGT G AG AAG AC AGAC AAT AAAT C AAT AAC AT GTCCATT 1197 

Qy 462 — GCCA TCCCAA CAG 

GTGATG 48 0 

I I I I I I I I I III I I I I I I 

Db 1196 

ATGCCAGATAGTTTTAAATGTCACAAAGAAAAAGGGGGAGCAATCAGATAGTGATGGAAG 113 7 

Qy 481 -ACCCCACAAGGAAGAG-GTACTGTTGCAG CCGCTG— 

CAG— 517 

I I I I I I I I I I I I II I I I I I I I 

Db 1136 

TTACACTCTAGAGAGGGTGGACAGAAAAAGTTTCTCTTAGGAGATGACATCTGAACAGAG 10 7 7 

Qy 518 CTG 

CTGC— 524 

III III 

Db 1076 

ATCTAAATGACTGAGTACAAACCCCAATGGAAAAGTGATTGGTGTGTTGTGAAACAGCAA 1017 

Qy 525 CACAGCCAGTAT TGCC GGGGCT 

CCAACCC 553 

I I I I I I I I I I I III I I I I I III 

Db 1016 



CACAGCCAGTTTGACTAGATGAGTAGATCAGAATGACTGAGGTGGGACTGTGGTCATCAA 95 7 

Qy 554 AGTACCCACC TGGCCGTGGGGGTCCTCC-CCCA CCTATGGGCC 

GAG 59 8 



Db 956 

AGAGATCAGAGAGGTAGCCTGGGAAGGCCACATCACAGAATGTCCTTTGGGCCTCTGTAA 89 7 

Qy 599 GAGC ACCCCCTCCAGGC- 

ATGATGGGCCCACCTCCTGGTATGAGACCTCC 647 

I I I I II I I I I I I I I I III 

Db 896 

GAGCGTTTTGAAAAAACAGTTCTGAGCAAAGGTTAGTCATTTAACTTGACTTACTCTTTA 83 7 

Qy 6 48 TATGGGTCCC CCAATGG — GGATCCCCCCTGGAAGA 

GGGACTCCAAT 692 



Db 836 

GAAGGATCACTTTGGCTATTGAGTGGCAAACAACTGGAGAAGTGGTTAGGGGAGCACTGT 7 7 7 

Qy 693 GGGCATGCCCCCTC— CGGGAATGCGGCCT CCTCCCCCTG 

GGATGCGAG- 739 

I I I I I I I I I I I I I I I III III 

Db 776 

GGGCAGGAATAGAAGGAGGAAAAGCAGACTGGAGCCTAATCCAGTACTTCAGATGAGAAA 717 

Qy 740 GCCT TCTT TGACCCTTGGC 

CAC 761 

I I I I III I I I I I I I II 

Db 716 

TGACAGATTTGCCTGGTACATCATTTCCTATCCCTTTACTTATAACTTATTGGTGTATTT 657 

Qy 762 AGAG TATGGAAGTAGC TCCGCAGA 

GGC- 788 



Db 656 

TTATTTTAAGTGTAAATTTTTATAAATGAAGCATAATTTGAAGATTGATATGTTCGGACT 59 7 

Qy 789 -GTGGGCTCGATTCCTCAGGGCCACGTTACCACAGACCTGTT 

TGTTTCT 836 



Db 596 TGTGTTTTCTATT-ATCATGGTGTC- 

TCTCCACATACCTTTTCCCTTTAGTTCTGCTTCT 539 

Qy 83 7 TA TGC TGTTGTTCGT GGAGTCTCATG 

GGATTGT 869 



Db 538 

TATTGTGCTCCTCAAGTCTATTTTTGTTCTTTTCCCTCTACTTCTACTGTTTTGAACTGT 4 79 

Qy 870 CTG GTTTCCCTT ACAGG GCCCCCTCC 

CCCGGGAATG 9 05 

I I I I I II I I I I I I I I I I I 

Db 478 

ATATGCTATTTCTATTTTGCCAGGAGTTACCCCCTTCAATTTTCGTTGTACTCTCTCAGT 419 



Qy 9 06 CGCCCACCAAGGCCCTAG-ACTCATCTTGGCC — CTCCTCAGCTCC- 

CTGCCTGTTT 958 

II II I III I I I I I I I I I I I I I I I I II III I I 
Db 418 CTCAAACACATACCCATGCTCTCTTC- 

TGTCCTTCTCTTCATATCATCTTCTAGAATTAG 360 

Qy 959 CCCGTAAGGCTGTACATAGTCCTTTTATCTCC- 

TTGTGGCCTATGAAACTGGTT 1011 



Db 359 TTCTACCCCGATTTTC — 

T AC C AAT TTATTTTATTTC C AAT C AGT AC AT TGAT TC AGAT T 3 02 

Qy 1012 TATAATAAACTCTTAAGAG AACATTATAATTG 10 43 

I I I I I I I I I I I I I I I I III 

Db 301 TATCAATATGTTTTACCAATTTTATCATTTTTTTTG 266 
RESULT 1 
US-10-478-914-4 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA

; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4

; LENGTH: 3210
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-4 

Query Match 42.4%; Score 4608; DB 1; Length 3210; 

Best Local Similarity 38.0%; Pred. No. 8.5e-303; 
Matches 1164; Conservative 0; Mismatches 623; Indels 1280; 
Gaps 163; 

Qy 1 CAGTTACAGGGAGC 

ACCACCAGGGAACATCTCGGGGAGCCT 41 

II I I I I I I I I I I I I III I I I I 

Db 133 

CAAGGACAAGGAGCTGCTGTTCGAAACAGCCTTCCTGCTCCCCAACCTGCCTCCCACCCA 192 

Qy 42 GGTT GGAAGCTGCAGGCTTAGT — 

CTGTCGGCT 72 



Db 193 ACAGGTTTTGCATATACTCTACTGGGAAGAGGGACACACCCGACTGCATCACTGCC — 

CT 250 

Qy 73 GCGGGTCTCTGACTGCCCTG 

TGGGGAGGGTCTTGC 107 



Db 251 CCAAGTCTCTCCCTGCCCTGTCCAGCATCCAGGAGCACCCCTAGTTGGGGAAG — 

CTT-C 307 

Qy 108 CTTAACATCCCTTGCAT TTGGCTGCAAAG AAATCTGC — 

TTGGAA 150 



Db 308 

TGTGACTCCCCCTACAACAGCCTAGGATGGAGTGGGGTTTGTGAACAAATGCAGAAGGCA 36 7 

Qy 151 G AAGGGGTTACGCTG TTTGGCCGG 

GCAGAAACT 183 

I I I I I I I I I I I I I I I I I II 

Db 368 

GTCTTAGGGAGGTCAGCTGACATGCCCCTGGCCTGTGGCTGGGAAGTAGCAGAGGCTAAG 42 7 

Qy 184 CCGCT GAGCAGAACTTG— CCGCCAG 

AATGCTCC 215 



Db 428 

GTTCTTCCCCGCTCTGGGGTTGCCAGGAGTAGCACTGGATCAGTCAGGTGACAGGGCT-C 4 86 

Qy 216 TCCTGTTGCTGAGTA TCATC — GTCCTCCACGTCG 

CGGTG- 253 

I I I I I I I I I I I III I I I I I I I I I 

Db 487 

TCCTCTCTCTGAGCAGGTCCGGTGGCAGCCTTCAAGGTCGCCACGCCGTATTCCCTGTAT 5 46 

Qy 254 — CTG GTGCTGCTGTTCGTCTCCACGATCGTCA-GCCAATGGATC — 

GTGGGCAAT 304 

III II II I I I I I I I II I I I I III I 

I I I 



Db 547 

GTCTGTCCCGAGGGGCAGAACGTCACCCTCACCTGCAGGCTCTTGGGCCCTGTGGACAAA 6 06 

Qy 3 05 GGACACG CAACTGATCTCTG 

GCAGAAC 331 

I I I I I I I I I I I I I I I I I 

I 

Db 607 

GGGCACGATGTGACCTTCTACAAGACGTGGTACCGCAGCTCGAGGGGCGAGGTGCAGACC 666 

Qy 332 TGT AGCAC CTCTTCC 

TCAGGAAAT 355 

II I I I I I I I I I I I I I I I 

Db 667 

TGCTCAGAGCGCCGGCCCATCCGCAACCTCACGTTCCAGGACCTTCACCTGCACCATGGA 72 6 

Qy 356 GTCCACCA — CTGTTT CTCATCAT — 

CACCAAACGAATGGCTGCAGTCTGTC 403 

I I I I I I I III II II II II I I I I I I I I I I 

I 

Db 727 

GGCCACCAGGCTGCCAACACCAGCCACGACCTGGCTCAGCGCCACGGGCTGGAGTCGGCC 7 86 
Qy 40 4 -CAGGCCACCATG — 

ATCCTGTCGATCATCTTCAGCATTCTGTCTCTGTT C 451 

I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I 

I 

Db 787 

TCCGACCACCATGGCAACTTCTCCATCACCATGCGCAACCTGACCCTGCTGGATAGCGGC 8 46 

Qy 452 CTGTTCTTCTGCC AACT — CTTCACC- 

CTC 478 

I I I I I I I I I I II I I I I I I I I 

Db 847 

CTCTACTGCTGCCTGGTGGTGGAGATCAGGCACCACCACTCGGAGCACAGGGTCCATGGT 9 06 
Qy 4 79 ACCAAGGGG- 

GGCAGGT 49 4 

I I I I I I I I I I I I 

Db 907 

GCCATGGAGCTGCAGGTGCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGTACCCA 966 

Qy 495 TTTACATCACTGGAATCT TCCAAATTCTTGC TGG — 

TCTGTGC 535 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 967 

TCCTCCTCCCAGGAGAGTGAAAGTAAGGGACCAACCTCTTGCCCCTTTTGGGTTCTCTGT 102 6 

Qy 536 GTGAT GAGTGCTGCGGCCATCTACACGGTGAGGCACCCGGAGTGGCATC- 

TCA 58 7 

I I I I I I I III I I I I I I I I I I I I I I I I 

I I I 

Db 102 7 TTTCTTCTGTCCTCATCCTGCACCCA — GACCCTGTTTGGAACTC 

TGGCCTCATCA 1080 

Qy 58 8 ACTCGGAT TACTCCTACGGTTTCGC 

CTAC 616 

I I I I I I I I I I I I I I I I II 



Db 1081 

CCCCAAGCCCTCAGAACCCCCCGGTCCTCCTCCTTTTCTGCTGCTGCACATCCCTTCTGC 114 0 



Qy 617 ATCCTGGCCTGGGTG GCCTTCC- 

CCCT 648 



Db 1141 TTCCT — 

CCTTGGTGCAATCCCCAGAAGCCCACTCTCCTTCCATCTGCTCTGGAGTCTCT 119 8 

Qy 6 49 TCTC AGCGGTGTCATCTATGTGATCTT — GC — 

GGAAACG — 684 

III I I I I I I I I I I I I I I I I 

Db 1199 

GCTCCTCTTGACTCTCTGGAGTGGCTGTGCCTTGGCAGTGACCTTTGGCCAGGGCAAGTG 12 5 8 

Qy 6 85 -CGAATGA GGCG-CCCAGACGGTCTGTCTGAGGCTCTG 

AGCGTAC 72 7 



Db 12 59 CCTCATGACAGGTACTGGGTGCCCCAGGCAG- 

CTAAGTGCCGCCCTGCCCACCAGCCCCC 1317 

Qy 72 8 ATAGGGAAGGGAGGAAG GGAAAACA 

GAAAGC 75 8 

I I I I I I I I II I II I I I I 

Db 1318 

TATGGCTTGGGAAGGCTGGGGGTCCTCTTGGCCAACAGGGTGAAACCCCATCTCTACTAA 1377 

Qy 759 AGACAAAGAAAAAA GAGCTAGCCCAAAATCCCA — AACTCAA 

AC 800 



Db 1378 

AAACACAAAAATTAGCCAGGCATGGTGGTGCGCGCCTGTAGTCCCAGCTACTCAAGAGGC 1437
Qy 801 CAAACCAA 

ACAGAAAGCAGTGGAGGTGGGGGTTGCTGTTGATTGAAGAT G 85 0 



Db 143 8 TGAGGCAAGAGAATCGCTTGAACCCG-GGAGTCAGAGGTTGCAG- 

TGAGCCGAGATCGCG 1495 

Qy 851 TATATAATATCT — CCGGTTTATAAAAC 

C T AT T T AT AAC AC T T T T T AC AT AT AT 902 



Db 1496 

CCACTGCACTCCAGCCTGCTGATAGAGCAAGACTCCATCTCAAAAAAAACAACCCAAAAT 1555 

Qy 9 03 GTACATAGTATTGT TTGCTTTTTATGTTGACCATCAGC — 

CTCGTG 946 

I I I I I I I I II I I I I II I I I I I I I I I 

Db 1556 TTGCCTGGCATGGTGGCAGGC ATCTGTAATCC — 

CAGCTACTCGGGAGGCTGAGACA 1610 

Qy 9 47 TTGAGCCT 

T AAAGAAGT AGC T AAGGAAC T T T AC ATCCTAACAGTAT 992 

I I I I I I I I I I I I I I I I I I I I I 

Db 1611 TGAGAGCTGCTTGAACCTGGGAGGCAGAAGTTGCAGTGAGCC — 



-CCCTGG 

I I I I I I 



GAG AT C AC AC C AC T GC 166 8 

Qy 993 AATCCAG CTCAGT 

ATTTTTGTTTTGTTTTTTGT 1025 

I I I I I I I I I I I II I 

I I I I I 

Db 1669 

ACTCCAGCCTGGGTGACAGAGCGAGACTCTGTCCCAAAAAATCAAAAAAATCACTTTTGG 1728

Qy 1026 TTGT TTGTTTTGTTTTACCCAGAAATAAGAT- 

AACTCC A 1063 

II I I I I I I I I I I I I I I I I I I I 

I 

Db 172 9 TAG AG AT GC AC T C T C GC T ATG — 

TTGCCCAGGCTGGTCTTGAACTCCTGGGCTCAATTGA 17 86 

Qy 1064 TC-TCGCCCCTTCCCTTTCAT CTG AAAGAAGATA-CCTCCCT- 

CCCAGTC- 1110 

I I I I I I I I I I I I I III III I I I I I I I I I I I 

I 

Db 1787 

TCTTCCCACCTTGACCTCCAAAGTGCTGGGATTACAGGTGTGAGCCACCATGCCTAGCCT 1846

Qy 1111 CACC- 

T C AT T T AG AAAAC C A 1129 

I I I I I I I I III 

I 

Db 1847 

CAGGGAATTCTTATAAGAACTCTATGAAGTAGGCATCACCATCTTCTCTGTATCCATGGA 1906

Qy 113 0 AAGTGTGG-GTAGAAA CCC 

CAAATGTCCAA 1158 

I I I I I I I I I I I III I I I I III 

Db 1907 

AAGAGAGGCCTAGAGATGTATGCTAACTTGCCCAAGCTCACATAGCCCAGGGTAGCATAG 1966 

Qy 1159 AAGCC 

CTT 1166 

I III 

I I 

Db 1967 

CTGGGATGTTGAGCTGAGGCCGGAGGAGAAGTAGCAGTCGCTGGCAGAGCACACAGGCTG 2026

Qy 116 7 TTCTGGTGGGTGACCCAGTGCAT CCAACAGAAACAGCCGC- 

TGC 1209 

I I I I I I I I I I I I I I I I I I I I I II III II 

Db 2027 

CTCTGGGGGATGAGCTGGTGCGTTTAAGGAACAGGCCAGCACTGGCATTCGCAAGCAGTG 2086

Qy 1210 CCG AACCT — 

CTGTGTGAAGC 122 8 

III I I I I II I I I I I 

Db 2087 

GGGAAGGGGAGAGATGCCGAGGTGGTCAGTATCCTGACTTTCAGAGGCCTTTTTTTGTTT 2146 

Qy 1229 TTTAC 

GCGCACACGGA 12 44 

I I I I I I I I I I I 

Db 2147 

GTTTTAATTTTTGCTAGATTGATATTAAAAACTCATGTGGAGGAACTCAAGGAATGTTTA 2206



Qy 12 45 CAAAA-TGCCCAA — ACTGGAGCCCTTGCAAAAA- 

CACGGCT- 12 82 



Db 2207 

GAAGACCAAAAGTCCCCAATGACAGGAACAAAAGCAACCAATTTTTAACTTTCTCTTCTC 2266

Qy 1283 TG-TGGCATTGGCATAC TTGCCCTT ACAGG 

TGG 1314 

III I I I I I II I I I I I I I I I I 

Db 2267 

ATTCCTGTTTTCATTGATTTCCCACATGTAGTCCTTTTGCTCAGGAAGTCTTTGGGGAAA 2326

Qy 1315 AG TATCTTCGT C AC AC ATC T - AAAT G AG AAAT C AG 

TG AC AAC AAG 135 8 

I I I I I I I I I I I I I I I I I I I I I II 



Db 232 7 TTAAGGATCTTTGAAGC TCTGAAATAGGTGATCAGGTTAGTGGTGTCTGTCAGC- 

TG 2382 

Qy 1359 TC TTTGAAATGGTGCTA 

TGGATTT 1382 

II I I I I I I I I I I I 

I I I I I 

Db 2383 

TCTAAGAGGTTGGAAAATGAACTACTCAAGATAGTCACGAAAATACTGAAAGTTTGATTT 2442

Qy 13 83 ACCATTCCTTATTATCACTAA — T C AT C T AAAC AAC T C AC TGG 

AA 1425 

I I I I I I I I I I I I I I III III II 

I I 

Db 2443 

TTCTTTCCATATTTGAATTAATTTTTTCTGTTTGACTGGAAGGGGTTTTTGTATAACTAA 2 502 

Qy 142 6 ATCCAATTAACA ATTTTACA 

ACATAAGATAGAATGGAGACCTGAATA 14 72 

III II I I I I I I I I I I I I I I I I I I 

I I 

Db 2 5 03 AACCTCAGCGCATAAAGGAGATTTAAAAGGAGCACATGATTTAGTGGGTGGGCC 

ATG 2559 

Qy 1473 ATTCT GTGTAAT 

ATAAATGGTTTATAA 1499 

III I I I I I I I I I I I I I I I I 

Db 2560 

AAACTAGAGATGGGATTTGGGGGTGAATTTGTCAATATCTGGATTTTAATCCAGACATCT 2 619 

Qy 150 0 CTGCTTTTGTACC — TAGCTAGGC TGCTATTATTACTAT 

AAT GAG 15 42 

I I I I I I II II III I I I I I I I I I I I I I I 

Db 2620 

CTGCTAACGAGCCTTTGGTAAGTCACTTCAGATACTTTTCCTCCTTTTTACAAAGAGAGG 2 6 79 

Qy 15 43 TAAATCAT AAAG — CCTT 

CATCACTCCCACA 1571 

I I I I I I I I I I I I I I I I I I I I I I 

Db 2680 

GCTGGCTTAGTTATTTGCCAAAGCCCCTTCCAGGCCTGAATTCCACAAGTACGATTTACT 2 739 



Qy 15 72 TTTT TCTTACG-GTC GGAGCATCAGA- 

ACAAGCGTCTAG 16 0 8 

III I I I I I III I I I I I I I I I I I I I I I 

I I I I 

Db 2 740 GTAGTGTCTTATCACTCTTTCATGTCACAATAGCGTGGAGCATTAGAGAAAAGC — 

CTAG 2 79 7 

Qy 1609 ACT CC 

TTGGGACCG 1622 

III II I I I I I 

Db 2798 

ACTTTTAGTTGATAGCCAGTTGAAATATCATTGATAGAATTTTAGTTTTAGGAAAAATTG 2 85 7 

Qy 1623 TGAGTTC CTA GAGCTTGG 

CT 1642 

I I I I I I III I I I I I I I I 

I 

Db 2858 

GTTTGATTTCTAGCTTTATTACTATTAGGTATGTGAGCTTGGGCAAATCGCTTAATCTTT 2 917 

Qy 1643 GGGTC TAGGCTG 

TTCTGTGCCTCCA 1667 

I I I I I I I I I I I I I I I I I I I 

Db 2918 

GAGTCTAGTTTTCTCTCAAAATGAGAACATTAGGCTAAATGATTTCCGAGTTTCCAGCTA 2 9 7 7 

Qy 166 8 AGGACTGTCTGGCAA 

TGACTTGTATTGGCCACCAACTGTAGATGTAT 1714 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2978 

GTCCTAGAGTTCTATATTTCTACATAGTTGAATTATTTTATCATGCTGTTGCTGGGGAAT 3 03 7 

Qy 1715 ATATGGTGCCCTTCTGATGC TAAGAC 

TCCAGACCTTTTGTTTTTG 1759 

II I I I I I I I I I I I I I I I I I I I I II 

Db 3038 

ATGACTAACCCTTTTGAAGCTACTAATTTTATGTCGAGCTTTAAAGTCCATAATTGTTAT 3 09 7 

Qy 1760 

CTTTGCATTTTCTGATTTTATACCAACTGTGTGGACTAAGATGCATTAAAATAAACATCA 1819 

III II I III I I I I I I I I I I III I I I I I III 

I 

Db 3 09 8 C T T — C AG AAAAT AT T AT T T G AC C T AC AG TAT G T C C — 

AAATCAATTTAATAAAATCGCT 3153 



Qy 1820 GAGTAAC 1826
I I I I

Db 3154 TTATAAC 3160



RESULT 2 

US-10-478-914-61 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914



; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226

; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61

; LENGTH: 2437
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-61 

Query Match 41.2%; Score 4473; DB 1; Length 2437; 

Best Local Similarity 44.1%; Pred. No. 4.9e-294; 
Matches 1100; Conservative 0; Mismatches 659; Indels 736; 
Gaps 157; 

Qy 3 GTTACAGGGAGCACCACCAG 

GGAACATCTCGGGGAGCCTGGTTGGAAGCTGCAGG- 57 



Db 1 GTTACT-TCTGCTCTAAAAGCTGCGGAATTCCTC 

CTACTGGG 5 4 

Qy 58 CT TAGTCTGTC GGCTGCGGGTCTCTGACTGCC 

CTGTGGG 96 

II I I I I I I I I I I I I I I I I I I III 

Db 55 

TGTACTGTGGAGACTGTCAAAGTCTCCCGGAGCCCAATTTCCGGAAGCGGTGAGTTCTGA 114 

Qy 9 7 — GAGGGTCTTG — CCTTAACATCCCTTGCATTTGGCTGCAAAGAAAT 

CTGCTTGGA 149 



Db 115 

AAGAAGTTCCTGCACCGTAGTTTCCCAAGTCTGCGAATCCCCAACCATGAGCGCCTCGGG 174

Qy 150 AGAAGGG GTTACGCTGTTTGGCCGG — GCAGAAACT — CC 

GCTGA 19 0 



Db 175 

CGTACTGTCCTTTACCCAGCAAGGATGGGAGCAGGTGCTGGCCAAAGTGAAACGGGCTGT 234

Qy 191 G CAGAACTTGCCGCCAG AATGCTCCT — CCTGTTGCTGAGTATCAT — 

CGT 237 



Db 235 GGTTTACCTGGAC — 

GCCGCCTGCGCCGAGAGCCTGCACTGGGGCTGCGGATCCACCCGT 2 92 

Qy 23 8 CCTCCACGTCGCGGTGCTGGTGCTGCTGTTCGTCTCC-ACGA TCGT 

CA 288 



Db 2 93 -CTCCTGGAGGCGGTG-GGGGGCC- 

CTGACTGTCACCTGCGAGAGTTCGAGCCCGACGCA 349 



I I I I I I I I I III 
GAGCACTGTTGGC — 



-CAGC- 
I I 



Qy 2 89 ATGGATCGTGG-GCAATGGA-CAC- 

GGC 326 



Db 350 

ATTGGTGGTGGAGCCAAGCAGCCCAAGGCAGTGTTTGTGCTGAGCTGCCTGCTGAAAGGC 409

Qy 32 7 AGAACTGTAG CA- 

CCTCTTCCTCAGGAA 353 

I I I I I I I I I I I I I I I I I I 

Db 410 

CGGACCGTGGAGATCCTACGGGACATCATCTGCCGCAGTCACTTCCAGTATTGTGTGGTG 469 

Qy 354 ATGTCCAC — 

CACTGTTTCTCATCATCACCAAAC 385 

I I I I I I I I I I I I I I I I I I I I I I 

Db 470 GTCACAACCGTGAGCCACGCTGTCCACCTCACAG 

CTAATCATGTCCCAGCGGCGGCA 526 

Qy 3 86 GAATGGCTGCAGTC — 

TGTCCAGGC 408 

II I I I I I I I I I I I I II 

Db 527 

GCGGCCGAGATGGAGGGGCAGCAGCCGGTGTTCGAGCAGCTGGAGGAGAAGCTGTGTGAA 5 86 

Qy 409 CACCATGATC CTGTCGA TCATCTTCAGCATT 

CT 441 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 587 

TGGATGGGCAACATGAACTACACGGCCGAGGTGTTCCATGTCCCGTTATTGCTTGCCCCT 6 46 

Qy 442 GT CTCTGTTCCTGTTCTTCTGC CAACTCTTCACCCTC 

ACCAAGG 485 

II I I I I I I I II I I I I I I I I I I I I I III 

Db 647 

GTTGCTCCCCACTTTGCCTTGACTCCAGCTTTTGCATCCCTTTTCCCACTGCTACCC 7 03 

Qy 486 GGGGCAGGTTTTACATCACTGGAAT — CTTCCAAAT — TCTTGCTGGT- 

CTGTGCGTGAT 540 

I I I I I I I I I I III I III I I I I I I I 

I 

Db 704 

CAGGATGTGCACCTCCTTAATAGCGCCCGACCGGACAAGAGGAAGCTGGGAAGCCT 759 

Qy 541 GAGTGCTGCGGCCATCTACACGGTGAGGCACCCGGAGTGGCATCT- 

CAACTCGGATTACT 599 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

I I 

Db 76 0 GGGTGATGTGGACTCCACTACGCTAA 

CCCCAGAGCTGCTGCTGCAGATCAGATGCCT 816 

Qy 600 CCTACGGTTTC GCCTACATCCTGGCCT — 

GGGTGG 632 

III I I I I I I I I I II I I I I I I 

Db 817 

AGTGTCAGGCCTCAGTTCTCTGTGTGAACATTTAGGAGTACGGGAGGAGTGTTTTGCTGT 8 76 

Qy 633 CCTTCCCCCTGGCCCTTCTCAGCGG TGTCATCTATGTGATCTTGCGGA- 



AAC 6 83 



Db 8 77 AGGTTCCTTAAGTCAGG-TCATCGCTGCGGATCTGGCCAAT- 

TATGCCCCTGCAAAGAAC 93 4 

Qy 684 GCGAATGAGGC-GCC-CAGACGGTCTGTCTGAGGCTCT 

GAGCGTACATAGGG 733 

III I I I I I I I I I I I I I I I I I II III 

Db 935 

AGGAAGAAGACTGCTGCAGGCAGGGCATCAGTGGTTTTTGTGGACAGAACCCTGGATCTC 99 4 

Qy 73 4 AAGGGAGGA-AGGGAAAACAGAAAGCAGACAAAG-AAAAAAGAGCT 

AGCCCAAAAT 78 7 



Db 995 ACAGGAGCAGTTGGACATCATGGAGACAACTTAGTAGAGAAGATCATTTCAGCAC 

TT 1051 

Qy 78 8 C C C AAAC T C AAAC C AAAC C AAAC AGAAAGC AG T GG AGG 

TGGGGGTTGCTG- 83 7 



Db 1052 CCCCAGCTC CCAGGCCACACA- 

AATGATGTGATGGTTAACATGATAGCGCTCACTGC 110 7 

Qy 838 

TTGATTGAAGATGTATATAATATCTCCGGTTTATAAAACCTATTTATAACACTTT- 892 

I I I I I I I I I I I I I I I I I I I II 

Db 1108 ACTCCATACTGAGGAGGAAAATTATA- 

ATGTGGTTGCACCAGGCTGTCTTTCACAATTCA 1166 

Qy 893 - T T ACATATATGTACATAG 

TATTGTTTGCTTTTTATGTTGACCATCAGCCTC 9 43 

I I I I I I I I I I I I I I I I I I I I I I 

Db 116 7 GTGACACCACAG-CCAAAGCCCTATGGGAAGCTT 

TACTGAACACTAAGCACAAAGA 1221 

Qy 9 44 GTGTT GAGCCTTAAAGAAGTAGC — TAAG — 

GAACTTTACATCCTAA 9 86 



Db 1222 

GGCAGTGATGGAAGTTCGGAGACATCTAGTGGAAGCGGCAAGCAGAGAAAACCTGCCAAT 12 81 

Qy 987 C AGTAT AATC 

CAGCTCAGTATTTTTGTTTTGTTTTTTGT 1025 



Db 12 82 CAAGATGAGTATGGGGAGAGTCACACCGGGACAGCTCA 

TGTCCTATAT T 1330 

Qy 1026 TTGTTTGTTTTGTTTTACC-CAGAAATAAGATAACTCCATCTCGCCCCTTCC — 

CTT 1079 

II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1331 

CAGCTCTTCAAGAACAACCTCAAAGCTCTAATGAATCATTGTGGCCTCCTCCAGCTTGGA 139 0 



Qy 1080 
CA 1117 



-TCATCTGAAA-GAAGATACCTCC CTCCCA GTCCAC CT — 



Db 1391 

CTGGCCACAGCTCAAACGTTGAAACACCCACAGACTGCCAAGTGGGACAACTTTCTGGCT 145 0 

Qy 1118 TTTAGAAAAC CAAAGTGTGGGTAGAAACCCCAAATGTCC 

AAA — 1159 

III II I I I I I III II I I I I I I I I 

Db 1451 TTTGAAAGGCTCCTTCTTCAGAGCATTGG — 

GGAGTCAGCAATGTCCGTTGTGTTAAATC 1508 

Qy 116 0 AGCCCTT-TTCTGGTGGGTGACCCAGTGCATCCAAC-AGAA — 

ACAGCCGCTGC 12 09 



Db 1509 AGCTGCTGCCCATGAT T AAGC C T G T AAC C C AG AG - AAC C AAC G AG G AC T AC AG C C - 

CTGA 1566 

Qy 1210 CCGAACC-TCT-GTGTGAAGCTTTAC GCGCACACGG- 

ACAAAA 1249 



Db 1567 

GGAACTGCTGATCCTTCTCATATATATTTATTCTGTCACTGGAGAGCTCACGGTAGACAA 162 6 

Qy 1250 TGCCCAAACTGGAGCCCTTG CAAAAACAC GGCT 

TGTG 1286 

III I I I I I I I I I I I I I I I I 

Db 1627 

AGACCTGTGTGAAGCAGAAGAAAAAGTCAAGAAAGCATTGGCTCAGGTCTTCTGTGAGGA 16 86 

Qy 1287 GCATTGGCATACTTGC 

CCTTACAGGTGGAGTATCTTCG 1324 

I I I I I I I I I I I I I I I I II I I I I I 

Db 1687 

ATCTGGATTGTCACCTTTGCTGCAAAAAATTACGGACTGGGACTCTTCAATTAATCTGAC 17 46 

Qy 132 5 TCACACATCTAAA TG — 

AG AAAT C AG T G AC AAC AAG T C T T T G AAAT GG T G 1372 



Db 1747 ATTTCACAAATCCAAAATTGCCGTGGATGAACTCT-TTACTTCA 

CTTCGGGATATTG 18 02 

Qy 1373 

CTATGGATTTACCATTCCTTATTATCACTAATCATCTAAACAACTCACTGGAAATCCAAT 1432 



Db 18 03 CTGGAGCTCGGAGTCTCCTGA — 

AACAGTTTAAGTCTGTATATGTTCCTGGAAATCATAC 186 0 

Qy 1433 TAACAA T T T T AC AAC AT AAGATAG-AATGGA- 

GACCTGAATAATTCTGT 1479 



Db 1861 

CCACCAGGCATCTTATAAGCCATTGTTGAAGCAAGTTGTGGAGGAAATATTTCATCCCGA 192 0 



Qy 148 0 G TAATATAAATGGTTTATAAC 

TGCTTTTGTACCTAGCTAGGCTGCTATTATT 1531 



Db 1921 GAGGCCAGATTCCGTTGATATTGAACACATGTCTTCAGGCCTCACTGATCTCC — 

TTAAA 1978 

Qy 1532 AC T AT AAT GAG T A- AATC AT AAAG C — CTTCATC 

ACTCCCACATT 15 73 



Db 1979 

ACTGGATTTAGCATGTTCATGAAGGTGAGCCGGCCTCATCCTAGTGACTACCCCCTCCTG 2 03 8 

Qy 1574 TTTCT TACGGTCGGAGCATCAGAA 

CAA 1600 

III I I I I I I I I I I I I III 

Db 2039 

ATCCTCTTTGTGGTAGGTGGGGTCACAGTCTCTGAAGTGAAAATGGTCAAAGATCTTGTG 2 09 8 

Qy 16 01 GCGTC-TAGACTCCTTGGGAC CGTG-AGT 

CT 1637 



Db 2 099 GCATCGTTGAAGCC- 

AGGAACCCAGGTAATCGTGCTGTCCACACGACTCCTGAAGCCACT 215 7 

Qy 163 8 TGGCTGGGTCTAGGCTGTTCTGTGCCTCC-AAGGACTG 

TCTGGCAATGACTT G 16 89 



Db 2158 TAAC- 

ATTCCTGAGCTGTTATTTGCAACTGACCGACTGCATCCAGACCTTGGCTTCTGAG 2216 

Qy 169 0 TATTGGCC AC C AAC T G TAG AT G TAT AT ATG 

GTGCC 1724 

II II I I I I I I I I I I I I I III 

Db 2217 

CATCCGCTAAGAAGATAAGACCTACTCAAGCTGGAAATGCCGATGCAATTTTCTGCCACC 2 2 76 

Qy 1725 -CTTCTGATGCTAAGACTCCA GACCTTTT— GTTTTTGC 

TTTGCA 1766 



Db 22 77 ACTCCAAATACT 

CCTCCACAACCAGCGTCCCTGTCACTAATTGCGAGAATGATGGAA 2333 

Qy 176 7 TTTT — CTGAT TTTATACCAACT — GTGTGG- 

ACTAAGATG 18 02 



Db 2334 

TTCTGCCTGAAGGGTCTTGATACCTACTCAGTGAGGTACTTTGCTTGGATTGCTGTGATT 2393 

Qy 1803 CATTAAAATAAACATCAGAGTAACTC 1828

I I I I I I I I I I III III

Db 2394 CTTAAAAAAAAAAAAAAAAAAGGCCACATGTGCTC 2428



RESULT 3 

US-10-478-914-4/C 
; Patent No. 7335755



-TCCTAGAG 

I I I I II 



; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 
Qy 2 AG T T AC AG - GG AGC AC C AC C AGGGA ACATCTCGGGGAGCCT — 

GGTTGGAAGCTGC 54 



Db 3108 AT T T T C T G AAGAT AAC AAT T ATGGAC T T T AAAGC T C G 

ACATAAAATTAGTAGCTTC 3 053 



Qy 55 AGGCTTAGTC-TGT CGGCTGCGG 

GTCTCTGACTGCCCTGTGGGGA- 9 8 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 3052 

AAAAGGGTTAGTCATATTCCCCAGCAACAGCATGATAAAATAATTCAACTATGTAGAAAT 2 993 



Qy 99 GGGTCTTGCCTTAACATC- 

CCTTGCATTTGGCTGCAAAGAAATCTGCTTG 147 

I I I I I I II II I I I I I I I III II 

I I I 

Db 2992 ATAGAACTCTAGGACTAGCTGGAAACTCGGAAATCATTTAGCC- 

TAATGTTCTCATTTTG 2934 

Qy 148 GAAGA 

AGGGGTTACGCTGTTTGGCCGGGCAGAAACTCC 185 

III I I I I I I I I I I I I I II II II 

Db 2933 

AGAGAAAACTAGACTCAAAGATTAAGCGATTTGCCCAAGCTCACATACCTAATAGTAATA 2 8 7 4 

Qy 186 — GCT GAGCAGAAC TTGCC GCCAGAATGCTCCTCCTGTT 

GCTG 226 

III I I I I I I I I I I I I I I I I III 

I I I 

Db 2873 

AAGCTAGAAATCAAACCAATTTTTCCTAAAACTAAAATTCTATCAATGATATTTCAACTG 2 814 



Qy 227 AGTATCATC GTC-

TCGCG G 251



Db 2813 

GCTATCAACTAAAAGTCTAGGCTTTTCTCTAATGCTCCACGCTATTGTGACATGAAAGAG 2 75 4 

Qy 252 TGCT — GGTGCTGCTGT TCGTCTCCACGATCGTCA-GCC — 

AATGGATCGTGGGCAA 3 03 



Db 2753 

TGATAAGACACTACAGTAAATCGTACTTGTGGAATTCAGGCCTGGAAGGGGCTTTGGCAA 2 69 4 

Qy 304 TGGAC-ACGCAACTGATCTCT GGCAGAA 

CTGTAGCACCTCTTCCT 347 



Db 2 693 ATAACTAAGCCAGCCCTCTCTTTGTAAAAAGGAGGAAAAGTATCTGAAGTGACT- 

TACCA 2 635 

Qy 3 48 CAG GAAATGTC 

CACCACTGTT 36 8 

II I I I I I I I I I I I 

Db 2634 

AAGGCTCGTTAGCAGAGATGTCTGGATTAAAATCCAGATATTGACAAATTCACCCCCAAA 2 5 75 

Qy 369 TCTCATC ATCACC 

AAACGAATG 390 

I I I I I I I I I I III II 

Db 2574 

TCCCATCTCTAGTTTCATGGCCCACCCACTAAATCATGTGCTCCTTTTAAATCTCCTTTA 2515 

Qy 391 GCTGCAGT CTGTCCAGGCCACCATGATCCTGTCGATCA 

TCTTC 433 



Db 2514 TGCGCTGAGGTTTTAGTTATACAAAAACCCCT — 

TCCAGTCAAACAGAAAAAATTAATTC 2 45 7 

Qy 434 AGCATTCTGTCTCT — GTTCCT- 

GTTCTTCT 461 



Db 2456 

AAATATGGAAAGAAAAATCAAACTTTCAGTATTTTCGTGACTATCTTGAGTAGTTCATTT 239 7 

Qy 462 GCCAA-CTCTT CACC- 

CTCACCAAGGGGGGCAGGTTTTACATCACT 5 05 



Db 2396 T C C AAC C T C T T AG AC AGC T G AC AG AC AC C AC T AAC C T 

GAT C AC C T AT T T C AG AGC T 23 41 

Qy 506 GGAATCTT CCAAATTCTTGCTGGTC 

TGTG 534 

II III I I I I I I I I I I I I III 

Db 2340 

TCAAAGATCCTTAATTTCCCCAAAGACTTCCTGAGCAAAAGGACTACATGTGGGAAATCA 2 2 81 
Qy 535 CGTGATGAGT 



GCTGC 549 

I I I I I I I I I I 

Db 2280 

ATGAAAACAGGAATGAGAAGAGAAAGTTAAAAATTGGTTGCTTTTGTTCCTGTCATTGGG 2 2 21 

Qy 55 0 GGCCATCTACACG GTGAG 

GCACCCGGAGT 578 

I I I I I I I I I I I I I I I I I I I 

Db 2220 

GACTTTTGGTCTTCTAAACATTCCTTGAGTTCCTCCACATGAGTTTTTAATATCAATCTA 2161 

Qy 5 79 GGC AT C T - C AAC T C GG AT T AC T 

CCTACGG- 6 06 

I I I I I I I I I I I I I I I III 

I I I 

Db 216 0 GCAAAAATTAAAACAAACAAAAAAAGGCCTCTGAAAGTCAGGATACTGACCACCT- 

CGGC 2102 

Qy 607 TTTCGCCTACATC-CTG GCCTGGG-TGGCC — TTCC 

CCCTG 643 

I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2101 

ATCTCTCCCCTTCCCCACTGCTTGCGAATGCCAGTGCTGGCCTGTTCCTTAAACGCACCA 2 0 42 

Qy 6 44 GCCCTTCTCAGCGGTGTCATCTATGTGATCTTGCGGAAAC-GCGAATG 

AGGC — 69 4 

I I I I I I III I I I I I I I I I II I I I I I I III 

Db 2041 GCTCATCCCC- 

CAGAGCAGCCTGTGTGCTCTGCCAGCGACTGCTACTTCTCCTCCGGCCT 19 83 
Qy 695 

GCCCAGACGGTCTGTCTGAGGCTCTGAGCGTACATAGGGAAGGGAGGAAG 7 4 4 

I I I I I II I I I I I I I I I I I I I I I I III 

Db 19 82 CAGCTCAACATCCCAGCTATGCTACCCTGGGCTATGTGAG — 

CTTGGGCAAGTTAGCAT- 1926 

Qy 7 45 G G AAAAC AG AAAG C AG AC AAAG AAAAAAG AG C T AG C C 

CA— 783 

III III I I I I I I I I I I I I I I I II 

Db 1925 — ACATCTCTAGGCCTCTCTTTCCATGGATACAGAGAAGATGGTGAT- 

GCCTACTTCATA 1869 

Qy 78 4 AAATCCCAAA CTCAAACC 

AAACCAAACAG 812 

I I I I I I I I I I I I I I I I I I I 

I I 

Db 1868 

GAGTTCTTATAAGAATTCCCTGAGGCTAGGCATGGTGGCTCACACCTGTAATCCCAGCA- 1810 

Qy 813 AAAGCAGTGGAGGT GGGGGTTG--CTGTTGA TTGAAGA 

TG- 850 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1809 

CTTTGGAGGTCAAGGTGGGAAGATCAATTGAGCCCAGGAGTTCAAGACCAGCCTGG 175 4 

Qy 851 — TATATA ATATCTCC 

GGTTTATAAAACCTATTT 8 82 

I I I I I I I I I I I I I I I I 

I 



Db 1753 

GCAACATAGCGAGAGTGCATCTCTACCAAAAGTGATTTTTTTGATTTTTTGGGACAGAGT 169 4 



Qy 8 83 ATAACACTTTTTACATATAT GTACATAGTATTGTT 

TGCTTTTTATGT 929 

I I I I I I I I I I I I III III 

I I 

Db 1693 

CTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACTGCAACTTCTGC 163 4 

Qy 93 0 TGACC ATCAGC-CTCGTGT- 

T G AGC C T T AAAG AAG T AGC T AAGG AAC T T T AC A 980 

II I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1633 CTCCCAGGTTCAAGCAGCTCTCATGTCTCAGCCTCCC — GAGTAGCTGGG 

ATTACA 158 0 

Qy 981 TCCTAACAGTATAATCCAGCTCAGTATTTTTGTTTTGTTTTTTGTTTGTT 

TGTTT 1035 

III II II I I I I II I I I I I I I I I I I I I I I I II 

I 

Db 15 79 GATGCCTGCCACCATG— CCAG-GCA — 

AATTTTGGGTTGTTTTTTTTGAGATGGAGTCT 152 5 

Qy 1036 TGTTTTACC 

CAGAAATAAGATAACTCCATCTCG C 1069 

I I I I I I I I I I I I I I I I I I I 

I 

Db 1524 

TGCTCTATCAGCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGACTC 1465 

Qy 10 70 CC CTTCCCTTTCATCTG AAAGAAGAT 

AC 1097 

II I I I I I I I I I I I I I I I 

I I 

Db 1464 

CCGGGTTCAAGCGATTCTCTTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGCGCGCAC 14 05 

Qy 109 8 CTCCCTCCCAGTCCACCT C AT T T AG AAAA 

CCAAAGTGTGGGT 1139 

I I I I I I I I I I I I I I I I I II III 

Db 1404 

CACCATGCCTGGCTAATTTTTGTGTTTTTAGTAGAGATGGGGTTTCACCCTGTTGGCCAA 13 45 

Qy 1140 -AGAAACCCCAAATGTCCAAAAGCCCTT 

TTCTGGTGGGTGACCCAGTGCATCCAACA 1195 

I I I I I I I I III I I I I I I I I I I I II I I I I I I 

I 

Db 13 4 4 GAGGACCCCCAGCCTTCC-CAAGCCATAGGGGGCTGGTGGG 

CAGGGCGGCACTTA 1291 

Qy 1196 GAAACAGCC GCTGCCCGAACCTCT-GTGTGAAGCTTTAC GC 

GCACACGG 12 43 

I I I I I II I I I I I I I I III III I II I 

I 

Db 1290 G— 

CTGCCTGGGGCACCCAGTACCTGTCATGAGGCACTTGCCCTGGCCAAAGGTCACTG 123 4 

Qy 12 44 ACAA AATGCCCAAACTG-GAGCCCT 

TGCAAAAACAC 12 78 



Db 1233 

CCAAGGCACAGCCACTCCAGAGAGTCAAGAGGAGCAGAGACTCCAGAGCAGATGGAAGGA 117 4 

Qy 1279 GGCTTGTGGCATTGGCAT ACTTGC 

CCTTACA 1309 



Db 1173 

GAGTGGGCTTCTGGGGATTGCACCAAGGAGGAAGCAGAAGGGATGTGCAGCAGCAGAAAA 1114 

Qy 1310 GGTGGAGTATC TTC GT — 

CACACA— T 1332 



Db 1113 

GGAGGAGGACCGGGGGGTTCTGAGGGCTTGGGGTGATGAGGCCAGAGTTCCAAACAGGGT 105 4 

Qy 1333 CTAAATG AGAAATCAG- 

T G AC AAC AAG T C T T T G AAAT GG T GC T AT GG AT T T ACC A 13 86 



Db 1053 

CTGGGTGCAGGATGAGGACAGAAGAAAACAGAGAACCCAAAAGGGGCAAGAGGTTGGTC- 995 

Qy 13 8 7 TTCCTTATTATCACTAATC ATCTAAACAACTCAC TGGA — 

AATCCAA 1431 

I I I I I I I I I I I I II I I I I I I I I I I I 

Db 994 — 

CCTTACTTTCACTCTCCTGGGAGGAGGATGGGTACACCACACAGTTGGATGGTGCATC 93 7 
Qy 1432 

TTAACAATTTTACAACATAAGATAGAATGGAGACCTGAATAATTCTGTGTAATATAAATG 1491 



Db 936 TTTGCCTGTCTGCACCTGCAGCTC-CATGGCACCATGGA 

CCCTGTGCTCCGAGTG 883 

Qy 1492 GTTTATAACTG— CTTTTGTACCTAGCT AGGCTGCTATT- 

AT 1530 



Db 882 G- 

TGGTGCCTGATCTCCACCACCAGGCAGCAGTAGAGGCCGCTATCCAGCAGGGTCAGGT 82 4 

Qy 1531 TAC TATAATGAGTAAAT CAT AAAGCCTTCATCACTCCC 

ACAT 1572 

II I I I I III III I I I I I I I III I 

Db 823 TGCGCATGGTGATGGAGAAGTTGCCATGGTGGTCGGAGGCCGAC- 

TCCAGCCCGTGGCGC 765 

Qy 15 73 TTTTCTTACGGTCGGAGCATCAGAACAAGCGTC TAGACTCCTTGGGACC-GTG- 

AG 1626 



Db 76 4 TGAGC — CAGGTCGTGGCT — 

GGTGTTGGCAGCCTGGTGGCCTCCATGGTGCAGGTGAAG 709 

Qy 162 7 TTCCTAGAGCTT GGCTGGG TCTAGGCTGTTCTGTGCCT 

CCA 1667 



I I I I I I I I I I I I I I III I I I I I I I III

I I

Db 708 

GTCCTGGAACGTGAGGTTGCGGATGGGCCGGCGCTCTGAGCAGGTCTGCACCTCGCCCCT 6 49 

Qy 166 8 AGGACTGTCTGGCAATGACTTGTA TTGGCCA 

CCAA 1702 

I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 648 

CGAGCTGCGGTACCACGTCTTGTAGAAGGTCACATCGTGCCCTTTGTCCACAGGGCCCAA 5 89 

Qy 1703 CTGTAGATGTATATATGGTGCCCTTCTGATGCTAAGACTCCAGACCTTTTG 

TT 1755 

I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I 

Db 58 8 GAGCCTGCAGGTG AGGGTGACGTTCTGCCCCTCGGG — 

ACAGACATACAGGGAAT 536 

Qy 1756 TTTGCTTTGCATTTT CTGATTTTATACCAACT GTGTGGACTAAGATG- 

CA 1804 

I I I I I I III I I I I I I I I I I II 

I I 

Db 535 

ACGGCGTGGCGACCTTGAAGGCTGCCACCGGACCTGCTCAGAGAGAGGAGAGCCCTGTCA 4 76 

Qy 1805 TTAAAATAAACATCAGAGTAACTC 1828

I I I I I I I I I I I I 
Db 475 CCTGACTGATC — CAGTGCTACTC 454 
Qy 2 AGTTAC AGGGAGCA C C AC C AG G — G AA CA- 

TCTCG 33 

III I I I I I I I II I I I I III I I I I 

Db 2373 

AGTACCTCACTGAGTAGGTATCAAGACCCTTCAGGCAGAATTCCATCATTCTCGCAATTA 2314 

Qy 3 4 GGGAGCCTGGTTG GAAGCTGCAG 

GCTTAGTCT 65 



Db 2313 

GTGACAGGGACGCTGGTTGTGGAGGAGTATTTGGAGTGGTGGCAGAAAATTGCATCGGCA 2 2 5 4 

Qy 66 GT — CGGCT GCGGGTC TCT GACTGCCCTGTGGGGAGGGTCT 

TGC 107 



Db 2253 

TTTCCAGCTTGAGTAGGTCTTATCTTCTTAGCGGATGCTCAGAAGCCAAGGTCTGGATGC 219 4 

Qy 108 CTTAACATCCCTTGCATTTGGCTGCAAAGAAATCT GCTT-GGAAGAAG 

GGG 157 



Db 2193 AGTCG- 

GTCAGTTGCAAATAACAGCTCAGGAATGTTAAGTGGCTTCAGGAGTCGTGTGGA 2135 

Qy 158 TTACGCTGTT TGG 

CCGGGCAGAAACTCCGCTGAGCAGAACTTGCCGCCAGAATG 211 



Db 213 4 CAGCACGATTACCTGGGTTCCTGGCTTCAACGATGCC-ACAAGATCTT — 

TGACCATTTT 2 0 78 

Qy 212 CTCCTCCTGTTGCTGAGTATCATCGTCCTCCACGTCGCGGTGCTGGTG— 

CT 261 



Db 2 0 77 C-ACTTCAGAGACTGT G AC C C C AC — 

CTACCACAAAGAGGATCAGGAGGGGGTAGTCACT 2 021 

Qy 262 G CTGTTCGTCTCCACGATCGT CAGCCAAT GGATC 

GTG 298 



Db 2020 

AGGATGAGGCCGGCTCACCTTCATGAACATGCTAAATCCAGTTTTAAGGAGATCAGTGAG 1961 

Qy 299 GGCAATGGACACGCAACTGATCTCTGGCAGAACTGTAGC — 

ACCTCTTCCTCAGG 351 

II I I I I I I I I I I I I I I I I I III II 

Db 196 0 GCCTGAAGACATG TGTTCAATATC — AACGGAATCTGGCCT 

CTCGGGATGAA 1911 

Qy 352 AAATGTCCACCACTGTTTCTCATCATCA 

CCAAACGAATGGCTG 394 

I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1910 ATATTTCCTCCACAACTT- 

GCTTCAACAATGGCTTATAAGATGCCTGGTGGGTATGATTT 1852 



Qy 395 CAGTCT GTCCAG — GCCACCATGATCCTG 

TC 423 

I I I I I I I I I I I I I I I I I I 

Db 1851 

CCAGGAACATATACAGACTTAAACTGTTTCAGGAGACTCCGAGCTCCAGCAATATCCCGA 1792 
Qy 424 

GATCATCTTCAGCATTCT 441 

I I I I I I I I I I I I 

Db 1791 

AGTGAAGTAAAGAGTTCATCCACGGCAATTTTGGATTTGTGAAATGTCAGATTAATTGAA 1732 

Qy 442 — GTCTCTGTTCCTGTTCTTCTGC — CA ACTCTTCA 

CCCTCACCAAGG 485 

I I I I I I I I I I I I I I II II III I I I I I 

Db 1731 

GAGTCCCAGTCCGTAATTTTTTGCAGCAAAGGTGACAATCCAGATTCCTCACAGAAGACC 16 72 

Qy 486 -GGGGCAGGTTTTACATCACTGGAATCTTCCAAATTCTTGCTGGTCTGTG- 

CGTG- 538 



Db 16 71 TGAGCCAATGCTTTCTTGACT-TTTTCTT- 

CTGCTTCACACAGGTCTTTGTCTACCGTGA 1614 

Qy 539 ATGAGTG CTGCGG- 

CCATC T 557 



Db 1613 

GCTCTCCAGTGACAGAATAAATATATATGAGAAGGATCAGCAGTTCCTCAGGGCTGTAGT 1554 

Qy 558 ACACGGTGAGGCACCCGGAGT GGC ATCT CAACT-CGGATTACTC- 

CTAC 604 



Db 1553 CCTCGTT — 

GGTTCTCTGGGTTACAGGCTTAATCATGGGCAGCAGCTGATTTAACACAAC 1496 

Qy 605 GGT-TTCGCCTACATCC TGGCCTG GGTGGCCTTCCC CCTG- 

GCC 646 



Db 1495 

GGACATTGCTGACTCCCCAATGCTCTGAAGAAGGAGCCTTTCAAAAGCCAGAAAGTTGTC 1436 

Qy 6 47 CTTCTCAGCGGTGTCATCTATGTGATCTTGCGGAAACGCGAATGAGGCGCCCAG- 

ACGGT 705 

I II III I I I I I I I I I I I I I I I I I I I I I 

Db 1435 CCACTTGGCAG TCTGTG-GGTGTTTC 

AACGTTTGAGCTGTGGCCAGTCCAAG 13 85 

Qy 706 CTGTCTGAGG- 

CTCTGAGCGTACATAGGGAAGGGAGGAAGGGAAAACAGAAAGCAGACAA 76 4 



Db 13 8 4 C T G G AG G AGGC C AC AAT GAT T C AT T AG AGC T T T GAG GTTGTTCTT G AAG — 

AGCTGAATA 132 7 

Qy 765 AGAAAAAAGAGCTAGCCCAAAAT 



CCCAAACTCA 797 

Db 1326 

TAGGACATGAGCTGTCCCGGTGTGACTCTCCCCATACTCATCTTGATTGGCAGGTTTTCT 12 6 7 

Qy 79 8 AACCA AACCAAACAGAAAGCAGTG 

GAGGTGGGGG 831 

III I I I I I I I I I I I I I I 

I 

Db 1266 

CTGCTTGCCGCTTCCACTAGATGTCTCCGAACTTCCATCACTGCCTCTTTGTGCTTAGTG 12 0 7 

Qy 832 TTGCTGT — TGATT GAAGATGTATATAATATCTCCGGTTTATAAAA 

CC 877 

I I I I I III I II II I I I I I I I I I I I 

I I 

Db 1206 TT- 

CAGTAAAGCTTCCCATAGGGCTTTGGCTGTGGTGTCACTGAATTGTGAAAGACAGCC 114 8 

Qy 8 78 TATTTATAACACTTTTTACATATATGTACATAGTATTGTTTGCT 

TTTTATGT 929 

I I I I I I I I I I I I I II I I I I I I III II 

I I I I 

Db 1147 T G G T G C AAC C AC AT T AT A- 

ATTTTCCTCCTCAGTATGGAGTGCAGTGAGCGCTATCATGT 10 89 

Qy 93 0 TGACCATCAGCCTCG— TGTTGAGCCT TAAAGAAGTAGCT — 

AAGGAAC T T T AC 9 79 

I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I 

I 

Db 1088 TAACCATCA-CATCATTTGTGTGGCCTGGGAGCTGGGGAAGT- 

GCTGAAATGATCTTCTC 1031 

Qy 980 ATCCTAA CAGTATAATCCAGCTCAGTATTTTTGTT- 

TTGTTTTTTGTTTGTTT 1031 

I I I I I II I I I I I I I I I I I I I I I I 

Db 1030 - 

TACTAAGTTGTCTCCATGATGTCCAACTGCTCCTGTGAGATCCAGGGTTCTGTCCACAA 9 72 

Qy 1032 GTTTTGTT-TTACCCAGAAATAAGATAACTCCATC 

TC G 1068 

I I I I I I II I I I I I II 

I 

Db 971 

AAACCACTGATGCCCTGCCTGCAGCAGTCTTCTTCCTGTTCTTTGCAGGGGCATAATTGG 912 

Qy 1069 CCCCTTCC CTTTCATCT G AAAG AAG AT AC C T CCCTC— 

CCAGTCCAC 1113 

II III I I I I I I I III I I I I I I I I I I I I I 

I 

Db 911 

C C AG AT C C GC AGC G AT G AC C T G AC T T AAGGAACC T AC AGC AAAAC AC TCCTCCCGTACTC 852 

Qy 1114 C T C AT T TAG AAAAC - C AAAG TGTGGGTAGAAACCC — CAAATGTCCAAAAGCC- 

CTT 1166 

I I I I I I I I I I I I I I I I I I I II II I III I 

Db 851 CTAAAT — 

GTTCACACAGAGAACTGAGGCCTGACACTAGGCATCTGATCTGCAGCAGCAG 79 4 

Qy 116 7 TTCTGGTGGGTGACCCAGTGCATCCAACAGAAACAGCC— GCTGCCCG— 



AACCTCTGTG 1222 

I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I 

I I 

Db 793 CTCTGG-GGTTAGCGTAGTGGAGTCCAC ATCACCCAGGCTTCCCAGCTTCCTCT- 

TG 739 

Qy 12 23 TGAAGCTTTACGCGCACACGGACAAAATGCCCAAACTGG 

AGCCCTTGCAAAA 12 74 

I I III III I I I I I I I I I I III I I I I I I 

Db 73 8 TCCGGTCGGGCGCTATTAAGG 

AGGTGCACATCCTGGGGTAGCAGTGGGAAAAGGGAT 682 

Qy 1275 - 

ACACGGCTTGTGGCATTGGCATACTTGCCCTTACAGGTGGAGTATCTTCGTCACACATC 1333 

I I I I I I I I I I I I I I I I I I I I I III II 

Db 681 GCAAAAGCTGGAGTCA-AGGCAAAGT 

GGGGAGCAACAGGGGCAAGCA — 636 



Qy 1334 

TAAATGAGAAATCAGTGACAACAAGTCTTTGAAATGGTGCTATGGATTTACCATTCCTTA 1393 
I I I I I I I I I I I I II II I I I I I I I I I I I 

I 

Db 635 ATAACGGGACAT— GGAACACCTCGGCCGTG TAGTTC-ATG 

TTGCCCATCCATT 585 



Qy 139 4 TTATCA-C T AAT CATC T AAAC AAC T C 

ACTGGAAATCCAATT 1433 

I I I I I I I I II III III I I I I I 

Db 58 4 CACACAGCTTCTCCTC- 

CAGCTGCTCGAACACCGGCTGCTGCCCCTCCATCTCGGCCGCT 526 

Qy 1434 AAC AAT T T T AC AAC AT AAG AT AG AAT GG AG AC C T G AAT AAT T C 

TGTG 1480 

I I III I I I I I I III II I II I I I I 

Db 525 GCCGCCGCTGGGACATGATTAGCT- 

GTGAGGTGGACAGCGTGGCTCACGGTTGTGACCAC 467 

Qy 1481 -TAATATAAATGGTTTATAACTGC TTTTGTACC-TAG CTAGGCTG- 

CTATT 1528 

II I I I I I I I I I I I I I I I I I I I II II 

Db 466 CACACAATACTGG- 

AAGTGACTGCGGCAGATGATGTCCCGTAGGATCTCCACGGTCCGGC 4 0 8 



Qy 152 9 AT T AC TAT AAT GAG T AAATC AT AAA GCCTT CATCACTC — 

CCACATTTTT 1576 

III I II I I I I I I I I I I I II III I I I I 

I I 

Db 407 

CTTTCAGCAGGCAGCTCAGCACAAACACTGCCTTGGGCTGCTTGGCTCCACCACCAATTG 3 4 8 



Qy 1577 CTTACGG — TCGGA GCA 

T C AG AAC AAG CGTCTAGACTCCTTGGGACC 1621 

I I I I I I I I I III I I I I I I I I I I I I I 

I I I 

Db 3 47 CGT-CGGGCTCGAACTCTCGCAGGTGACAGTCAGGGCCCCCCAC- 

CGCCTCCAGGAGACG 29 0 

Qy 1622 GTGAGTTCC-TAGAGCTTG-GCTGGGTCTAGGC 

TGTTCTGTGCCTCCAAGGA 1671 

I I I I I II I I I I I I I I I I I I I I I I III 



I 

Db 289 

GGTGGATCCGCAGCCCCAGTGCAGGCTCTCGGCGCAGGCGGCGTCCAGGTAAACCACAGC 23 0 

Qy 16 72 CTGTCTGGCAATGACTTGTATTGGCCACCAACTGTAGATGTATATATG GT — 

GCCCT 1726 



Db 229 CCGTTTCACTTTGGCCAGCACCTGCTCCCATCC- 

TTGCTGGGTAAAGGACAGTACGCCCG 171 

Qy 172 7 T CTGATGCT — AAGACTC-CAGACCTTTTGTTTTTGCTTTGCA- 

TTTTCT 1772 



Db 170 AGGCGCTCATGGTTGGGGATTCGCAGA- 

CTTGGGAAACTACGGTGCAGGAACTTCTTTCA 112 

Qy 17 73 GATTTTATACCAACTG 

TGTGGACTAAGATGCATTAAAATAAACATCAG 182 0 

I I I I III I I I I I I I I I I I I 

Db 111 

GAACTCACCGCTTCCGGAAATTGGGCTCCGGGAGACTTTGACAGTCTCCACAGTACACCC 52 

Qy 1821 AGTA ACTC 1828 

I I I I III 
Db 51 AGTAGGCCAACAGTGCTC 3 4 
Qy 1 CAGTTACAGG-

GGGGAG — 3 8 



-GAGCACCACCAGGGAACATCTC 



Db 390 

CAAATACTGGCATGAAGACTGCCTGAAGTGTGCCTGCTGTGACTGTCGCTTGGGGGAGGT 4 49 

Qy 39 CCTGGTTGGAAGCTGCAG GCTTAGTCTGTC 

GGCT 72 



Db 450 

GGGCTCCACCCTGTACACTAAAGCTAATCTTATCCTTTGTCGCAGAGACTATCTGAGGCT 5 09 

Qy 73 GCGGGT CTCTGACTG CCCTG 

TGGGGAGGGT 102 



Db 510 

CTTTGGTGTAACGGGAAACTGCGCTGCCTGTAGTAAGCTCATCCCTGCCTTTGAGATGGT 569 

Qy 103 CTTGCC TTAACATCC CTTGCATT 

TGGCTGCAAAGAAATCT 142 



Db 570 

GATGCGTGCCAAGGACAATGTTTACCACCTGGACTGCTTTGCATGTCAGCTTTGTAATCA 62 9 

Qy 143 GC TTGGAAG AAGGGGTTAC — 

GCTGTTTGGCCGGGCAGA 179 



Db 630 

GAGATTTTGTGTTGGAGACAAATTTTTCCTAAAGAATAACATGATCCTTTGCCAGACGG- 6 8 8 

Qy 180 AACTCCGCTGAGCAGAACTTGCCGCCAGA ATGCTCCTCCTGTT-GCTGAG- 

TATC 232 



Db 6 89 -ACT AC 

GAGGAAGGTTTAATGAAAGAAGGTTATGCACCCCAGGTTCGCTGATCTATC 7 4 4 

Qy 233 ATCGTCCTCCACGTCGCGGTGCTG-GTGCTGCTGTTCGT 

CTCCAC — G 277 



Db 7 45 AACATCACCCCATTAAGAATACAGAGCACTAC- 

ATTCTTTTATCTTTTTTGCTCCACATG 803 

Qy 2 78 ATCGTCA GCCAATGGATC 

GTGGGCA 3 02 

III I I I I I I I III 

Db 804 

TACATAAGAATTGACACAGGAACCTACTGAATAGCGTAGATATAGGAAGGCAGGATGGTT 863 

Qy 3 03 — ATGG ACACGC-AACTG-ATCTCTG- 

GCAGAACTGTAGCACCTCTTC 345 

I I I I I I I I I I I I I I I I I III I I I I I I 

Db 864 

ATATGGAATAAAAGGCGGACTGCATCTGTATGTAGTGAAATTGCCCCAGTTCAGAGTTGA 923 

Qy 3 46 CTCAGGAAATGTCCACC— ACTG 

TTTCTCATC 375 



I I I I I I I I I III I I I I

I I

Db 924 

ATGTTTATTATTAAAGAAAAAAGTAATGTACATATGGCTGGATTTTTTTGCTTGCTATTC 9 83 

Qy 3 76 ATCAC CAAACGA ATGG- 

CTGCAGTCT 400 

I I I I II II I I I I I III 

Db 984 

GTTTTTGTGTCACTTGGCATGAGATGTTTATTTTGGACTATTGTATATAATGTATTGTAA 10 43 

Qy 401 GTCCAG GCCACCA 

TGATCCTGTCGATCA 42 8 

I I I I I I I I I I I I I I II 

I 

Db 10 44 TATTTGAAGCACAAATGTAATACAGTTTTATTGTGTTACCATTTGTGTTCCATTTG- 



Qy 429 TCTT CAGCATT CTGTCTCTG TTCCTGTT- 

CTTC 460 

I I I I I I I I I I I I I I I I I I I I 



Db 1102 

TCTTTGTATTGTTGCATTTAGTACAATCAGTGTTTAAACTTACTGTATATTTATGCTTTC 1161 
Qy 461 TG 

CCAACTCT T 471 

II I I I I I I 

I 

Db 1162 

TGTATTTACCAGCTATTTTAAATGAGCTGTAACTTTCTAGTAAAGAATTGAAAAGCAAAT 1221 

Qy 472 CACCCT CACCAAGGG GGGCAGGTTTTACATC 

ACTGGAA 509 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1222 

C T C AC T AAT G AT AC AC AG AT AG AT AAAGC AAGTC T ATC AAC AT T AAAAAT AC T AAAAAAT 12 81 

Qy 510 TCTTCCA — 

AATTCTTGCTGGTCTGTGCGTGATG 541 

I I I I I I I I I I I I I I I I I I I 

I 

Db 12 82 AAAG AC AC AC AC AG AGC AT T T T AGTGAC AT C C AC T AC TTATTGCCG — 

CTATGAGTTA-G 133 8 

Qy 5 42 AGTGCTGCGGCCATCTACACGGTGAGGCACCC GGAGT 

GGC 581 

I I I I I I I I I I I I I I I I I I I I 

Db 1339 AGT-CTAT — 

CAGTGTTCTTGTTATAACCCCCTATTTTCAGGGGGTTAAAAATCAGCTTT 1395 

Qy 582 ATC TC AAC TCGGATTACTCCTACG 

GTT 608 

I I I I I I II III I I I I I 

I I 

Db 1396 

AAAAAAATACATAAAAATTTCATCTTAAAGCACTTTCATTTTATACCAACGTGAAAAGTG 1455 

Qy 6 09 TC GCCT-ACA TCC TGGCC 

TGGGTG 631 



Db 1456 

CCATTTTTAGAATAACTTTAAAGCTTAACAGGTTTCCTTTTAATATCCTTTTTTTGTGTG 1515 

Qy 632 GCCTT CCCCCTGGC 

CCTTCTCAGCGGTGTCATCTATGTGATCT — 674 

III I I I I I I I I I I I I I I I III 

Db 1516 

CTCTTTACTTACACAATGGCTTTGTTTTGCTTTTTCAGCCACACCCCTTATGTGAACTAG 15 75 

Qy 6 75 TGC GGAAACGCG 

AATGAGGCGCCCAGACGGTCTG 708 

III I I I I I I I I I I I I III 

Db 1576 

TGCCTTTGGGTATCACGTAAAATTTTTTCCAAAGGGTTACTTTAAAAATCTGTTACCACA 1635 

Qy 709 — TCTGAG 

GCTCTGAG 722 



Db 1636 

ATTATGAGATGATTTTTAAGTGATAAATTAAACTTCTTCTTGTATAAATTCTGCCCAGAT 1695 

Qy 72 3 C G T AC AT AGGGAAGGGAGG 

AAGGGAAAACAGAAAGCAGACAAAGAAA 769 



Db 1696 

CTCTCCACAAGAGCTGAGGGTTTCATAACTTTATGGCTTAATAAATGTATGACACTGAAA 1755 



Qy 770 AAA 

GAGCTAGCCCAAAATCCCAAACTCAAACCAAACCAAACAGAAAGCAGTGGAGGT 82 6 



Db 1756 AGATTTGAGTGTG AATCTACTGAAATCACTATAATGCACATTG- 

AAGCTATGATGGT 1811 

Qy 82 7 G GGGGTTGCTGTTGATTGAAG 

ATGTATATAATATCTCC 86 4 

I I I II I I I I I I I I I I III III II 

Db 1812 

AT T T GAG TAG T G AGG T T AC T T T T GAT C GG AGC AAC AT AAT GC T C AT AG AAT C T T C T AG AA 18 71 

Qy 865 

GGTTTATAAA 8 74 

I II I I I I I 

Db 1872 

GAAGAGAAACAAAGGGATTGATAAAATGCTGAGAACTAGTGATTATATATTTTTCTGTAT 1931 

Qy 8 75 — ACCT ATTTAT AACACTTT 

TTACATATATGT 904 

I I I I I I I I I I I I I I I I I I II II 

Db 1932 

TTACCTGACATTTATTTTAATGTTCAAAAAGTAAACACTTTAAGTTTGATGTGTTTTACT 1991 

Qy 9 05 ACATAGTATT— GT — TTGC 

TTTTTATGTTGA 932 

I I I I I I I II I I I I I I I I I I I I 

Db 1992 



CTCTCATTGTTTTAAGTAATTGCCAACTCAGAATACATCATTCTTAGGCTGAAATTTGTC 2 051 

Qy 933 CCAT CAGCCT — CGTGTTGAGCCTT-AAAGAAG- 

TAGC 966 



Db 2 052 TTTCCATTTTTTAAGGTGAAATAGTACTACCTTACGTGAT- 

AGCATACAAAGAAGAAAGC 2110 

Qy 967 

TAAGGAACT 9 75 

Db 2111 

T C T AG AAAG AG AAAT T AT GG AG AAT GAT TAT T T AAAT T AC AAT T AAGG AAAT G AG AAT AT 217 0 

Qy 9 76 TTACATC 

CTAACAGTA 991 

I I I I I III 

Db 2171 

GATCCCCTCTTCCGAGTTGCCCACAAACTTGCTTCTTTGCTTTTGCTCCCTGTAATAGAA 2 23 0 

Qy 992 TAATC CAGCTCAG 

TATTTT 1010 

I I I I I I I I I I I I I I I 

Db 2231 

CTACTTTTCAACAAATCTAATTTTGCACGGCACCGTTAACCATATTTTCACTACAGCAAA 2 2 9 0 

Qy 1011 TGTTTT — GTTTTTTGTTTGTTTGTTTTGTTTT 

AC C C AG AAAT AAG AT AAC T 106 0 

I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 2291 

CTTAGTGCTATGGGTTTTCTTTTTCTTTGTTTTTTTCTTGATCACTTGTATAGGAAACAA 2350 

Qy 1061 CCATCTC GCC CCTTC 

CCTTTCATCTGAA 1088 



Db 2351 

CATTTTCCAGTGTTATTTGCATATATATTTTGTCCTTCCAATATATGCATTACAGATGAA 2 410 

Qy 10 89 AGAAG — ATACCTCCCTCCC 

AG T C C AC C T C AT T T AG AAAAC C AAA 1131 

I I I I I I I I I II I I I I I I I I I I 

Db 2411 

AATTAAATGTTATACCTGAATTCTTGGGTTGGGGCCAAAATATTAAGCTGAAAATAATGC 2 4 7 0 

Qy 1132 — GTGTGG GTAGAAACCCCAA ATGTCCA 

AAA 1160 

I I I I I I II II III III II 

Db 2471 

TGGTGTGGATTTGTTTTAAAACAAAGCTTTATTATGAACATGCATGTGAATCTGGATATT 2 53 0 

Qy 1161 GCC-CTT TTCTG GTGGGTGA 

CCCAGTG 1186 



Db 2531 

GCCTCTTATTTTTAAGAAAATGGTTCTGTGAAAAGTGAATGATATGTATTTTTCCAAATG 2 59 0 
Qy 1187 CATCC AACAGAAACAGCCGCTGCCC- 



GAACCTCTGTGTGAAGCTTTACGCGCACACG 12 42 



Db 2591 

CTTCATGGTTAGGAGTCTTCAAGTTCCATGTTCCCCAGATTTGAGATATACTAAAGAAAG 2 65 0 

Qy 12 43 GA CAAAATGCCCAAACTGGAGCCCTTGCAAA- 

AACACGGCTTGTGGCATTGGCATAC 129 8 



Db 2651 AAATTCAAAAGTAGCTATTTGGGGCCCACAAAAATAACTATTATTTTAGCCTTAG- 

AGCC 2709 

Qy 1299 TTGCCC TTACAGGTGGAGTA TCTT-CGTCAC 

ACATC 1333 

I I I I I I I I I I I I I I I I I I I I I 

Db 2710 

TTACACTTGTTTCAT G AAG AG AAAGG AC T T GC AT AACC AAAAT AAAC AAAGC AAG AC AAA 2 769 
Qy 1334 TAAAT 

G AG AAAT C AG T G AC AAC AAG T C T T T G AAAT GG T GC T ATG 13 7 7 

III I I I I I I I I I I I I I I I I I I I I I I II II 

Db 2770 

TTAAAAATATGTGGGGGAGAGATCAGTGAAAAGTGGTTTTCTTAATGCAGCCCTGCTGGT 2 82 9 

Qy 13 78 GATTTACCATTCCTT 

ATTATCA 1399 

I I I I I I I I I I I I I I I I I 

Db 2830 

CCCCATTAACAATTGCTTGAAATTCACATGGATGTAAAATTATAATTGTCAGGATCTTAT 2 8 89 

Qy 140 0 -CTAATCATC — T AAAC AAC T C AC T GG AAAT C C 

AAT T AAC AAT T T T AC AAC AT AAG 1452 

I I I I I I III I I I I I I II I I I I I I I I I I I 

Db 2890 

TCAGATGATCTTTTAAGGTTTAACTGGTTTTGCTTTTGTTTATCTATATGTCAAAATACT 2 9 49 

Qy 1453 AT AG AAT GG AG AC C T G AAT AAT - T C T G TGT 

AATATAAATG 1491 

I I I I I I I I I I I I I III I I I I II 

Db 2950 

TGTAAATTGGGAACAAACTTCTCTCAGCTTCTTGAAGTTGTTCAACTATCCTTGCCACTG 3 0 09 

Qy 1492 GTTTATAACTGCTT TTGTACCTA 

GCTAGG 152 0 

I I I I I I I I I I I I I I I I I I III I 

Db 3010 

GAAGACCAAACAAGGTTTTCACTGCTTTTTCTTTTACATAATATGCTGAGAATTATTTCT 3 069 

Qy 1521 -CTGCTATT ATTACTATAAT- 

GAGTAAATCATAAAGCCTTCATC 1562 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3070 

TATGCTTTTTACTACAAACAAAATTACTCACCTGGATTAAAGATTAAGGCCTTAATCTGT 312 9 

Qy 1563 ACTCC 

CA 1569 

I I I I II 

Db 3130 

TTAGATTATCTTTAATCTCCATGAAATCGTGAAATAAGACAAGAATAGTGTTTCAGCTGT 3189 



Qy 1570 CATTTTTC TTAC GGTCGGAGCAT CAGAAC 

AAGC 1602 



Db 3190 

AGGCCATTTTACAGCTAATTGCCCATAAATTGTAGCATTTATTGACCTGAAGTACTAAGC 32 49 

Qy 1603 GTCTAGACTCCT-TGGGACCGTGAGTT 

CCTAGAGCTTGGCT 1642 



Db 3250 

TAATTGTCTTGACTACTCAAAGCCCCTGAATTGTTGTCAACTTTCCCCTTTGTGTTGTGT 33 09 

Qy 1643 GG GTC TAGGCTGT — TCTG-TGCCTCCA— AGGACT-GTCTG- 

GCA 1681 



Db 3310 

AGCCCTAACGTCATTTAGCTTGTTGTCTGATGCCTCCAGTAGGACACCTCCGATGGAGCT 3369 

Qy 16 82 ATGACTT GTATTGGCCACCAACTGTAGATGTAT 

ATATGGTGCCCTTCT 1729 



Db 33 7 0 TTGATTTCTGAGCAGCGAAAGCTCCCTTCCTAAGATGCATCTCGCATAGGCTGCC — 

TAT 3 42 7 

Qy 173 0 GATG CTAAGACTCCA GACCT — 

TT 1751 



Db 3428 

GATGAAGGACCGTGCACCTCCACTCCAACAGAGTGCTGAGTTTAAAAGTTGACCTGTGTT 3 4 8 7 

Qy 1752 TGT — TTTTGCTTTGCATTTT-CTGATTTTATACCAACTGTGT — 

GGACTAAGATGCATT 18 06 

III III I I I I I I I I I I I I I I I I I III I II 
Db 3488 TGTAATTTCACTTT- 

CATCTTGCTTAATAAATATCTGCTGGATTCTTTCAAAAAAAAAAA 35 46 

Qy 1807 AAAATAAACATCAGAGTAA CTC 182 8 

I I I I I I I I II II III 
Db 3547 AAAAAAAAAAAAAAAAAAAAGGCCACATGTGCTC 358 0 



RESULT 6 

US-10-478-914-21/C 

; Sequence 21, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 



; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 21

; LENGTH: 3589

; TYPE: DNA

; ORGANISM: Homo sapiens



Query Match 38.9%; Score 4224.5; DB 1; Length 3589;

Best Local Similarity 35.1%; Pred. No. 1.2e-275;

Matches 1149; Conservative 0; Mismatches 649; Indels 1477;
Gaps 162;



Qy 1 CAGTTACAG GGAGCAC CACC— AGGGAACATCT- 

CGGGGAGCCT G 42 



Db 3458 

CTGTTGGAGTGGAGGTGCACGGTCCTTCATCATAGGCAGCCTATGCGAGATGCATCTTAG 3399 

Qy 43 GTTGGAAGCT GCAGGCTTAG TCTGTCGG 

CTGCGGGTC 79 

I I I I I I I II I I I I I II I I I I 

Db 3398 GAAGGGAGCTTTCGCT- 

GCTCAGAAATCAAAGCTCCATCGGAGGTGTCCTACTGGAGGCA 33 4 0 

Qy 8 0 TCTGACTGCC — CTGTGGGGAG GGTCTTGCC 

TTAACATCCC 118 

I I I I I I I I I I I I I I I I 

Db 3339 

T C AG AC AAC AAGC T AAAT G AC G T T AGGGC T AC AC AAC AC AAAGGGGAAAGT TG AC AAC AA 32 8 0 

Qy 119 TT GCATTTGGCTG-CAAAGAAATCTGCTTGGAAGAAGGGGT 

TAC- 161 



Db 3279 

TTCAGGGGCTTTGAGTAGTCAAGACAATTAGCTTAGTACTTCAGGTCAATAAATGCTACA 32 2 0 

Qy 162 GCTGT TTGGCCG— 

GGCAGAAACTC 184 

I I I I I I I I I I I I I I I I I I 

Db 3219 

ATTTATGGGCAATTAGCTGTAAAATGGCCTACAGCTGAAACACTATTCTTGTCTTATTTC 316 0 

Qy 185 -CG CTGAGCAG 

AACTTGCCGCCAG- 207 



Db 3159 

AC GAT T T C AT GG AG AT T AAAG AT AATC T AAAC AGAT T AAGGCC T T AAT C T T T AAT C C AGG 310 0 

Qy 208 

AATGCTCCTCCTGTT 222 

I I I I I I I I I I 

Db 3099 

T GAG T AAT TTTGTTTG TAG T AAAAAGC AT AAGAAAT AAT TC TC AGC AT AT T AT G T AAAAG 3 0 4 0 



Qy 2 23 GCTG AGTATCATCGTCC- 

TCCACGTCGCG 250 



Db 3039 

AAAAAGCAGTGAAAACCTTGTTTGGTCTTCCAGTGGCAAGGATAGTTGAACAACTTCAAG 2 9 8 0 

Qy 251 GTGCTG-GTGCTG-CTGTTC 

GTCT 272 

I I I I I I I I I I I I III 

Db 2979 

AAG C T G AG AG AAG TTTGTTCC C AAT T T AC AAG T AT T T T G AC AT AT AG AT AAAC AAAAG C A 2 92 0 

Qy 2 73 CC ACGATCGTCAGCCAATGGATCGTG 

GG 300 

II I I I I I I I I I I I I I I I 

Db 2919 AAAC C AG T T AAAC C T T AAAAG AT C AT C T G - 

AAT AAG AT C C T G AC AAT TAT AAT T T T AC AT 2 861 

Qy 301 CAATG-GA CACGCAACTGATC TCTGGCAGAACTGT 

AGCACCTC- 3 42 

I I I I I I I I I I I I I I I I I I I I III I 

I 

Db 2860 

CCATGTGAATTTCAAGCAATTGTTAATGGGGACCAGCAGGGCTGCATTAAGAAAACCACT 2 8 01 



Qy 343 
TTCCTCAGGAAATGTCCACCAC-- 



I I I I II I I I I I I I I 

I 

Db 2800 TTTCACT- 

GATCTCTCCCCCACATATTTTTAATTTGTCTTGCTTTGTTTATTTTGGTTAT 2 7 42 

Qy 366 G TTTCTCATCAT-CACCAAACGAATGGCT 

GCAGTCT 40 0 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2741 

GCAAGTCCTTTCTCTTCATGAAACAAGTGTAAGGCTCTAAGGCTAAAATAATAGTTATTT 2 6 82 

Qy 401 GTCCAGGCCACCA— TGATCCTGTCGATCATC- 

TTCAGCATTCTGTCTCTGTTC 451 

I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 2681 

TTGTGGGCCCCAAATAGCTACTTTTGAATTTCTTTCTTTAGTATATCTCAAATCTGGGGA 2622 

Qy 452 -CT— GTTCTTCTGCCAACTCTTCACCCT-CACCAAGGGGG 

GCAGGTTTTACA — 50 0 

I I I I I I III I III I I I II III 

Db 2621 ACATGGAACT — 

T G AAG AC T C C T AAC CAT G AAGC AT T TGGAAAAAT AC AT ATC AT TCACT 2564 

Qy 501 — TCACTGGA ATCT 

TCCAAATTCTTGCTGGTCTGTGC 535 

I I I I I I III I I I I I I I I I 

I 

Db 2 563 T T T C AC AGAACC AT T T TC T T AAAAAT AAGAGGC AAT AT C C AG AT T C — 

ACATGCATGTTC 2506 



Qy 536 GTGATGA GT GCTGCGGC 

CATCTACACGGTGAG 56 7 

I I I I II I I I I I I I I I I 

I 

Db 2505 

ATAATAAAGCTTTGTTTTAAAACAAATCCACACCAGCATTATTTTCAGCTTAATATTTTG 2 4 46 

Qy 56 8 GC ACCC GGAGTGGCA 

TCTCAACTCGGATTACTCCTACGG— 6 06 

II I I I I I I I I I I I I I I I I I I I III 

Db 2 4 45 GCCCCAACCCAAGAATTCAGGTATAACATTTAATTTTCATCT- 

GTAATGCATATATTGGA 23 8 7 

Qy 607 TTTCGCCTACATC-CTGGCCTGGGTGG CCT 

TC— 637 

I I I I I I I I I I III III II 

Db 2386 

AGGACAAAATATATATGCAAATAACACTGGAAAATGTTGTTTCCTATACAAGTGATCAAG 232 7 

Qy 63 8 CCCCTGGCCCTTC 

TCA 653 

I I I I I I I I I 

I 

Db 2326 

AAAAAAACAAAGAAAAAGAAAACCCATAGCACTAAGTTTGCTGTAGTGAAAATATGGTTA 2 2 6 7 

Qy 654 GCGGTGTCAT CTATGT 

GATCT 674 

I I I I I I I I I I I I I I I 

Db 2266 

ACGGTGCCGTGCAAAATTAGATTTGTTGAAAAGTAGTTCTATTACAGGGAGCAAAAGCAA 22 0 7 

Qy 6 75 TGC GGAAACGCGAATGAGGCG 

CCCAGACGG 70 4 

II I I I I I I I I I I I I I II II 

Db 2206 

AGAAGCAAGTTTGTGGGCAACTCGGAAGAGGGGATCATATTCTCATTTCCTTAATTGTAA 214 7 

Qy 705 TCTGTCTGAGGC TCTGAGCGT 

AC 72 7 

I I I I I I II III I I 

I I 

Db 2146 

TTTAAATAATCATTCTCCATAATTTCTCTTTCTAGAGCTTTCTTCTTTGTATGCTATCAC 2 0 8 7 

Qy 72 8 ATAGGGAAG 

G G AG G AAG G G AAAAC AG AAAG C AG AC AAAG AAAAA — 771 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2086 

G T AAGG T AG T AC T AT T T C AC C T T AAAAAATGGAAAGAC AAAT T T C AGC C T AAG AAT GAT G 2 02 7 

Qy 7 72 AGAGCTAGCCCAAAATCCCAAACTC 

AAAC C AAAC C AAAC AG AAA 815 

I I I I I I I I I I II I I I I I I I I I I I I 

I I I 

Db 2 026 TATTCTGAGTTGGC 

AAT T AC T T AAAAC AAT GAG AG AG T AAAAC AC AT C AAAC T T AAA 19 7 0 



Qy 816 G 

CAGTGGAGGTGGGGGTTGCTGTTGATTGAAGATGTATATAATAT 86 0 



I 1 1 I I I 1 1 I I I I I 1 1 1 1 1 1 1 1 

1 1 

Db 1969 GTGTTTACTTTTTGAACA-TTAAAATAAATGT — 

CAGGTAAATACAGAAAAATATATAAT 1913 

Qy 861 CTC C GG T T T AT AAAACC T AT T T AT AAC AC T T T T T AC A 

TATATGT 904 

II I I I I I I I I I I II I I I I I I I I I I 

I I I I 

Db 1912 CACTAGTTCT C AGC AT T T T AT C AAT C C - 

CTTTGTTTCTCTTCTTCTAGAAGATTCTATGA 1854 

Qy 9 05 ACATAGTATTG TTTGCTTTTTATGTTGACCATC — 

AGCCTCGT 9 45 

III I I I I I II II I I I I I I I I I I I I 

Db 1853 GCATTATGTTGCTCCGATCAAAAGTAACCTCACTACTCAAAT — 

ACCATCATAGCTTC — 179 8 

Qy 9 46 G T T G AGC C T T AAAG AAG T AGC T AAGG AAC T T T AC 

ATCCTAACAGTATAATCCA- 99 8 

I I I I I I I I I I I III I I I I I I I I I I I I I I I 

I I 

Db 179 7 AATGTGCATTATAGTGATTTC — 

AG TAG AT T C AC AC T CAAAT CTTTTCAGT G TC AT AC AT 1740 

Qy 999 — GCTCAGTATTTTTGTTTTGTT TTTTGTTTGTTTGTTTTG 

TTTT 1041 

III I I I I I I I I I I I I I I I I 

I I I 

Db 1739 TTATTAAGCCATAAAGTTATGAAACCCTCAGCTCTTG- 

TGGAGAGATCTGGG C AG AAT T T 16 81 

Qy 10 42 AC C C AG AAAT AAG AT AAC TCCATCTC 

GCCCCTTCCCTTTCATCTGAAAGAAGATAC 1097 

I II I I I I I I I I I I I I I I I I I III I I I I I 

Db 1680 

ATACAAGAAGAAGTTTAATTTATCACTTAAAAATCATCTCATAATTGTGGTAACAGATTT 1621 

Qy 1098 CT CCCT CCC 

AGTCCAC 1113 

I I I I I III I I I I I I 

Db 1620 

T T AAAG T AAC C C T T T GG AAAAAAT T T T AC G T G AT ACCC AAAGGC AC T AG T T C AC AT AAGG 1561 

Qy 1114 CTCATTTAG 

AAAACCAAAGTGTGGGTA 1140 

III II I I I I I I I I I I III 

Db 1560 

GGTGTGGCT G AAAAAG C AAAAC AAAG CCATTGTG T AAG T AAAG AGC AC AC AAAAAAAG G A 15 01 

Qy 1141 G AAAC C C CAAAT G TCCAAAA 

GCCCTTTTCTGGTGGG 1176 

I I I I I I II I I I I I I I I I I I I I I II 

I I 

Db 1500 

TATTAAAAGGAAACCTGTTAAGCTTTAAAGTTATTCTAAAAATGGCACTTTTCACGTTGG 14 41 



Qy 1177 TGAC CCAGTGCATCCAACAGAA- 

ACAGCCGCTGCCCGA 1213 

I I I I I I I I III 



Db 1440 

TATAAAATGAAAGTGCTTTAAGATGAAATTTTTATGTATTTTTTTAAAGCTGATTTTTAA 13 81 



Qy 1214 ACCTCTGTGTGAAG CTTTACGCGCACACGGA CAAAAT 

GCCCA 1255 

I I I I I II I I I I I I I I I I I I I I 

Db 1380 

CCCCCTGAAAATAGGGGGTTATAACAAGAACACTGATAGACTCTAACTCATAGCGGCAAT 1321 
Qy 12 56 AACTGGAG 

CCCTTGCAAAAACACGGCTTGTGGC 1288 

I I I I I III III III I I I I I 

Db 1320 AAGTAGTGGATGTCACT — 

AAAATGCTCTGTGTGTGTCTTTATTTTTTAGTATTTTTAAT 12 63 



Qy 12 89 ATTGGCATACTTGCCCTTACAGGTGGAGTATC 

TTC 1323 

III I I I I I I I I I I I I I I I 

Db 1262 

GTTGATAGACTTGCTTTATCTATCTGTGTATCATTAGTGAGATTTGCTTTTCAATTCTTT 12 03 

Qy 132 4 GTCACA CATCTAAA TGAGAAAT 

CAGT 1349 

I I I I I I I I I I I I II I I I I 

I I I I 

Db 1202 

AC T AGAAAG T T AC AGC T C AT T T AAAAT AGC TGGT AAAT AC AGAAAGC AT AAAT AT AC AGT 1143 



Qy 1350 GA CAACAAGTCTTTG 

AAATG 1369 

I I I I I I I I I 

I I I I I 

Db 1142 

AAG T T T AAAC AC T GAT T G T AC T AAATGC AAC AAT AC AAAGAAGC AAATGGAAC AC AAAT G 10 83 

Qy 13 70 GTGCTATGGAT- 

TTACCATTCCTTATTATCACT 1401 

I I I I I II I I I I II I I I I I 

I 

Db 1082 

GTAACACAATAAAACTGTATTACATTTGTGCTTCAAATATTACAATACATTATATACAAT 1023 

Qy 1402 AATC— ATCTAAACAACTCA CTGG 

AAATCC 1429 

I I I I I I I I I I I I I I II 

I I I I I I 

Db 1022 

AG T C CAAAAT AAAC AT C T C AT GC C AAGTGAC AC AAAAACGAAT AGC AAGC AAAAAAAT C C 963 

Qy 143 0 A ATT A 

ACAAT 1439 

I I I I I 

I I I I 

Db 962 

AGCCATATGTACATTACTTTTTTCTTTAATAATAAACATTCAACTCTGAACTGGGGCAAT 9 03 

Qy 1440 T T T AC AAC AT A- AG AT AGA ATGGAG 

ACCTGAATAATT 14 75 

I I I I I I I I I I I I I I III III 



I 

Db 902 

TTCACTACATACAGATGCAGTCCGCCTTTTATTCCATATAACCATCCTGCCTTCCTATAT 8 43 



Qy 14 76 CTGTGTAATATAAATGGTT TATAACTGCTTTTGTACCT 

AGCTAGGC 1521 

II I II I I I I I I I I I I I I I I I I I I III 

Db 842 

CTACGCTATTCAGTAGGTTCCTGTGTCAATTCTTATGTACATGTGGAGCAAAAAAGATAA 7 83 
Qy 1522 

TGCTATTATTACTATAATGAG 15 42 

I I I I I I I I I I I I I I 
Db 782 AAGAATGTAGTGCTCTGTATTCT- 

TAATGGGGTGATGTTGATAGATCAGCGAACCTGGGG 72 4 

Qy 15 43 TAAATCATAAAGCCTTCATC— ACTCC 

CACATTTTT 1576 



Db 723 

TGCATAACCTTCTTTCATTAAACCTTCCTCGTAGTCCGTCTGGCAAAGGATCATGTTATT 66 4 

Qy 1577 CTTACGG TCGGAGCATCAGA 

ACAAGC 16 02 

I I I I I I I I I I I I I I I 

Db 663 

CTTTAGGAAAAATTTGTCTCCAACACAAAATCTCTGATTACAAAGCTGACATGCAAAGCA 6 0 4 

Qy 16 03 GTC TAGAC TCCTTGGGAC-CGTGA — 

GTTCCTAGAGCTTGGCTGGG 1645 

III I I I I I I I I I I I I I I I I I II II II 

Db 603 

GTCCAGGTGGTAAACATTGTCCTTGGCACGCATCACCATCTCAAAGGCAGGGATGAGCTT 54 4 

Qy 1646 — TCTAGGCTG TTCTGTGCCTCCAAGGA — CT GTCT 

GGCAATG 16 8 4 



Db 543 

ACTACAGGCAGCGCAGTTTCCCGTTACACCAAAGAGCCTCAGATAGTCTCTGCGACAAAG 4 8 4 

Qy 16 85 ACT TGTATTGG CCAC 

CAACTGT 1706 

I I I I I I I I I I I III 

Db 483 

GATAAGATTAGCTTTAGTGTACAGGGTGGAGCCCACCTCCCCCAAGCGACAGTCACAGCA 42 4 

Qy 170 7 AG AT GTATATATG GTGCCCTT 

CTGATG 1733 



Db 423 

GGCACACTTCAGGCAGTCTTCATGCCAGTATTTGTCCAGTGCCTTTAGAAGATACCGGTC 36 4 

Qy 173 4 CTAAGACT CCAG ACCTTTTGTTTT — 

TGCTTTGCATTTTCTGA — 1774 

II II I I I I I I I I I I I I I I I I I I 

Db 363 CTTGATCTTTCGGTTGCAGCCAGCACAACCTTTCGGCTTGGTGTCTGGC- 

TGGACTGAGA 3 05 



Qy 1775 TTTTATACCAA CTGTGTG- 

GACTAAG 1799 



Db 304 

GCATTTGTATACCAAAACTTTTCTCCTTTTTCTGCATGAGTTGACTTAGGCGTGTCTGAG 2 45 

Qy 1800 ATGCA T T AAAATAAACATCAGAGT AAC TC 182 8 

I I I I I I I I I I I I I I I I 

Db 2 44 TTGCAGCAGCTTCGCATTGAGCACCA AACCC 214 



RESULT 7 

US-10-478-914-64 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2

; SEQ ID NO 64

; LENGTH: 2120

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-64 

Query Match 38.9%; Score 4221.5; DB 1; Length 2120; 

Best Local Similarity 44.1%; Pred. No. 9e-277; 

Matches 990; Conservative 0; Mismatches 705; Indels 549; 
Gaps 134;

Qy 1 CAGTTACAG-GGAGCACCACCAG — GGAACATCT- 

CGGGGAGCCTGGTTGGAAGCTGCAG 56 



Db 5 CCTGTACGGAAGTGTTACTTCTGCTCTAAAAGCTGCGGAATTCC 

TCGAGCACTGTTG 61 

Qy 57 GCTTAGTCTGTCG-GCTGCGG — GTCTCTGAC-TGCCCTGTG — 

GGGAGGGTCTTGCCTT 110 

I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 62 GCCTA— 

CTGGAGTGCTGAAGTAGGCGCGGACGTGCCCGGTGCCTGGCGCGTGGTAGC-A 118 
Qy 111 

AACATCCCTTGCATTTGGCTGCAAAGAAATCTGCTTGGAAGAAGGGGTTACGCTGTTTGG 17 0 



Db 119 GGCGCCCGGTGCCCCGGCCGGCGAAGACCATGGCGTTCATGGTGAAG — 



ACCATGGTGGG 176 

Qy 171 CCGGGCAGAAACTCCGCTGAGCAGAACTTGCCGCC-AGAATGCTCC- 

TCCTGTTGC 224 



Db 177 -CGGCCAG 

CTGAAGAACCTCACTGGGAGCCTGGGAGGCGGCGAGGATAAGGGAGA 23 0 

Qy 225 TGAGTATCA-TCGTCCTCCACGTCGCGGTGCTGGTGC TGCTGTTCG- 

TCT 272 



Db 231 

TGGGGACAAGTCGGCAGCCGAAGCTCAGGGCATGAGCCGGGAGGAGTACGAGGAGTATCA 2 9 0 

Qy 2 73 CCACGATCGT CAGCCAATGGATCGTGG-GCAATGGACACGCAACTGAT — 

CT 321 



Db 291 GAAGCAAC — 

T C G T G G AAG AG AAG AT GGAGCGGGAT G C AC AG T T C AC AC AG AG G AAG G C A 3 4 8 

Qy 322 CTGGCAGAACTGTAGCACCTCTTCC TCAGGAAATGTCCACC 

ACTGTTTCT 371 



Db 349 

GAGCGGGCCACACTGCGGAGCCACTTCCGAGACAAATACCGGCTACCCAAGAACGAGACA 4 0 8 

Qy 3 72 CATCATCACCAAA — CGAATGGC TGCAGT CTGTC 

CAGGCCACC 412 

III I I I I I I I I I I I I I I I I I I 

Db 409 

GATGAGAGCCAGATCCAGATGGCAGGTGGAGACGTGGAGCTGCCCCGGGAGCTGGCCAAG 46 8 

Qy 413 ATGATCCTGTCGATCATCTTCAGC 

ATTCTGTCTCTGTTCCTGTTCTTCT 461 



Db 469 ATGATCGAGGAGGACA 

CAGAGGAGGAGGAGGAGAAGGCCTCAGTCCTTGGGCAGCT 52 4 

Qy 462 -GCCAACTCTTC ACC CTCAC-CAAGG 

GGGGCAGGTTTTACATC 502 



Db 52 5 GGCCAGC-CTTCCTGGCTTGAACCTGGGCTCACTCAAGGACAAGGCCCAGG- 

CCAC 578 

Qy 503 ACT — GGAATCTTCCAAATTCTTGCTGGTCTGTG CGTGATGAGT — 

GCTGC G 550 



Db 5 79 ACTGGGGGATCTCAAGCAATC- 

AGCTGAGAAGTGTCACGTCATGTGACCACTTCCCCGGG 637 

Qy 551 GCCATCTACACGGTGAGGCACCCGGAGTGGC ATCTCAACT — CGG- 

ATT 596 

I I I I I II I I I I I I III I I I I I I I II 



GTTACCCACTGGGCTGGGCCCCCATGAGGGCTAAGAGTGTGTCAACTTCCAGGGACCCAT 69 7 

Qy 59 7 ACTCC-TACG GTTTCGC CTACATCCTGG — CCTGGG 

TGGCCTTC 637 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 698 

ACTCCATTTGGGGCTTTGTTTCCCTTGCCCCATCCTAGTTCCAAGACCTTTCCCATCCAT 75 7 

Qy 63 8 -CCCCTGGCC CTTCTCAGC — GGTG 

TCATCTAT 66 7 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 758 

GCCCCAAGCCTATCTTCTGGTTTCTTCCTCTCCGCTGGGAGTAAAGTCCCCATCTTCACT 817 

Qy 66 8 GTGATCTTGCGGAAACGC GAATGAGG 

CGCCCA 699 

I I I I I I I I I I I I I I I I I I I 

Db 818 

CTACCCTTCAGGACCCTCCCCACCAGCTCAGCCTGTGGAGGCCTCCCAAGATTGTAGGAA 8 7 7 

Qy 70 0 GACGGTCTGTCTGAGGCTCTGAGC 

GTACATAGGGAAGGGAGGAAGGGAAAACA 752 

II II III III II I I I I I I I I I I I 

I I I 

Db 8 78 TAGGCCCATCCCTCTCTGGCCATGGCCCCAAGTTCCTGCACACAGGAGCA 

CCCACA 933 

Qy 753 G AAAG - C AG AC AAAGAAAAAAGAG CTAGC CCAAAATCC- 

CAAACTCA 79 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 934 

GAGAGACACACACAGGACACAAAACCCCTGGCACGTTCAGAGACAGAAGCCACAGACACA 993 

Qy 79 8 AACCAAACCAAACAGAAAGCAGTGGAGGTGGGGGTTGCTG- 

TTGATTGAAGATGTATATA 856 

II I I I I I I I I I I I I I I III III III 

I 

Db 99 4 T C C C G G C AC AG AC AG AC A- C AC AC G AGGC CAGCTCCCTTGC 

GTGTCCA 1040 

Qy 857 ATATCTCCGGTTTATAAAACCTATTTATAACACT 

TTTTACATATATGTAC 9 06 

I I I I I I I I II I I I I I I I I I I I 

I 

Db 10 41 GCCCCTCCAG 

AC AC C AC C AC T C AG AAAC T C T G AG AGAGAGC ATGGGC AGAC AC C C T C 109 7 

Qy 907 ATAGTATTGTTTGCTTTTTATGTTGACCA 

TCAGCCTCGTGTTGAGCCTTAAAGAAGT 963 

I I I II I III III II III I I I I I I I I I I 

Db 109 8 AGCAGACAGGAGGC CTGAGTT — CCAGTCTCCACCTT- 

TATTGTTCTTGAAAGCCCC 1151 

Qy 96 4 AGCT AAGG AAC T T T AC ATCC T - AAC AG T AT AAT 

CCAGCTCAGTATTTTTGTTTT 1016 

III I I I II I I I I I I I I I I I I I I I I I I I 



Db 1152 

TGCTCTCTCTGAGCCTTATTTCATCATCTGTAAAATGGGAATGTCCTGAATGACTTCTAA 1211 



Qy 1017 

GTTTTTTGTTTGTTTGTTTTGTTTTACCCAGAAATAAGATAACTCCATCTCGCCCCTT — 10 7 4 
I III I I I I I III I I I I III III II 

Db 1212 GGCTCTTTCTGGCTTGAACTGT CAGAGCCAAGC 

CCACATCCCTCCTTGG 1260 

Qy 10 75 CCCTTTCATCTGAAAGAAGATACCTC— CCTCC— CAGT— CCAC 

CTCATTTAG 1122 



Db 1261 GCAGGGCAGC AGCTGCTGCCACAGCCTCCAGCGGCTGCCACTGTGGGCTC 

TG 1312 

Qy 1123 AAAACCAAA GTGTG 

GGTAGAAACCCCAAATGTCCAAAAGCCCTTTTCTG 1171 



Db 1313 

GGAGCCGGAGCGATGCTGTGTGAGAGGCAGAGTGCCAAGGATGAAGCTGGCACTGAACAG 13 72 

Qy 1172 GTGGGTGACCCAGTGCATCCAACAGAAACA— GCCGCTGCCCGAACCTCTGTG- 

TGAAGC 122 8 



Db 13 73 TAAGCGGCTCCAG-GCCTCCTCTGGGCCCAGGGCC-CAGCC — 

AATTTCTGTTCTGTTCC 1428 

Qy 1229 TTTA- 

CGCGCACACGGACAAAATGCCCAAACTGGAGCCCTTGCAAAAACACGGCTTGTGG 12 8 7 



Db 1429 TGTAGAACGCTCTCTG GATTCCATAGCTGG 

AATCTCCTCTCTTAG 1473 

Qy 12 8 8 CATTG-GCATACTTGCCCTTACAGGTGGAGTATCTT- 

CGTCACACATCTAAATGAGAAAT 13 45 

I I I I I I I II I I I I I II I I I I I I I I I I I I 

Db 1474 CTCAGTGAAAAATAAAAATCCCAAATGGTGTGCCTACCTTCCCACTTCTTACTG- 

GCTTC 1532 

Qy 13 46 C AG T G AC AAC AAG T C T T TGA AATGGTGCTATGGATTTACCAT 

TCCTTA 1393 

III I I I I I I I I I I I I I I I I I I 

Db 1533 CAG 

GAGTCTTGGAGTTCATAGCCCCCCGAGCCTGCCTTAAAGGGGTGTCCT — 15 83 

Qy 139 4 TTATCACTAATCATCTAAACAAC-TCACTGGA AATCCAAT 

TA 143 4 



Db 158 4 — CCACCCCCCACCT— 

ACAGCTTCACAGGAGGGGAGAGGGCATCCAGTGCTAGGAGTA 163 8 

Qy 1435 AC AAT T T T AC AAC AT AAGAT AGAATGG AG AC C T G AAT AAT TCTG- 

TG 1480 

III II I II I I I I I I I I II I I I I 

I I 



Db 1639 

GAAGTGTCTCCAGCTCTGTTCTCTTGGGGCCCTGGGTGAAGGTGGGGTCTGGGGCTTATG 169 8 

Qy 1481 TAATATAAATGGTTTAT AACTGC TTTTGT 

ACCT 1513 

I I II I I I I I I I I I I I I I I 

I I I 

Db 1699 

AAATAGGTCTGGGCTTTGAGGAGGATGGAGCAGCCTCATTATGTGGGGAAGATGGGGCCT 175 8 

Qy 1514 

AGCTAGGCTGCTATTATTACTATAATG 1540 

I I I I I I I III 

I I 

Db 1759 CTGGGGCGTCACTGAGACCACAGGTGGGGCCGGGGCTGGACCGC 

AGCTGTCTTG 1812 

Qy 1541 AGTAAATCATAAAGCCTTCATCACTCC- 

CACATTTTTCTTACGGT C GG AGC AT C AG AAC A 1599 

II I I I I I I I I I I I I I I I III I I I I I II I 

I I 

Db 1813 GGTGCCT GTGCCTGCACCCCTCCTCAC C C T AG AG AC GG AAGAT — 

GTGCA 186 0 

Qy 16 0 0 AGCGTCTAGACTCCTTGGGACCGTGAGTTCCTAGAGCTTGGCTG GGTCT 

AGGCT 1653 

I III I I I I I I I I I I I I I III III I 

I I 

Db 1861 AAAAGAAAGAAGGAAGGGCAAC-TGCATTCC- 

AGCCCCACACTGTGATGACTTTGAGCCT 1918 

Qy 1654 GTTCTGTGCCTCCAAG-GACTGTCTGGC AATGACTTGT 

ATTGGC 1696 

I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1919 

GTCCTTTCCCTCCTTGAGCCTGTCTTGCTTGTCCCCTGTAAAATGAACAGTCCCCCTCTC 19 78 

Qy 169 7 CACCAACT GTAGATGTAT-ATATGGTGCCC— TTCTGATGC-TAAGAC- 

TCC 1743 



Db 1979 

CCCCAAATAGTAATAATACATGTTTCAAAGGGTGACCATTTATAAAGCATATGACAAACC 2 03 8 
Qy 1744 

AGACCTTTTGTTTTTGCTTTGCATTTTCTGATTTTATACCAACTGTGTGGACTAAGATGC 18 03 
I I I I I I I I II Mil I I I I III 

I I I 

Db 2 039 ATATCAATAAATGTAACT CA — 

TTCTTTTAAAAAAAAAAAAAAAAGGCCACATGTGC 2 093 

Qy 1804 ATTAAAATAAACATCAGAGTAACT 182 7 

I I I I I I I II 
Db 2 09 4 -TCGAGCTGCAGGTCGCGGCCGCT 2116 



RESULT 8 

US-10-478-914-20 

; Sequence 20, Application US/10478914 



; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA

; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294 
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 
; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226

; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20

; LENGTH: 2495
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-20 

Query Match 38.8%; Score 4214; DB 1; Length 2495; 

Best Local Similarity 42.5%; Pred. No. 7.8e-276; 
Matches 1079; Conservative 0; Mismatches 682; Indels 776; 
Gaps 146; 

Qy 1 CAGT TACAGGGAGC-ACCACCAGG GAACATCTCGGGGAG 

CCTG 42 

III I I I I I I I I I I I I I I I I I I III 

Db 3 CTGTTGGCCTACTG 

GCTACTATCAGGATCGTGGCTATTTTGAAGAGCTGATCACCAT 59 

Qy 43 GTTGGAAGCTGCA GGCTTAGTC 

TGTCGGCTG 73 

I I I I I I I I I I I I I I I I I III III 

Db 6 0 

GTTGGAAGCAGCACTGGGACTTGAGCGAGCTCACATGGGAATGTTTACTGAATTAGCTAT 119 

Qy 74 -CGGGTCTCTGACT GCC CTGTGG GGAG 

GGTCTTG 106 



Db 120 

TCTATACTCTAAATTTAAGCCTCAGAAAATGAGGGAGCACCTGGAGCTGTTCTGGTCTAG 179 

Qy 107 CCTTAACA-TCCCT TGCATTTGGCTGCAAAGAAA — TCTGCTT — 

GGAAGAAGGGGT 158 



Db 180 

AGTGAATATTCCCAAGGTGCTAAGAGCTGCAGAACAAGCTCATCTTTGGGCAGAACTGGT 239 

Qy 159 TACGCTGTTTG GCCGGGCAGAAAC- 

TCC 185 



Db 240 

GTTTTTGTATGACAAGTATGAAGAATATGATAATGCCATAATTACCATGATGAATCATCC 2 99 



Qy 



186 -GCTGA- 



-GCAGAAC- 



-TTGCC- 



GCCAGAATGCT 213 

I I I I III I I I I I I I I I II 

Db 300 

AACTGATGCCTGGAAAGAAGGGCAATTCAAAGATATCATTACCAAGGTTGCCAATGTGGA 359 

Qy 214 CCTCCTGTTGCTGAGTATCATCGTCCTC 

CACGTCGCGG 251 

II I I I I I I I I I I I I I I I I I I 

Db 36 0 ACT- 

ATACTACAGAGCAATACAGTTCTACTTAGAATTCAAGCCTCTGTTGTTAAATGATT 418 

Qy 252 TGCTGGTGCTGCTGTTCGTCTCCACG ATCGT- 

CAGCCAATGGATCGTGG 299 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 419 TGCTGATGGTGCT GTCTCCACGGTTGGATCACACTCGTGCAGTCAATTATT — 

TCA 472 

Qy 300 GCAATGGACACGCAACTGATCTCTGGCAGAACTGTA GCACCTCTTCC— 

TCAGGA-A 353 

I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I 

Db 4 73 GCAA-GGTTAAACAGCT-ACCACTGGTGAAACCGTATTTGCG 

TTCAGTTCAGAACC 52 6 

Qy 354 ATGTCCACCA— CTGTTTCT-CATCATC-ACCAAACGAATGGCTGCAG 

TC 399 

II I I I I I I I I I Mill I I I I I I I I I I 

I I 

Db 527 

ATAACAACAAATCTGTGAATGAATCATTGAACAATCTTTTTATTACAGAAGAAGATTATC 586 

Qy 40 0 TGTCCAGGCCACCAT GATCCTGTCGATCATCTTCAGCATTCTGTCTC- 

TGTTC — 451 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 58 7 AG GC T C T GC G AAC AT C AAT AG AT GC T T AT G A- 

CAACTTTGACAATATCTCGCTTGCTCAG 6 45 

Qy 452 CT GTTC TTCTGC- 

CAACTCTTC 472 

II I I I I I I I I I I I I I I I I 

Db 646 

CGTTTGGAAAAACATGAACTCATTGAGTTCAGGAGAATTGCTGCTTATCTCTTCAAAGGC 7 05 

Qy 4 73 -ACCCTCAC CAAGGGGGGCAGGTTTTACAT 

CA 503 

I I I I I I I I I I I I I I I I I I I 

I I 

Db 706 

AACAATCGCTGGAAACAGAGTGTAGAGCTGTGCAAGAAAGACAGCCTTTACAAGGATGCA 765 

Qy 50 4 CTGGAATCTTC — CAAATTCT TGCTGGTCTGTGCGTGATGAG- 

TGCTGCGGCCAT 555 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 766 ATGCAGTATGCTTCTGAATCTAAAGATACTGAATTG-GC- 

TGAAGAACTCCTGCAGTGGT 823 

Qy 556 CTACACGGTGAGGCACCCGGAGTG-GCATCTCAAC TCGGATTA- 

CTCCTACGGTTTC 610 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I 



Db 824 TTTTGCAG- 

GAAGAAAAAAGAGAGTGCTTTGGAGCTTGTCTGTTTACCTGTTACGATCTT 8 82 



Qy 611 GCCT ACATCCTGG — CCTG GGTGGCC 

TTCCCCCTG 643 

III I I I I I I III I I I I I I I I I 

I I 

Db 883 

TTAAGGCCAGATGTCGTCCTAGAAACTGCATGGAGGCACAATATCATGGATTTTGCCATG 9 42 

Qy 644 GCCCTTCTCAGCGGTGTCATCTATGTGATCTTG 

CG 678 

II I I I I I I I I I I III I I I I 

I 

Db 943 

CCCTATTTCATCCAGGTCATGAAGGAGTACTTGACAAAGGTGGATAAATTAGATGCTTCA 10 02 

Qy 6 79 GAAACGCGAATGAGGCGCCCAGACGGTCTGTCT 

GAGGCTCTGAGCGTACATAGGGAA 735 

I I I I I I I I I I I I I I II I I I I I II I I 

Db 10 03 GAATCAC 

T G AG AAAAG AAG AAG AAC AAGC T AC AG AG AC AC AACCC AT TGTTTATGGT 1059 

Qy 736 GGGAGGAAGGGAAAACAGAAAGCA- 

GAC AAAGAAAAAAGAGC T AGCCCAAAATCCCAAAC 794 

I II I I I I I I I I I I I II II III I I I I 

I 

Db 106 0 C AG C C C C AG T T G AT GC T G AC AGC AGG AC C C AG T G T TGCCGTCCCTC 

CCCAGGC 1112 

Qy 795 TC AAACCAAACCA AACAGAAAGCAG — 

TGGAGGTGGGG 83 0 

I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1113 ACCTTTTGGTTATGGTTATACCGCACCACCGTATGGACAGCCA- 

CAGCCTGGCTTTGGGT 1171 

Qy 831 GTTGC-TGTTGATTGAA — GATGTATATAATATCTCCGGTTT 

ATAAAACCTATTT 8 82 

I I I I I I I I I III I I I I I III I I I I I I 

I 

Db 1172 

ACAGCATGTGAGATGAAGCGCTGATCCTGTAGTCACCTATTTTCGTACTGAAACATCGTC 1231 

Qy 8 83 ATAACACTTTTTACA TAT -AT G 

TACATAGTATTGTTTGCTTTTT 925 

I I I I II II I I I I I I III I I I I I I I I I 

I 

Db 1232 

TTTACCCACTTCTCAGTTTATAATGGGGGAAAACAGGCAACGTGTTCTTGTAACCTTTAT 12 91 

Qy 926 

ATGTTGACCATCAGCCTCGTGTTGAGCCTTAAAGAAGTAGCTAAGGAACTTTACATCCTA 9 85 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1292 

TTCATGAAGGACTTCTTTTTGTTTCTAACTATAAACTTGGATCACCTATGTTAAAACCTT 1351 



Qy 9 86 ACAGTATAATCCAGCTCAGTATTTTTGTTTTGTTTTTTGTTTG TTTGTTT 

TGTT 1039 



Db 1352 ATTTCACATTCCACATC 

AT T T T AG AAT TTATTTTC G AAGGGGAAT AGT T TC AATGT T 14 0 8 

Qy 10 40 TTACCCA G AAAT AAG AT AAC TCCATCTCGCCC CTTCCC 

TTTCAT 1083 

I I I I I I I I 1 I I I I I I I I I I II 

I I 

Db 1409 

TTATTCACTTGGGCTTTTTTTCTTCCCCCTCTTTCTTTAAAGAACTGCTCAATATTCAAT 146 8 

Qy 1084 C T G - AAAG AAG AT AC C T CCCTC CCAGTCCAC- 

CTCAT 1118 

III I I I I I I I I I I I I I II I I I I 

I I 

Db 1469 CTGTTGTGAAGA- 

ACCTGATTTGCACTCTGTAGTGTTTAAAGAAACAAAGAAACTCTAAT 152 7 

Qy 1119 TTAGAAAACC AAAGTGTGGGTAGAAACC — CCAAA 

TGTCCAAAAGCC 1163 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1528 

ATTGAATCTCTTAAATTTAGTGTATGTAAACAGCTTACAAATACGTATTGTCTAAATGCA 15 8 7 



Qy 1164 CTTTT 

CTGGTGGGTGACC 1181 

I I I I I I I I I I I I I 

Db 1588 

T T T AAAT C T G T T T T AT TC AAAG AAAAGC T AAAGC AAAAAC AC TGGC AT ATGAC C AT GC AA 16 4 7 

Qy 1182 C AG T GC AT C C AAC AGAAAC AGC CGCTGCCCGAAC CTCTG 

TGTGA 1225 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 16 48 GACTGTCAGTG C C AAC AAAGAC AAC AC T AAT C AGC AC - 

ATCGTACACTGGATTGCAG 1703 

Qy 1226 AGCTTTAC GCGCAC ACGGACAAAATGCCCAA 

ACTGGAGCCCTT 126 8 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1704 

TGCTTCCCAGATTATTGAAAAATGTTACAGACAACTTGCCTGATTTTTAAATGAGC 1759 

Qy 1269 GCAAAAACACGGCTTGTGGCATTGGCATACTTGCCC TTACAGGTGG 

AGT 1317 

I I I I I I I I I II III I I I I I III II 

I I I 

Db 176 0 GTAAAAG 

GCCCTCTAACCTATGCAGGTTTCCCCATTATGCATATAGAAAATGCTAGT 1816 

Qy 1318 ATCTTCGT C AC AC ATC T AAATGAGAAAT C AG T G AC AAC AAG T 

CTTTGAAATGGTGC 1373 

I I I I I I I I I I II III I I I I III III I 

I I I I 

Db 1817 ATGTTTTGCTCACTTC ATATGTAACAGGTGCCCTTATGTTGTGCTGTATCCT- 

GTGC 1872 



Qy 1374 TATGGATTT ACCATTCCTT- 

ATTATCACTAATCATCTAAAC 1413 



Db 1873 

TTTTTCTGTGGGACCATTCCATTCAGGAGCAAAGAGCACCATGATTCCAATCTTGTGTGT 1932 

Qy 1414 AAC TCACTGGAAAT CCAATTAACAATT 

TTAC — 1444 

I I I II I I I I II III I 

Db 1933 

GTTTACTAACCCTTCCCTGAGGTTTGTGTATGTTGGATATTGTGGTGTTTTAGATCACTG 1992 

Qy 14 45 AAC AT AAG AT AG AATGGAGACC TG AAT AAT T C T G T 

GTAATATAAATGG— 1492 

I I I I I I I I I I I II I I I I I I I I I II II 

Db 1993 

AGTGTACAGAAGAGAGAAATTCAAACAAAATATTGCTGTTCTTCAGTTTTGTTTGTGGAA 2 052 

Qy 1493 TTTATAACTGCT TTT GTACC T AGC T AGGC TGC T AT TAT T AC - TAT AAT 

GAG 1542 



Db 2053 

TTTGAAATTACTCAAATTTAAAATAAATTACTGGACTGTGGAAATAACATAGAATTGAAG 2112 

Qy 15 43 TAAATCATAAAGCCTTCATCACTCCCAC 

ATTTTTCT 1578 

I I I I I I I I I I I I I I 

Db 2113 TTT T AAT T AAAT AC 

C AC T C AAAC G AAAAG AAC AG TAG T T T T T G TAG TTTTATATT 216 7 

Qy 1579 TACGG TCGG — AGCATCAGAACAAGCGTCTAGACTCCTTGGGACC- 

GTGAG 1626 



Db 216 8 GGATACTGAGGCATTAGGGAGGCATGAAAGGAAGAGGAATGA- 

GG AT T GAG AC AT G T G AA 2226 

Qy 1627 TTCCTAGAGCTTGGCTGGGTC — TAGGCTGTTCTGTGCCTC 

CAAGGACTGTCTGG 1679 



Db 222 7 GACATTGTGCAT TATATCAATGTGCATTCCTGTAGTTCATTAACAAGGT 

ACATG 22 8 0 

Qy 16 8 0 CAAT-GACTTGTATTGGCCA CCAACTGTAGATG 

TATATATGGTGCCC 1725



Db 22 81 CAATAGTCT — 

AAAGAACCAGAGTCACTACTATAGTGGCTTAACATTTAATCTGTCTCCA 2338

Qy 1726 TTCTGAT-GCTAA — GACTCCAGACCTTTTGTTTTTGCTTTGCATTT- 

TC 1771 



Db 2339 

ATATTTTAACCAAGTGACACCGGGGTTTTTATCGAAGCATTTCACTTAAATGAACAAATC 2398

Qy 17 72 T — GATTTTATACCAACTGTGTGGACTAAGATGCATTAAAA 

TAAACATCAGAGTAA 1825 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 



Db 2399 ATGGCTGTTATATTAACT 

TGAAATAAAATATATTTAAACATGTAAAAAAAAAAAAAA 2 455 

Qy 1826 CTC 1828 

I I I 

Db 2456 AAAGGCCACATGTGCTC 2472 



RESULT 9 

US-10-478-914-64/C 

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64

; LENGTH: 2120

; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-64 

Query Match 38.8%; Score 4211; DB 1; Length 2120; 

Best Local Similarity 43.9%; Pred. No. 4.9e-276; 
Matches 994; Conservative 0; Mismatches 669; Indels 602;
Gaps 139;

Qy 5 TACAGGGAGCACCACCAGGGAACATCTCGGGGA 

GCCTGGTT 45 

I III I I I I I I I I I I I I I I I I I I II 

Db 2119 TCTAGCGGCCGCGACCTG 

CAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTAA 2 06 4 

Qy 46 — GGA — AGCTGCA GGCTTAGTCTGTCGGCTGCGGGTCTCTGACTG 

CCCT 91 



Db 2 063 AAG AAT G AG T T AC AT T T AT T G AT AT GG T TTGTCATATGC 

TTTATAAATGGTCACCCT 2007 

Qy 92 GTGGGGAGGGTCTTGCCTTAACATCCCTTGC- 

ATTTGGCTGCAAAGAAATCTG 143 

I I I I I I I I I I I I I I I I I I 

Db 2 0 06 TT G AAAC AT G T AT T A 

TTACTATTTGGGGGAGAGGGGGACTGTTCATTT 1959 

Qy 144 CTTGGAAGAAGGGGTTACGCT — GTTTGGCCGGGCAGAAACTC 

CGCTGA 190

I I I I I I I I I I I I I I I I I I I I I I I 



I 

Db 1958 

TACAGGGGACAAGCAAGACAGGCTCAAGGAGGGAAAGGACAGGCTCAAAGTCATCACAGT 1899 



Qy 191 GCAGAACTTGCCGCCAGAATGCTCCTCCTG 

TTGCTGAGTATCATCGTCCTCCACG 245

I I III III I I I I I I I I II II I I I I I I I I I 

I 

Db 1898 GTGGGGCTGGAATGCAG-

TTGCCCTTCCTTCTTTCTTTTTGCACATCTTCCGTCTCTAGG 1840

Qy 246 TCGCGGTGCTGGTGCTGCTGTTC-

GTCTCCACGATCGTCAGCCAATGGATCGTGGGCAAT 304 

I I I I I I I I I I I I I I I I I III II I II 

Db 1839 GTGAGGAG-GGGTGCAG — GCACAGGCACCCAAGACAGCTGC GGTCC 

AGCCCC 1790

Qy 3 05 GGACACGCAACTGATCTC — TGGCAGAACTGTAGCACC- 
TCTTCCTCAGGAAATG 356 

I I I I I I I I I I I III II I I I I I I I I I I I I I I I I 

Db 1789 

GGCCCCACCTGTGGTCTCAGTGACGCCCCAGAGGCCCCATCTTCCCCACATAATGAGGCT 1730

Qy 357 TCCACCACTG 

TTTCTCATCATCACCAAACGAATGGCTGCAGTCT 400 

I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1729 GCTCCATCCTCCTCAAAGCCCAGACCTATTTCATAAGCCCCAGAC 

CCCACCTTC- 1676 

Qy 401 GTCCAGGCCACCA — TGATC CTGTCGA-TCATCTTC AGCATT 

CTGTC 444 

I I I I I I I I I III III II I I I I I I I I I I I 

I I 

Db 1675 

ACCCAGGGCCCCAAGAGAACAGAGCTGGAGACACTTCTACTCCTAGCACTGGATGCCCTC 1616 

Qy 445 TCTGTTCCTGTTCTTCT 

GCCAACTCTT 471 

II I I I I I I II I I I I I I I 

Db 1615 

TCCCCTCCTGTGAAGCTGTAGGTGGGGGGTGGAGGACACCCCTTTAAGGCAGGCTCGGGG 1556 

Qy 4 72 CACC CTC ACCA AGGGGGGCAGGTT-TTACATCA- 

CT 505 

III III III I I I I I I I I I I I I I I 

I 

Db 1555 

GGCTATGAACTCCAAGACTCCTGGAAGCCAGTAAGAAGTGGGAAGGTAGGCACACCATTT 1496 

Qy 506 GGAATCTTCCAAATTCTTGCTGGTCTGTGCGTGATGAGTGCTGC GG 

CCA 55 4 

I I I I I I III III II III I I I I I I II III 

Db 1495 GGGATTTTTATTTTTC — 

AC T G AG C T AAG AG AG GAGATTCCAGC T AT GG AAT C C AG AG AG 143 8 

Qy 555 — TCTACACGGTGAGGCACCCGGAGTGGC 

ATCTCAACTCGGATTACTCCTACGGT 607 

I I I I I I I I III I I I I I II I I I I I III 



Db 143 7 CGTTCTACA-GGAACAGAACAGAAATTGGCTGGGCCCTGGGCCCAGAGGAGGCCT — 

GGA 13 81 



Qy 608 TTCGCCTAC — ATCCTGGCCTGGGTGGCCTTCC-CCCTGGCCCT 

GCGGT 658 



Db 13 8 0 GCCGCTTACTGTTCAGTGCCAG- 

CTTCATCCTTGGCACTCTGCCTCTCACACAG- 132 8 

Qy 659 GTCATCTATGTGATCTTGCGGAAACGCGAATGAGGC-GCC-CAGACGGTCTGT — 

CTGAG 714 

I I I I I I II III III I I I I I I I I I I I I I I I 
Db 132 7 — CATCGCTCCGG-CTCCCAGAGCC-C-ACAGTGGCAGCCGCTGGAGG- 

CTGTGGCAGCA 12 74 

Qy 715 GC T - C T G AGC G T AC AT AGGGAAGGGAGGAAGG 

GAAAACAGAAAG 75 7 



Db 1273 GCTGCTGCCCTGCCCAAGG — 

AGGGATGTGGGCTTGGCTCTGACAGTTCAAGCCAGAAAG 1216 

Qy 75 8 CAGACAAAGA 

AAAAAGAGCTAGCCCAAAATCCCAAACTCAAACCAAACCAAACAGA 813 



Db 1215 - 

AGCCTTAGAAGTCATTCAGGACATTCCCATTTTACAGATGATGAAATAAGGCTCAGAGA 1157 
Qy 814 

AAGCAGTGGAGGTGGGGGTTGCTGTTGATTGAAGATGTATATAATATCTCCGGTTTATAA 8 73 
I I I I I I I I I I I I I I III II I I I I 

Db 1156 

GAGCAGGGGCTTTCAAGAACAATAAAGGTGGAGACTGGAACTCAGGCCTCCTGTCTGCTG 109 7 
Qy 874 A 

ACC T AT T T AT AAC AC T T T T T ACATATATGTACATAGTATTGTTTGC T T T T T 92 5 

I I I I I I I I I I III I I I I I I 

Db 1096 AGGGTGTCTGCCCAT 

GCTCTCTCTCAGAGTTTCTGAGTGGTGGTGTCTGGAGGGG 10 42 

Qy 926 ATGTTGACCATCA GCCTCGTGT 

TGAGCCTTAAAGAAGTAGCT 96 7 



Db 10 41 CTG — GACACGCAAGGGAGCTGGCCTCGTGTGTGTCTGTCTGTGCC 

GGGATGTGTCT 987 

Qy 968 

AAGGAACTTTACATCCTAACAGTATAATCCAGCTCAGTATTTTTGTTTTGTTTTTTGTTT 102 7 



Db 9 86 GTGG — CTT CTGTC — TCTGAACGTGC-CAGGGGTTTTG 

TGTCCTGTGTGT 941 

Qy 102 8 GTTTGTTT-TGTTTTACCCAGAAATAAGATAACT 

CCATCTC G 1068 



-TCTCA- 
I I I I I I 



Db 9 40 GTCTCTCTGTGGGTGCTCCTGTGTGCAGG- 

AACTTGGGGCCATGGCCAGAGAGGGATGGG 882 



Qy 1069 CCCCTTCC — CTTTCAT 

CTGAA — 1088 

II I I I I I III 

Db 881 

CCTATTCCTACAATCTTGGGAGGCCTCCACAGGCTGAGCTGGTGGGGAGGGTCCTGAAGG 82 2 

Qy 10 89 AGAAGAT ACCTCCCTCCCAG 

TCCA 1112 

I I I I I I III I I I I I I I III 

Db 821 

GTAGAGTGAAGATGGGGACTTTACTCCCAGCGGAGAGGAAGAAACCAGAAGATAGGCTTG 762 

Qy 1113 CCTCATTTAGAA AACCAAAGTGTGGGTAG 

AAACCCCAAATG 1153 



Db 761 

GGGCATGGATGGGAAAGGTCTTGGAACTAGGATGGGGCAAGGGAAACAAAGCCCCAAATG 7 02 

Qy 1154 TC-CAAAAGCCCTTTTCTGGTGGGTGACCCAGTGCAT — 

CCAACAGAAACAGCCGCTGCC 1210 



Db 701 GAGTATGGGTCC CTGGAAGTTGACACACT-CTTAGCCCTCATGGG-GGCC- 

CAGCC 6 49 

Qy 1211 CG AACCTCTGTGT 

GAAGCTTTACGCGCACACGGACAAAATGCCCAAACTG 1260 



Db 6 48 CAGTGGGTAACCCCGGGGAAGTGGTCACATGACGTG-ACAC 

AGCT- 600 

Qy 1261 

GAGCCCTTGCAAAAACACGGCTTGTGGCATTGGCATACTTGCCCTTACAGGTGGAGTATC 132 0 
II I I I I I III I II I I I I I I I I I I I I I I I 

Db 599 GATTGCTTGAGATCCCCCAG — TGTGGCCTGGGC CTTGTCCTT 

GAGTG — 554 

Qy 1321 

TTCGTCACACATCTAAATGAGAAATCAGTGACAACAAGTCTTTGAAATGGTGCTATGGAT 13 8 0 



Db 553 — AGCCCAGGTTCAAGCCAGGAAGGCTG-GCCAGC-TGCCCAAGGACTGAGGCCTT — 

CT 500 

Qy 13 81 TTACC-ATTCCT TAT T ATC AC T AAT CATC- T AAAC AAC T C AC T GG — 

AAATCCAATT 1433 



Db 499 

CCTCCTCCTCCTCTGTGTCCTCCTCGATCATCTTGGCCAGCTCCCGGGGCAGCTCCACGT 4 4 0 
Qy 1434 

AACAATTTTACAACATAAGATAGAATGGAGACCTGAATAATTCTGTGTAATATAAATGGT 1493 
I I I I I I III I I I I I I I I I I 

I I I 



I I I I 
-TTCTC- 



Db 439 CTCCACCTGCCATCTGGATCTGGCTC — TCATCTGTCTCGTTCTTGG 

GTAGCCGGT 3 86 



Qy 149 4 TTATAACT — GCTTTTGTACCTAGC TAGGCTGCTAT- 

TATTACTATAATGAGTAAA 1546 



Db 3 85 ATTTGTCTCGGAAGTGGCTCC — GCAGTGTGGCCCGCTCTGCCTTCCT 

CTGTGTGAA 331 



Qy 1547 

TCATAAAGCCTTCATCACTCCCACATTTTTCTTACGGTCGGAGCATCAGAACAAGCGTCT 16 06 



Db 33 0 CTGTGCATCC CGCTCCATC — TTCTCTTCCACGAGTTGCTTCTG 

ATACTCCT 281 

Qy 16 0 7 AG-ACTCCTTGGGAC-CGTGAGTTCCTAGAGCTT-GGCTG- 

GGTCTAGGC 1652 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 280 CGTACTCCTCCCGGCTCATG 

CCCTGAGCTTCGGCTGCCGACTTGTCCCCATCTCCC 22 5 

Qy 1653 TGTTCTGTGCCTCCAAGGACTGTCTGGCAATGA 

CTTGTATTGGCCACCAACTG 1705 

I II I I I I I I I I I II I I I I I III I I I I I 

Db 22 4 TTATCCTCGCCGCCTCCCAGG 

CTCCCAGTGAGGTTCTTCAGCTGGCCGCCCACCA 170 

Qy 1706 TAG ATGTAT-ATATGGT GCC 

CTTCTGATGCTA- 1736 



Db 169 

TGGTCTTCACCATGAACGCCATGGTCTTCGCCGGCCGGGGCACCGGGCGCCTGCTACCAC 110 

Qy 173 7 AGACTCCAGAC-CTTTTGTTTTTGCIT- 

TGCATTTTCTGATTTTATACCAACTGT 1789 



Db 109 GCGCCAGGCACCGGGCACGTCCGCGCCTACTTCAGCA-CTCCAG 

TAGGCCAACAGT 55 

Qy 179 0 GT GG AC T AAGATGC AT T AAAAT AAAC AT C AG -AG T AAC T C 182 8 

I III I I I II I I I I I I I I I I I I 

Db 5 4 GCTCGAGGAATTCCGCAGCTTTTAG AGCAGAAGTAACAC 16 
Qy 3 GTTACAGGGAGCAC CACCAGGGAAC

ATCTCGGGGAGC 39 

I I I I I I I I I I I I I I II II 

I I 

Db 2198 

GCGACCTGCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTATGGTGC 2139 

Qy 40 CTGGTTGGAAGCTGCAGGCTTAGTCTGTCGGC — TGCGGGT — 

CTCTGACTGCCCTGTG- 9 4 

I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 213 8 CAATTTTAAA TAGTTTTATTT- 

AAGACATTGCATTTTCCACTTACAATACAGTGT 2085 

Qy 95 GGGAGGGTCTTGCCTTAACATCCCT- 

TGCATTTGGCTGCAAAGAAATCTGC 144 

I I I I I I I I I I I I I I I I I I I I I III III 

Db 2084 TTATAAAGTGCAATGTTATTTCCTTC CCCTGTGCATATG — TTCCA — 

TATTCAAG 2 033 

Qy 145 

TTGGAAGAAGGGGTTACGCTGTTTGGCCGGGCAGAAACTCCGCTGAGCAGAACTTGCCGC 2 0 4 

I I I I I III I I I I I I I I I I I I I I I I 

Db 2 032 TATTGAGAA TGCC CAGTAACTTACTATAGCAG — 

CTTAACTT 1993 

Qy 205 CAGAATGCTCCTCCTG TTGCTGAGTATCATCGTCCTCCACG 

TCGCGGTGCTG 256 

II III II I I I It III I I I I I I I I III 

I I 

Db 1992 TTTAAAACTGCCACAGAATTTGCTACGAATTTAGGTCCTTCAAATGTTTTAAATGTG- 

TG 1934 

Qy 257 GTGC — TGCTGTTCGTCTCCAC — GATCG TCAGCCAATG- 

GATCGTGGGCAATG 3 05 

I I III I I I I I I I I III I I I I I I I I I I I I I I 

I I 

Db 1933 GAACAATGC — 

TACATCTACACTTGGTTGGCTTAATCAACCTCTTCAATGGTGGGCCCTG 18 76 

Qy 306 

GACACGCAACTGATCTCTGGCAGAACTGTAGCACCTCTTCCTCAGGAAATGTCCACCACT 365 

I I I I II I II I I I I I I I I I I I I I I I I I I I I I 



Db 18 75 AGGAAGCAC C AC C AGAGGG A — GGAGCTCCAC — CACCAGGAAA — 

TCCCCCAGG 182 7 

Qy 366 

GTTTCTCATCATCACCAAACGAATGGCTGCAGTCTGTCCAGGCCACCATGATCCTGT 42 2 

III III I I I I I I I I I II I I I I I I I II 

Db 1826 CATTC CTCCTGGC — ATGCCTCCTG- 

CACTCTGGTACAGCTTGGTGATGATGG 1777 

Qy 423 CGATC-ATCTTCAGCATTCTGTCTCTGTTCCTGTTC 

TTCTGCCAACTCTTC- 472 

III I I I I I I I II I I I I I I I I I I I I I I I I I I III 

I 

Db 1776 GGTTGCAAACTTTCTCCAGC — TC — 

TTTCTGTTGATGTTCAAATTCTTCCTTCTCAGCA 1721 

Qy 4 73 ACCCTCACCAAGGGGGGCAGGTT TTACAT — 

CACTGGAATCTTCCAAATT 52 0 

I I I I I I I II III I I I I I I I I I I I I I I I 

I 

Db 172 0 GTCTGATTCTTATCAA GCCAGTTGATAATTTCATTACACTTG 

TCCAGAAT 16 71 

Qy 521 CTTGCTGGT CTGTGCGTGATGAGTGC — TGCGG CCATCTAC- 

ACGGTGAGGC 569 

I I I I I I I II I I I I III II I I I I I I I I I I I 

Db 1670 CTT- 

CTGTTTGTCCTCATCGTTAATCTTGCCTTGAAGTTTCTCATCTTCAACAGTTGCTT 1612 

Qy 570 ACCCGGAG-TGGCAT CTCAA CTCGGATTACTCCT 

ACGGTTTCG 611 

I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1611 

TCATGTTGAAGGCATAGGACTCAAGTGAATTCTTGGATGACACCTTGTCCCTCTGCTTCT 1552 

Qy 612 C CTACATCCTGGCCTGGGTG GCCTTCC 

CCCT 642 

I I I I I I I I I I I I I I I I I I 

I 

Db 1551 

CATCTTCAGCTTTGTACTTCTCAGCTTCCTGGACCATACGTTCAATGTCTTCCTTGCTCA 1492 

Qy 6 43 GGCCCTTCTCAGCGG TGTCATCTAT GTGATCTTG 

CGG 679 

I I I I I I I I I I I III I I I I II I I I I I I 

I 

Db 1491 

AACGGCCCTTGTCATTAGTGATAGTAATCTTGTTCTCTTTTCCCGTACTCTTGTCCACAG 1432 

Qy 680 AAACGCGAATGAGGCGCCCA — GACGGTC — TGTC TGAGGC TCTGA- 

GCGTA 726 

I I I I I I I III II II I I I I III I I I I I I 

I 

Db 1431 C AG AG AC AT T GAG TAT AC CATTGGC- 

AT C AAT G T CAAAAG T G AC T T C AAT C TGAGGAAC A 1373 

Qy 72 7 CATAGGGAAG— GGAGGAA GGGAAAACAGA AAGCAG 

ACAAA 765 

I I I I I I I I I I I I III III I I I I I I I 



Db 1372 

CCTCGGGGTGCAGGAGGTATGCCTGTGAGTTCAAACTTGCCAAGCAGGTTGTTATCCTTT 1313 



Qy 766 GAAAAAAGA-GCTAGCC — CAAAATCCCAAACTCAAACCAAACCAAACAGAAAG- 

CAG— 819 

I I I I I I I I I I I I I I I II I I I I I I I I II I 

Db 1312 GTCATGGCACGCTCGCCTTCATAAACCTGAA 

TAAGCACACCAGGCTGGTTGTCAGAA 1256 

Qy 82 0 -TGGAGGTGGGGGTTGCTGT 

TGATTGAAGATGTATATAATATCTCCGGTTTATAAAA 875 

II III III I I I I I I I I I I I I I I I I I I I 

Db 12 55 TAGGTAGTGAAGGT — CTGTGTCTGCTTGGTAGGAATGGTGGTAT- 

TACGCTTGATGAGG 1199 

Qy 8 76 CCTATTTATAACAC TTTTTACATATATGTACATAGTATTGTT 

TG 919 



Db 1198 AC- 

AG T C AT G AC T C C AC C AGC AG T T T C AAT AC C AAGGGAAAG AGGAG T G AC AT C C AAG AG 114 0 



Qy 92 0 CTTTTTATGTTGACCA 

TCAGCCTCGTGTTGAGCCTTAAAGAAGTAGCTAAGGAACTT 9 76 

I I I I I I I I I I I I I I I I I I III I I I I I III 

I I 

Db 1139 CAGCAAATCTTGAACATTCTCAGACTTGTCTCCAGAC AAGATG— GCT- 



Qy 9 77 TACATCCTAACAGTATAATCCAGC 

TCAGTATTTTTGTTTTGTTTTTTGTTTGTTTG 1032 

I I I I II II I I I I I I I I II I I I I 

Db 1088 GACAGCTGCACCATAAGCAACAGCTTCATCAGGGTTGATGCTCT- 
TATTCAGTT 1036 



Qy 1033 TTTTGTTTTACCCAGAAATAAGATAACTCCATCTCGCCCCTTCCCTTTCAT- 

CTGAA 1088 

I I I I I I I I I I II I I I I I III 

Db 1035 CTTTTCCATTGAAGAAGT 

CTTGGAGAAGCTTCTGAATCTTGGGGATACG 987 

Qy 1089 

AGAAGATACCTCCCTCCCAGTCCACCTCATTTAGAAAACCAAAGTGTGGGTAGAAACCCC 114 8 
I I I I I I I I I I II III I I I I I I I I I I 

I 

Db 9 86 AG TAG A- 

ACCACCAACCAGGACAATATCATGAATCTGTGACTTGTCTAGTTTGGCATCTC 92 8 

Qy 1149 AAATGTCCAAAAGCCCTTTTCTGG TGGGTGACC 

CAGTGC 1187 

I I I I I I I I I I I I I I I I I 

I 

Db 92 7 GAAGGGCTTT 

CTCTACTGGGTCCAGGGTGCCACGGAACAGGTCAGCATTCAGTTC 8 73 

Qy 1188 ATCCAA-CAGAAAC AGCCGCTGCC 

CGAA 1214 

I I I I I I I I I I I I I I I 



Db 872

TTCAAATCGGGCACGGGTAATGGAGGTATAGAAGTCGATTCCTTCATAGAGAGAATCGAT 813

Qy 1215 C CT CTGTGTG AAGCTTTACGCG — CAC 

ACG 1242 



Db 812 

CTCAATACTGGCCTGGGTGCTGGAAGAGAGGGTACGCTTAGCACGTTCACAAGCAGTACG 753 

Qy 12 43 GA CAAAATGC CCAAACTGGAGCCCTTGCAAAAACACG- 

GCTTGT 1285 

II I I I I I I I I I I I I I I I I I I 

Db 752 GAGGCGTCTTACAGCTCTCTTGTTCTCACTGATGTCCTT- 

CTTATGCTTGCGCTTAAACT 694 

Qy 12 86 -GGCATTGGCATACTTGCCCTTACAGGTGG-AGTATCTTCGTCACACA 

TCTA 1335 



Db 693 

CAGCAATAAAATGGTTGACCATTCGGTTGTCAAAATCTTCTCCACCCAAGTGGGTGTCTC 63 4 

Qy 1336 AAT G AG AAAT C AG T G AC AAC AAG T C T T T G A 

AATGGTGCTATGGATTTACCATTCC 139 0 



Db 633 CAGCTGTAGACT-TGACCTCAAAGATTCCATCCTCAATAGTG 

AGGATTGACACATCA 578 

Qy 1391 TTATTATCACTAATCA — TC TAAACA 

AC T CAC T GG AAAT C C AAT T AAC AAT 1439 



Db 577 AAAGTGCCACCTCCCAGGTCAAAGATGAGCACGTTTCTTTCTG CTCCA 

ACCTTT 524 

Qy 1440 TTTACAA CAT AAG— AT AG 

AATGGAGACCTGAATAATTCTGTGTAATAT- 1486 



Db 523 

TTGTCTAAGCCGTAAGCAATAGCAGCAGCAGTTGGCTCATTAATAATTCTAAGTACATTG 46 4 

Qy 148 7 AAATGGTTTATAACTGCTTTTGTA-CCTAGCTAGGCTGCTA- 

T TAT T AC TATA 153 7 



Db 463 AGACCAGCAATAG-TTCCAGCATCTTTGGTAGCCTGAC 

GC T GAG AG T C AT T AAAG T A 40 8 

Qy 153 8 A T G AG T AAAT CAT AAAGCCTTCATCA- 

CTCCCACATTTTT 1576 



Db 407 

AGCTGGCACTGTGACCACAGCATTGGTAACAGTCTTCCCAAGGTAGGCTTCTGCAATTTC 3 4 8 

Qy 1577 CTTAC G G T C G G AGC AT C AG AAC AAGC G T C TAGACTCCTTGGG- 

AC 1620 

III I I I I I I I I I I I III I I I I I I I I I 



I 

Db 347 

CTTCATCTTTGTCAGAACCATAGAAGACACCTCCTCTGGATAGAAGCTTTTGGTCTCTCC 2 8 8 

Qy 1621 CGTGAGTTCCTAGAGCTTGG — CTGGGTCTAGGCTG-TTCTGT-GCCTCC 

AAGGACT 16 73 



Db 287 CTTGTATTC 

TACTTGGACCTTGGGCCTGCCAGCATCATTCACCACCATAAAGGGC- 233 

Qy 16 74 GTCTGGCAATGACTTGTATTGGCC — 

ACCAACTGTAGATGTATATATGGTGCCCTTCTGA 1731 



Db 232 

CAATGTTTCATATCAGACTGGACAACAGCATCATCAAATCTGCGTCCAATCAGA 179 

Qy 1732 TGCTAAGACTCCAGACCTTTTGTTTTTG 

CTTTGCATTTTCTGATTTTATAC 1782 

II I I I I I I I I I I I II I I I I I I I I I I I 

Db 178 CGTTTGGCATCAAAAAC-TGTGTTGGTGGGGTTCATTGCAACT — 

TGATTCTTTGCGGCA 122 

Qy 1783 CAACTGT GTG 

GACTAAGAT 1801 

I I I I I I III I I I I I I 

Db 121 

TCACCGATCAACCGTTCAGTGTCCGTAAAGGCGACATAGCTTGGAGTGGTTCGGTTTCCC 6 2 

Qy 18 02 G C AT T AAAAT AAAC AT CAGAGTAACTC 182 8 

I I I I I I I I I II I I I I 

Db 61 TGATCATTGGCAATTATCTCGACTTTTCCGTGCTGGAAAACAC 19 
Query Match 38.1%; Score 4134; DB 1; Length 2495;



Best Local Similarity 42.4%; Pred. No. 3.1e-270; 
Matches 1084; Conservative 0; Mismatches 672; Indels 798; 
Gaps 158; 



Qy 1 CAGTTACAGGGAGCAC 

CACCAGGGAAC 2 7 

II II I I I I I III II I 

Db 2488 

CCGCGACCTGCAGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTACATGTTTAAATA 2 42 9 

Qy 2 8 ATCTCGGGG AGCC-TGGTTGG 

AAGCTGCAGGCTTAGTC 6 4 

I I I I I I I I I I I I I I III I I I I I 

Db 2428 

TATTTTATTTCAAGTTAATATAACAGCCATGATTTGTTCATTTAAGTGAAATGCTTCG-A 23 7 0 

Qy 65 TGTCGGCTGCGG-GTCTCT GACTG 

CCCT— 91 

I I I I I I I I I I I I I I I 

Db 2369 

TAAAAACCCCGGTGTCACTTGGTTAAAATATTGGAGACAGATTAAATGTTAAGCCACTAT 2310 

Qy 92 -GTGGGGA GGGTCTT — GCCT-TAACA — TCCCTTGCATTTG- 

GCTGCAAAGAAAT 140 

I I I I I I I I I I I I I I I II I I I I I I II I I I I I 

Db 23 09 AGTAGTGACTCTGGTTCTTTAGACTATTGCATGTACCTTGTTAATGAACTAC- 

AGGAATG 2251 

Qy 141 CTGCTTGGAAGAAGGGGTTACGCTGT-TTGGCCGGGCAGAAAC 

TCCGCTGAGCAG 19 4 

I III I I I I II II I I I I II III I I I I I 

Db 2250 C AC AT T G AT AT AAT G CACAATGTCTTCACATGTCTCAATCCTCATTCCTCT — 

TCCT 2196 

Qy 195 AACTTGCCGCCAGAATGC TC 

CTCCTGTTGC 224 

I I I I I I I I I I I I II I I I I I I I I 

Db 2195 TTCATGCCTCCCTAATGCCTCAGTATCCAATATAAAACTACAAAAACTACTGTT- 

CTTTT 2137 

Qy 225 TGAGT ATCATCG — TC CTCCAC- 

GTCGCGGTGCTG 256 



Db 2136 CGTTTGAGTGGTATTTAATTAAAACTTCAATTCTATGTTATTTCCACAGTC- 

CAGTAATT 2 0 78 

Qy 257 GTGCTGCTGTTCG TC TCCACGATC 

GTCAGCCAATGGATC 295 



Db 2 0 77 TATTTTAAATTTGAGTAATTTCAAATTCCACAAACAAAACTGAAGAACAG- 

CAATATTTT 2 019 



Qy 296 

GTGGGCAATGGACACGCAACTGATCTCTGGCAGAACTGTAGCACCTC 3 42 

II III III III III I I I I I I I I I I 

Db 2018 GTTTGAATT — 

TCTCTCTTCTG T AC AC T C AG T GAT C T AAAAC ACC AC AAT ATCC AAC AT A 1961 



Qy 3 43 TTCCTCAGGAAA TGTCCAC-CAC- 

TGTTTCTCATCAT C 378 



Db 1960 

CACAAACCTCAGGGAAGGGTTAGTAAACACACACAAGATTGGAATCATGGTGCTCTTTGC 19 01 

Qy 3 79 ACC — AAACGAATGG — CTGCAG TCTG— TCCAGGCCACCAT 

GATCCTG 421 



Db 1900 

TCCTGAATGGAATGGTCCCACAGAAAAAGCACAGGATACAGCACAACATAAGGGCACCTG 18 41 

Qy 422 TCGATCATCTTCAG CATTCTG TCTCTGTTCCTGTT — 

CTTCT 461 



Db 1840 T— 

TACATATGAAGTGAGCAAAACATACTAGCATTTTCTATATGCATAATGGGGAAACCT 17 83 

Qy 462 GCCAA CTCTTCACCCTCA 

CCAAGGGGGGCAGGTTTTACATCACTGGA 508 



Db 1782 

GCATAGGTTAGAGGGCCTTTTACGCTCATTTAAAAATCAGGCAAGTTGTCTGTAAC A 172 6 

Qy 509 ATCTTCCAAATTCTTG CTG — GTC TGTGCG 

TGATGAGTGCTG 548 

I I I I I I I I I I III II I I I I I I I I I I I I I 

Db 1725 

TTTTTCAATAATCTGGGAAGCACTGCAATCCAGTGTACGATGTGCTGATTAGTGTTGTCT 1666 

Qy 549 CGGCCA-TC-TACACGGTGAGGCACCCGGAGTG GCAT 

CTCAAC 589 



Db 1665 TTGTTGGCACTGACAGTCTTGCATGGTCATATGCC 

AGTGTTTTTGCTTTAGCTTTTC 1609 

Qy 59 0 TCGGATTACTCCTACGGTTTCGCCTACATCCTGGC 

CTGGGTGGCCTTCCCCCT 642 



Db 16 0 8 TTTGAATA 

AAACAGATTTAAATGCATTTAGACAATACGTATTTGTAAGCTGTTTACA 1552 

Qy 6 43 GGCCCT TCTCAGCGGTGTCA 

TCTATGTGATCTTGCGGAAACGCGAA 688 



Db 1551 TACACTAAATTTAAGAGAT-TCAATATTAGAGTTTCTTTGT-TTCTT 

TAAACACTAC 149 7 

Qy 6 89 TGAGGCGCCCAGACGGT CTGTCTGAGGCTCTGAGC- 

GTACATAGGGAAGGG 73 8 



Db 1496 AGA-GTGCAAATCAGGTTCTTCACAACAGATTGAATAT- 



T G AGC AG T T C T T T AAAG AAAG 1439 

Qy 739 AG G AAG G G AAAAC AG AAAGC AG AC AAAG AAAAAAG AGC - 

TAGCCCAAAATCCCAAACTCA 79 7 



Db 143 8 AGG — GGGAAGAAAAAAAGC 

CCAAGTGAATAAAACATTGAAACTATTCCCCTTCGAA 1384 

Qy 79 8 AACCAAACCAAACAGAAAGCAGTGGAG-GTGGGGGTTGCTGTTG — 
AT TG AAG AT G 850 



Db 13 83 AATAAATTCTAA 

AAT G AT G T GG AAT G T GAAATAAGGT T T T AAC AT AGGTGAT C C AAG 132 7 

Qy 851 TATATAAT ATC TCC GGTTTATAAAA- 

CC 877 



Db 1326 

T T TAT AG T T AG AAAC AAAAAG AAG T C C T T C ATGAAAT AAAGG T T AC AAG AAC AC G T T GC C 12 6 7 

Qy 8 78 TATTTATAACACTTTTTACA TATATGTAC 

ATAG 910 



Db 1266 TGTTT- 

TCCCCCATTATAAACTGAGAAGTGGGTAAAGACGATGTTTCAGTACGAAAATAG 12 0 8 

Qy 911 TATTGTTT GCTTTTT ATGTTG-ACCATCAGCC- 

TC 943 



Db 1207 

GTGACTACAGGATCAGCGCTTCATCTCACATGCTGTACCCAAAGCCAGGCTGTGGCTGTC 114 8 

Qy 9 44 GTGTTGAG CCTTA AAGAAGTAGCTAAGG 

AACTTT — 977 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1147 

CATACGGTGGTGCGGTATAACCATAACCAAAAGGTGCCTGGGGAGGGACGGCAACACTGG 10 8 8 

Qy 9 78 -ACATCCTAACAGTATAATC CAGCTCAGTATTTTTGTTTTGT-TTTTTGT 

TT 1027 



Db 1087 

GTCCTGCTGTCAGCATCAACTGGGGCTGACCATAAACAATGGGTTGTGTCTCTGTAGCTT 102 8 

Qy 102 8 GTTTGTTTTGTTTTACCCAGAAATAAGATAACTCC — ATCTCGCC- 
CCTT C 1075 



Db 1027 GTTCTTCTTCTTTT- 

CTCAGTGATTCTGAAGCATCTAATTTATCCACCTTTGTCAAGTAC 969 

Qy 10 76 CCTTTCAT CTGAAAGAA GATA 

CCTCCC 1102 

I I I I I I I I I I I I I I I I I I I I I I 



Db 968 

TCCTTCATGACCTGGATGAAATAGGGCATGGCAAAATCCATGATATTGTGCCTCCATGCA 9 09 

Qy 1103 TCCCAGTCCACCTC ATTTAGAAAA 

CCAAAG 1132 



Db 908 

GTTTCTAGGACGACATCTGGCCTTAAAAGATCGTAACAGGTAAACAGACAAGCTCCAAAG 8 49 

Qy 1133 TGTGGGTAGAAACCC CAAATGTCCAAAAGCC — 

CT 1165 



Db 848 

CACTCTCTTTTTTCTTCCTGCAAAAACCACTGCAGGAGTTCTTCAGCCAATTCAGTATCT 7 89 

Qy 1166 T TTCTGGTGGGTGACCCAGTGCATCCAACAGAAACAGC 

CGCTGCC 1210 



Db 78 8 T TAG AT T C AG AAGC AT AC T GC AT TGC AT C C T — 

TGTAAAGGCTGTCTTTCTTGCACAGCT 731 

Qy 1211 CGAACCTCTGTGTGAAGC 

TTTACGCGCACACGGACAAAATGCCCAAACT 1259 



Db 730 

CTACACTCTGTTTCCAGCGATTGTTGCCTTTGAAGAGATAAGCAGCAATTCTCCTGAACT 6 71 

Qy 126 0 GGAGCCCTTG CAAAAACACGGCTT — 

GTGGCATTGGCATACTTGCCCT 13 05 

III III I I I I I II II I I I I I I I 

Db 670 

CAATGAGTTCATGTTTTTCCAAACGCTGAGCAAGCGAGATATTGTCAAAGTTGTCATAAG 611 

Qy 13 06 TACAGGTG 

GAG TATCTTCGTCACACATC T AAAT G AG AAAT C AG T 1349 



Db 610 CATCTATTGATGTTCGCAGAGCCTGATAATCTTCTTCTGTAAT- 

AAAAAGATTGTTCAAT 552 

Qy 135 0 GA— CA ACAAGTCT-TTGAAATGGTGC 

TATGGATTTACCA— 13 86 

II II III I I I I I I I I I I I I I I I I I I I I 

Db 551 

GATTCATTCACAGATTTGTTGTTATGGTTCTGAACTGAACGCAAATACGGTTTCACCAGT 492 

Qy 13 8 7 — TTCCTTATTATC ACTAATCATCTAAAC AACTCACTG 

GAA 1425 



Db 491 

GGTAGCTGTTTAACCTTGCTGAAATAATTGACTGCACGAGTGTGATCCAACCGTGGAGAC 432 

Qy 142 6 ATC-CAATTAACA AT T T T AC AAC AT AAG AT AG AAT 

GGAGACCTGAATAATT 14 75 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 



Db 431 AGC AC CAT C AGC AAATC AT T T AAC AAC AGAGGC T TGAAT T C T AAG T AG AAC T G 

TATTG 3 74 



Qy 14 76 CTGTGTAATATAAATGGTTTATAACTGCTTTTGTAC — CTAGCTAGGC — 

TGCTATTATT 1531 

I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 373 

CTCTGTAGTATAGTTCCACATTGGCAACCTTGGTAATGATATCTTTGAATTGCCCTTCTT 314 

Qy 1532 AC TATAATGAGTAAATCAT-AAAGCCTTCAT — 

CACTCCCACATTTTTCTTAC- 1581 

I I II I I I I I I I I III III I I I I I 

I I I 

Db 313 

TCCAGGCATCAGTTGGATGATTCATCATGGTAATTATGGCATTATCATATTCTTCATACT 2 5 4 

Qy 1582 GGTCGGAGCATCAGAACAAGCGTCTAGACTCCTTGGGACCGTGAGTTCC- 

TAGAGCTTGG 16 40 

I I I I I I I I I I I I I I II I I I I I I I 

I 

Db 2 53 TGTCATA-CAAAAACACCAGT-TCTG 

CCCAAAGATGAGCTTGTTCTGCAGCTCTTAG 199 

Qy 1641 C TGGG TCTAGGCTGTTCTGTGC — 

CTCCAAGGACTGTCT 16 77 

I I I I I I I I I I I II I I I I I I I II II 

Db 19 8 CACCTTGGGAATATTCACTCTAGAC 

CAGAACAGCTCCAGGTGCTCCCTCATTTTCT 143 

Qy 16 78 — GGCAATGACTT 

GTATTGGCCACCAACTGTAGATGTATATATGGTGCCCTTCTGAT 1732 

III III I I I I I I III I I I I I III 

I I I 

Db 142 GAGGCTTAAATTTAGAGTATAG AAT AGC T AAT T C AG T AAAC A 

TTCCCATGTGA- 9 0 

Qy 1733 

GCTAAGACTCCAGACCTTTTGTTTTTGCTTTGCATTTTCTGATTTTATACCAACTGTGTG 1792 
III I I I I I I I II I I I I I I I I I I I I I 

I 

Db 89 GCTC— GCTCAAGTCCCAGTG CTGCTTCCAACATGGTGATCAGCT — 

CTTCAAAATA 3 7 

Qy 1793 GACTAAGATGCATTAAAATAAACATCAGAGTAAC 1826 

I I I I I I I I I I I I II II III 
Db 36 GCC-ACGAT-CCTGATAGTAGCCAGTAGGCCAAC 5 



RESULT 12 
US-10-478-914-17 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 



; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775

; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17

; LENGTH: 2198
; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-17 

Query Match 37.7%; Score 4097; DB 1; Length 2198; 

Best Local Similarity 45.1%; Pred. No. 5.9e-268; 
Matches 1026; Conservative 0; Mismatches 664; Indels 586;
Gaps 150;

Qy 1 CAGTTACAGGGAG CACCACCAGGGAACATCTCGGGG 

TGGTT — G 46 



Db 5 CTGGTCTTGTGTGGGTGTTTTCCAGCACGGAAAAG- 

TCGAGATAATTGCCAATGATCAGG 63 

Qy 4 7 GAAGCTG CAGGCTTAG-TCTGTCGGC — TGCGGGTCTCTGAC 

TGCCCTGTGGG 96 

I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 6 4 GAAACCGAACCACTCCAAGCTATGTCGCCTTTAC- 

GGACACTGAACGGTTGATCGGTGAT 122 

Qy 9 7 G AGGGTC TTGCCTT-AACATCCCTTGCA 

TTTGGCTGCAAAGAAATCT 142 



Db 123 GCCGCAAAGAATCAAGTTGCAATGAACCCCACCAACACAGTTTTTGATGCCAA- 

ACGTCT 181 

Qy 143 GCTTGGAAGAAGG GGTTACGCTGTTTGGCCGGGC 

CC 185 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 182 GATTGGACGCAGATTTGATGATGCTG- 

TTGTCCAGTCTGATATGAAACATTGGCCCTTTA 2 40 

Qy 186 GCTGAGCAGAACTTG CC 

GCCAGAA 2 09 



Db 2 41 TGGTGGTGAATGATGCTG- 

GCAGGCCCAAGGTCCAAGTAGAATACAAGGGAGAGACCAAA 2 99 

Qy 210 TGCTCC— TCCTGTTGCTGAGTATCATC GTCCT — 

CCACGTCGCGGTGCTGGTGCT 261 



Db 300 AGCTTCTATCCAGAGG — AGGTGTCTTCTATGGTTCTGACAAAGATGAAG- 

GAAATTGCA 356 



AGCC— 

III III 



-AGAAACT 

I I I I I II 



Qy 262 GCTGTTCGTCTCCACGATCGTCAG

CCAATGGATCGTGGGCAATGGACACGC 312

I I II I I III I I I I I I I I I I I I I I 

Db 357 GAAG — CCTACCTTGGGAAGACTGTTACCAATG — 

CTGTGGTCACAGTGCCAGCTTACTT 412 

Qy 313 AACTGA-TCTCTG GC-AGAACTGTAGCACCTCT- 

TCCT 347 



Db 413 

TAATGACTCTCAGCGTCAGGCTACCAAAGATGCTGGAACTATTGCTGGTCTCAATGTACT 4 72 

Qy 3 48 CAGGA AATGT-CCACC ACTGTTTCT — 

CATCATCACCAAACGAATGGC 392 



Db 473 

TAGAATTATTAATGAGCCAACTGCTGCTGCTATTGCTTACGGCTTAGACAAAAAGGTTGG 532 

Qy 393 TGCAGTCTG-TCCAGGC-CACCATGATCCT GTCGATCATCTTCAG 

CATTCTG 442 

I I I I I I I I I I I I III III III I 

Db 533 

AGCAGAAAGAAACGTGCTCATCTTTGACCTGGGAGGTGGCACTTTTGATGTGTCAATCC- 591 

Qy 443 TCTCTGTTCCTGTT— CTTCT GCCAACTCTTC 

ACCCTCACCAAGG-G 486 



Db 592 

TCACTATTGAGGATGGAATCTTTGAGGTCAAGTCTACAGCTGGAGACACCCACTTGGGTG 651 

Qy 48 7 GGGCAGGTTTT-ACATC — ACTGGAATCTTCCAAATTCTTGCTG- 

GTCTGTGCGTGATGA 542 



Db 652 G AG AAG AT T T T G AC AAC C G AAT GG — TCAACCATTTTATTGCTGAGTTTAAGCGC- 

AAGC 708 

Qy 5 43 GTGCTGCGGCCATCTACACGGTGAGGCACCCGGAGTG — 

GCATCTCAACTCGGATTACTC 600 



Db 709 ATAAGAAGGACAT C AG T GAG — AAC AAG AG AG C T G T AAG AC GCCTCCG — 

TACTG 759 

Qy 601 CT ACG- 

GTTTCGCCTACATCCTGGCCTGGGTGGCCTTCCCCCTGGCCCTTCTC 

II I I I I I I I I I I III II 

Db 76 0 CTTGTGAACGTGCTAAGCGTAC — CCTCTCTT- 

CCAGCACCCAGGCCAGTATTGA 811 

Qy 653 -AGCGGTGTCATCTATGTGATCTTGCGGAAACG-CGAATGAGGCGCCCAGAC 

GGTC 706 



Db 812 GATCGAT-TC-TCTCTATGA 

AGGAATCGACTTCTATACCTCCATTACCCGTGCCC 864 



— 652 

II I I I I I I I I II 



Qy 707 TGTCTGAGGCTCTGA --

GCGTACATAGGGAAGGGAGGAAGGGAAAACAGAAAGCAGACAA 764

I I I I I I I I I II III I III II 

Db 865 G AT T T G AAG AAC T G AAT GC T G AC C T GTTCCGTGGCACCCTG 

GACCC 910 

Qy 765 AGAAAAAAGAGCTAGCC — 

C AAAAT C C C AAAC T C AAAC C AAACC AAAC AGAAAGC AGTGG 822 

I I I I I I I I II I I II I I I I I I I I I I I I I I I I I 

Db 911 AGTAGAGAAAGC C C T T C GAG AT GC C AAAC T - AG AC AAG T C ACAGATT- 

CA 958 

Qy 823 AGGTGGGGGTTGCTGT — TGATTGAAGATGTATATAATATCTCC- 

GGTTTATAAAACCTA 8 79 

II III I I I I I II I I I I I I I III I 

I 

Db 959 

TGATATTGTCCTGGTTGGTGGTTCTACTCGTATCCCCAAGATTCAGAAGCTTC 1011 



Qy 880 

TTTATAACACTTTTTACATATATGTACATAGTATTGTTTGCTTTTTATGTTGACCATCAG 939 
I I II III I I I I I I I I I I I 

I I I I I 

Db 1012 TCCAAGAC TTCTTCAATGGA-AAAGAACTGAAT 

AAGAGCATCAA 1054 



Qy 9 40 CCTCGTGTTGAGCCT TAAAGAAGTAGCT 

AAGG AAC T T T AC AT C C T AAC AG T 99 0 

I I I I I I I I II III I I I I I I I I I I I I I I 

I 

Db 1055 CC-C-TGATGAAGCTGTTGCTTATGGTGCAGCTGTCCAGGCAGC 

CATCTTGTCTGG 1108 

Qy 991 ATAATCCAG-CTCAGTATTTT — TGTTTTGTT — 

TTTTGTTTGTTTGTTTTGTTTTACCC 1045 

II I I I I I I I I I I I I I I I I I I I I I III I I I I I 

I I I 

Db 1109 AGA— CAAGTCTGAGAATGTTCAAGATTTGCTGCTCTTGGATGTCACTCCTCTTT- 

CCC 1164 



Qy 10 46 AGAAATAAGATAACTCCATCTCGCCCCTTCCCTTTCATCTGAAAG- 

AAGATACCTCCCT- 1103 

I I I I I I I II II I I I I I I III I I I I I I I 

I 

Db 1165 TTGGTATTGA- 

AACTGCTGGTGGAGTCATGACTGTCCTCATCAAGCGTAATACCACCATT 12 23 



Qy 1104 CCCAGTCCACCTC 

ATT T AG AAAAC C AAAG T G T GGG TAG AAAC 1145 

I I I I I I II III I I I I I I 

Db 122 4 C C T AC C AAG C AG AC AC AG AC C T T C AC T AC C T AT T C T G AC AAC C — 

AGCCTGGTGTGCTTA 1281 



Qy 1146 CCCAAATGTCCAAAAGC CCTTTTCTGGTGGGTGAC — CCAGT 

GC 1187 

I I I I I I I I I I I I I I I I I I I I 

Db 12 82 TTCAGGTTTATGAAGGCGAGCGTGCCATGAC- 

AAAGGATAACAACCTGCTTGGCAAGTTT 1340 



Qy 1188 -ATCCAACAG — AAACAGCC-GCTGCCCGA ACCTCTGTGTGAAG 

CTTTACGC 1235 



II 1 1 1 1 III 1 1 1 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 

Db 1341 

GAACTCACAGGCATACCTCCTGCACCCCGAGGTGTTCCTCAGATTGAAGTCACTTTTGAC 14 0 0 
Qy 1236 GCACACGG 

ACAAAATGCCCAAACTGGAGCCCTTGCAAAAACACGGCTTGTG 12 86 

II III II I I I I I III I I I I I I I I I 

I 

Db 1401 ATTGATGCCAATGGTATACTCAATGTCTCTGCTGTGG 

ACAAGAGTACGGGAAAAG 1455 

Qy 12 8 7 -GCATTGGCATACTTGCCCTTA CAGGTGGAGTATCTTCGTCA 

CACATC 1333 

II I I I I I I I I I I I I I I I I III II 

I 

Db 1456 

AGAACAAGATTACTATCACTAATGACAAGGGCCGTTTGAGCAAGGAAGACATTGAACGTA 1515 

Qy 133 4 TAAATGAGAAATCAGTGAC-AACAAGTCT-TTGAAATGGTGCTATGG 

ATTTACCAT 13 8 7 

I I I I I I I I I I I II I I II III II II 

I I 

Db 1516 

TGGTCCAGGAAGCTGAGAAGTACAAAGCTGAAGATGAGAAGCAGAGGGACAAGGTGTCA- 15 7 4 

Qy 13 8 8 TCCTTATTAT C AC T AATC ATC T AAAC AAC TC AC T GG AAAT C — 

C AAT T AAC AAT T 1440 

III I I I I I III I II I I I I I I III II 

Db 1575 

TCCAAGAATTCACTTGAGTCCTATGCCTTCAACATGAAAGCAACTGTTGAAGATGAGAAA 163 4 

Qy 1441 T T AC AAC AT AAGAT AGA — ATGGAGAC CTGAATAATTCT 

GTGTAATATAAA 1489 

I I I I I I I I I I III III I I I I I I I I I I I I I I I I 

I I 

Db 1635 C T TCAAGGC AAGAT T AACGATGAGGACAAAC AGAA- 

G AT T C T GG AC AAG T G T AAT G AAA- 1692 

Qy 149 0 TGGTTTATAACTGCTTTTGTACCTAGCTAGGCTGCT 

AT TAT T ACT AT AA 153 8 

II I I I I I II II I I I I I I I I I III II 

I I 

Db 1693 TTATCAACTGGCTTGATAAGAATC- 

AG AC T GC T G AG AAGG AAG AAT T T G AAC AT C AA 1748 

Qy 1539 T GAG T AAAT C 

ATAAAGCCTTCATCACTCCCACATTTTTCTTACGGTCGGAGCATCAG 1595 

III I I I I I I I I I I I I I I I III I 

I I 

Db 17 49 C AG AAAG AG C T G G AG AAAG T T T GC AAC CCCATCATCACCAAGCTGT 

ACCAGAG 1801 

Qy 1596 AACAAGCGTCTAGACT 

CCTTGGGACCGTGAGTTCCTAGAGCTTGGCTGGGT 1646 

I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 18 02 TGCAGGAGGCATGCCAGGAGGAATGCCTGGGG GATTTCCTGG TGG- 

TGGAG 1851 



Qy 16 47 CTAGGCTGTTCT GTGCCTCC-AAGGACTGTCTGGCAATGAC 



TTGTATTGGCC 1697 



Db 1852 CTCCTCC-CTCTGGTGGTGCTTCCTCAGGGC 

C C AC C AT T G AAG AGG T T GAT T AAGC C 1907 

Qy 169 8 -ACCAACTGTAGATGTATATATGGTGCC 

CTAA 173 7 

I I I I I I I I I I I I II I II II II 

Db 19 0 8 AACCAAGTGTAGATGTA- 

GCATTGTTCCACACATTTAAAACATTTGAAGGACCTAAATTC 1966 

Qy 173 8 GACTCCAGACCTTT TGTTTT TGCTTT- 

GCATTT 1769 



Db 1967 

GTAGCAAATTCTGTGGCAGTTTTAAAAAGTTAAGCTGCTATAGTAAGTTACTGGGCATTC 2 02 6 

Qy 1770 TC TGAT 

TTTATACCA 178 4 

II III I I I I I 

Db 2027 

TCAATACTTGAATATGGAACATATGCACAGGGGAAGGAAATAACATTGCACTTTATAAAC 2 0 86 

Qy 1785 ACTGT GTGGACTAAG — ATGCATTAAAATAAAC ATCAGAGTAACTC 

1828 

I I I I I I I I I I II III I I I I I I I I I I I I I II 

Db 2087 ACTGTATTGTAAGTGGAAAATGCAATGTCTTAAATAAAACTATTTAAAATTGGCAC 

2142 



— CTTCTGATG 

I I I I I I I I I I 
Qy 3 GTTACAGGGAGCACCACCAG GGAACATCTCGGGGA-

GCCTGGTTGGAAGCT 52 



Db 11 GTTACT-TCTGCTCTAAAAGCTGCGGAATTCCTCGAGCACTGTTGGCCT- 

ACTGGAA — T 66 

Qy 53 GCAGGCTTAGTCTGTC — GGCTGC — GGGTCTCTGACTGC 

CCTGTGGGGAG 99 

II I I I I I I I I I I I I I I II I I I I I I I I 

Db 6 7 GCGAGCTGAG-CAGACAGGGCTGCAAGGAAATCTGGC- 
GCGGTTCAATACCT 116 

Qy 100 GGTCTTGCCT TAACATCCCTTGCATTTG GCTG- 

CAAAGAAA T 140 



Db 117 

CGTCTAGCCTGGGTTCCAGTATCTAATTTTTTTTTTGTTTTAACTGACAAACTCATTTCT 176 

Qy 141 CTGCTTGGAAGAAGGGGTTACGCTGTTTGG CCG GGC- 

AGAAACTCCGC 187 



Db 177 CTACTGGGAC- 

AGGATGCTGTGCTGGCTGGAAGTTCCATTTCTACAGCAAGAATCCTATC 235 

Qy 188 TG AGCAGAACTTG-CC GCCAGA-ATGCTCCTCCT- 

GTTGCT 225 



Db 236 

TGGAAACACAGAAGTTGTCCTCTAGCCACAGCAGCTCGAACTTTTTTGATTGTCGTTGCT 2 95 

Qy 226 GAGT ATCATCGTCCTCC 

ACGT 2 46 



Db 296 

GCTTTCTCCCATCACCCCCATCCCCTTTTGACAAAGATCCAACTGTAAAAAGTCTTACGT 355 

Qy 247 CGCGGTGCTGG — TGCTGCTGTTC 

GTCTCCACGATC 280 

I I I I I I I I I I I I I I II III II 

Db 356 

AACAGTTCAGGACTACTTCGGTTCTTTTACTGGGTAAGCACTTTCAATTTTTTTTTTTTA 415 

Qy 281 GTCAGCC AATGGATC-GT 

GGGCAATGGACACGCAACT 316 

I I I I I I I I I I I I I I I I I I I I III 

Db 416 ACTAAAAGCCATTTTAAAATTGAATCTGTTGAGGGGC — 

TTGACTAAAATCTTTTAAGTA 4 73 

Qy 317 GATCTCTGGCAG AACTGTA 

GCACCTCT 3 43 



Db 474 

ATTTGTGTAATGGAATACTGTCAGTGGATTTTTTCGTCTCATTTCTGCACGTGCTCCTTT 533 



Qy 3 44 -TCCTCAGGAAATGTCCACCACTGTTTCTCATCATCACCAAACGAA — 

TGG C 392 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 53 4 GTTCTCAGAA 

C AG AAGC T T T T TAT AC AC AT C C C AT AACGC AGC TGGAGAGAGT T A 5 8 8 

Qy 393 TGCAGTCTG TCCAGGCCACCATGAT— CCTGTCGATCAT 

CTTCAGCATTCT 441 



Db 589 TGAAGTCAGTTATTATAAGGAACACAAAGGTTGCTTTCCATTCTTTGCCTTTAG- 

ATAAT 6 47 

Qy 442 GTCTCTGTTCCTGTTCTT CTGCCAACTCTTCACCC — 

TC 478 

I I I I I I I I I I II I I I I I 

Db 648 

T AAT TTTTTTGTTTTCT T AAAAT GG AG T AT T T AAAGAAGGAAGAAAT T C AC AAG AAAT AA 7 0 7 

Qy 479 ACCAAGGGGG GCAGGTTTTAC 

ATCACTGGAATCTT 513 



Db 708 

ACTGTTGGAGAATTTAGAAAAGTTTGAAGTTTTTACCACCTTTTCTATCTCTAGTTTTGT 76 7 

Qy 514 CCAAATTCTTGCTG — GTCTGTGCGTGATGAGTGCTGCGGCCATC- 

TACACGGTGA 566 



Db 76 8 GTGGCCAAACACTTG-TGCCGCCTGGG- 

GCGGTGGGGGTAGAG G C AAGC AT AG AC AG AG A 825 

Qy 56 7 GGCACCCGGAGTGGCAT-CTCAA CTCG-GATTACTCCT-ACGGTTTCGCCT- 

ACAT 618 



Db 826 GGAACTAAGCCAGACATGGACAAAGGCACGAGCCAAAACCAGACAGT 

CCTGGCCGT 881 

Qy 619 CC TGGCC TGGGTG GCCTTCCCCCTGGCCCTTCTC- 

AGC 655 



Db 882 

TCGATAGGCCAGCAGTGGGTGAGACAGGTCAGCCAGCTGCAGGGGCGGGAGCGGGGGGGC 9 41 

Qy 656 G GTGTCATCTAT GTGATCTTGCG GAAACGCGA — 

ATGAGGCG 695 



Db 942 

GGGTAGGGACATGGATGTGTGAGGTGCTCATGCGTGCGATCGTGACTAGATTCTGAAACT 10 01 

Qy 696 CCCAGACGGTCTGTCTGAG — GCTCTGAGCGTACATAGGGAAGG 

GAGGAA 7 43 



Db 1002 



GCCAGCCATTTTCCCAGCTCCGCTTTGTGCAATCTAAAGGAATGCATCCCCTCTGAAGCA 1061 



Qy 744 GGGAAAACAGAAAGCAGACAAAGAAAAAAG- 

AGCTAGCCCAAAATCCCAAACTCA 79 7 



Db 1062 GTCTTGCCAGAGCCTAGTGAGGGAGAGAAGTATGGTAAATACCACAACATATGGAA- 

TCA 1120 

Qy 79 8 AACCAAACC— AAAC 

AGAAAGCAGTGGAGGTGG 82 8 

I I I I I III I I I I I I I I I I I I I 

Db 1121 

GAAAATACCGGGAACTGGAGTGGGCAAGGGGGAATGCAGAGGGTGTGGAAATTTTTTAGT 118 0 

Qy 829 GGGTTGCTGTTGATTGA 

AGATGTATATAATATCTCCGGTTTATAAAACCTA 879 



Db 1181 GATCTGGAATG-TGTTGAGTGACAGGAAGTGCCCCAAGCTTCTCC 

CCCACCAA 1232 

Qy 880 TTTATAACA CTTTTTACATA 

TATGTACATAGTATTGTTT 918 

I I I I I I I I I I I I I I I I I I I I 

Db 1233 

CTCTTCTCAGTCGCGCCTGCTTTTGTCTAACTCTTGTAATCTACACACTACTGCTTACAA 12 92 

Qy 919 -GCTTTTTATGTT G AC CATC AG CCTCGTGTTGAGCCTTA — 

AAGAAGTAGCT — 96 7 

I I I I I III I I I I II III I I I I I I 

Db 1293 AGC T G T C T GAG T T T AAG AC AAAAAG AAAC C T AAAAG T - 

CTCCTTACTTCATAGTACCTGT 1351 

Qy 968 

AAGGAACTTTACATCCTAACAGTATAATCCAGCTCAGTATTTTTGTTTTGTTT 102 0 



Db 1352 GATATGGAAGGAATGTAAAAGCATGACCCTTTAAGC 

CTATGGACATTTTTTCAGGTA 140 8 

Qy 1021 TT TGTTTGTTTGTTTTGTTTTACCCAGAAATAAGATAACTC- 

CATCTCGCCC 1071 



Db 1409 

T AC AGG AG AAAG AT AAAAT AAT T T T CC AC AGAAAAATGAGAAT T ATG AAT TAT AT AG T T C 146 8 
Qy 1072 C — TTCCCTTTC- 

ATCTGAAAGAAGATACCTCCCTCCCAGTCCACCTCATTTAGAAAACC 112 8 



Db 1469 AGGTTCCAAATCTAATTTTTAAAAGAATTCTGATTC TGCTACACTTT- 

ACA 1518 

Qy 1129 

AAAGTGTGGGTAGAAACCCCAAATGTCCAAAAGCCCTTTTCTGGTGGGTGACCCAGTGCA 118 8 



Db 1519 AATGCTTAGGTTGGTTCCTAATTTG — AAGGAGACTTGTTTTATTTGGT 



TAATGCA 15 72 



Qy 1189 T C - C AAC AG AAAC AG 

CCGCTGCCCGAACCTCTGTGTGAAGCTTTACGCGCACACGGA 12 4 4 



Db 15 73 TTGCATTTGAACTTGTTTCTATTTTCTTTGCATAAATTTGGA- 

CTTTGGGAGAAAAATGC 1631 

Qy 12 45 CAA AATGCCC AAACTGGAG 

CCCTT 1268 

II I I I I I I I I I I I I I I I I I I 

Db 1632 

AAAGTAATAAGTAGAATGCACTTGGGGGAAAAAAGGAGGATTTTTCCCTTCATGGGTTGG 1691 

Qy 1269 GCAAAAACACGGCTTGTGGCATTG 

GCATACTTGCCCTTACAGGTGGAGTATC 132 0 



Db 1692 AAAGTATTTTAAAGGGTTGTTTTCTTGAAAAAACAAGCTCTCTCTTAC 

TTTC 1743 

Qy 1321 TTCGTC AC AC AT C TAAATG AGAAATC- 

AGT 1349 



Db 1744 

TGCATCTATGCTATAAAGATAACTATGCTATAATAAATGTAAGGTAGAAAACTTTAAAGA 1803 

Qy 1350 GACAACA — AGTCTT 

TGAAATGGTGCTATGGATTTACCATTCCTTATTATCACTA 1402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1804 GAAAATAACAGTGTTCTAAGTGAAAAGCTACTTAGCATTTTCCCAAAC 

TCACAC 1857 

Qy 1403 ATCATCTAAACA-ACTCACTGGAAAT 

C C AAT T AAC AAT T T T AC AA- 14 46 



Db 1858 ATTATC — 

AACAGAAACACTAAAAATTAAGAGAAAAGCGGCCAACTAATTAAATAGCTAG 1915 

Qy 1447 -CATAAGATAGAATGGAGACCTGAATAATTCTGTGTAATATAAATGGT — 

TTATAACTGC 15 03 



Db 1916 

CCTTATTTTGGGGGTATGGGATAGAAAATTAAGTGTGAATAAAATGATACTTGGGAATGT 19 75 
Qy 1504 TT 

T TGT ACC T AGC T AGGC TGC T AT TAT T AC T ATAATGAGTAAATCAT 

II I I I I I I I I I I I I I I I I 

Db 1976 TTCTCCCTCGTACC- 

ACAAAGGATGTTAGTGGTCAGCCTACGAGTTAATCCTTCCTAGCA 2 03 4 

Qy 1551 AAAGCCTTCAT — CACTCCCACATTTTTC 

TCGGAGCATCAGAAC 159 8 



Db 2035 



1550 
I I I I I I I I I I 



TTACGG- 

I I I I I I I I II 



TGGCTCTGAGCCTTCATGCCGAGCAGACGTTATTCACATGACGATTCGAAAAGTCCATTC 2 09 4 



Qy 1599 AAGCGTCTAGACTCCTTGGGACCGTGAGTTCCTAGAGCTTGGCTGGGTCTAGGCT 

GT 1655 



Db 2 095 ATATATCTCG-CTACCTGGATT — TGAAT AGAAACCAGACAG 

CAATTCTTTAGT 2145 

Qy 1656 TCTGTGCCTCCAAG-GACTGTCTGG-CAATGAC-TTGTATTGGC 

CACCAACTG 1705 



Db 2146 TCC- 

AGCCACCATTCGCCCCACTGGACAATAGCGATTTGTTAGCACAGAGTCACAGGCTG 2 2 0 4 

Qy 1706 TAGATGTATATATGGTGCCCTTCTGATGCTAAGA- 

CTCCAGACCTTTTGTTTTTGCT 1761 

III II I I I I I I I I I I I I I 

Db 22 05 TGGGACAC 

AAAG CTTGGAGCT G C AG AAAG AT GGGGGATT C AG AG 2 2 48 

Qy 1762 TTGCATTTTCTGATTTTATAC-CAACTGTGTGGACTAAG- 

ATGCATTAAAATAAACA 1816 



Db 2249 

AACAGGAAATTACAGGCTCGATGCACTCCTGGCAGCTCTGAGAATACAAAAAAAAAAAAA 2308 

Qy 1817 TCAGAGTA ACTC 1828 

III III 
Db 23 09 AAAAAAAAGGCCACATGTGCTC 2330 
Query Match 37.4%; Score 4064.5; DB 1; Length 2336; 

Best Local Similarity 42.4%; Pred. No. 1.6e-265; 



Matches 1021; Conservative 0; Mismatches 680; Indels 707; 
Gaps 136; 

Qy 12 AGCAC CACCAGGGAAC ATCTCGGGG- 

AGCCTGG 43 



Db 2335 

AGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTGTATTCTCAGAGCTGCCAG- 22 77 

Qy 4 4 TTGGAAGCTGCAGGCTTAGTCTGTCGGCTGCGGGTCTCTGACTGCC 

CTGTGGGGAG 99 



Db 2 2 76 — GAGTGCATCGAGC 

CTGTAATTTCCTGTTCTCTGAATCCCCCATCTTTCTGCAG 222 4 

Qy 100 GGTCTTGCCTTAACATCCCTTGCATTTGGC TGCAAAGAAATCTGC — 

TTGGAAGAA 153 

I I I I I II MINI I I I I I I I I I I I I 

Db 2223 CTCCAAGCTTTGTGTCCCACAGCCTGTGACTCTGTGCTAACAAATC- 

GCTATTGTCCAGT 2165 

Qy 154 

GGGGTTACGCTGTTTGGCCGGGCAGAAACTCCGCTGAGCAGAACTTGCCGCCAGAATGCT 213 



Db 216 4 GGGG CGAATGGT — GGCTGGAACT- 

AAAGAATTGCTGTCTGGTTTCT 2121 

Qy 214 CCTCCTGTTGCTGAG 

TATCATCGTCCTCCACGTCGCGGTGCTGGTG C 26 0 



Db 212 0 ATTCAAATCCAGGTAGCGAGATATATGAATGGACTTTTCGAATCG — 

TCATGTGAATAAC 2 063 

Qy 261 TGCTGTTCGTC-TCCACGATCGTCAGCCA ATGGA 

TCGTGGGCAATGGA 307 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 062 GTCTGCTCGGCATGAAGGCTCAG-AGCCATGCTAGGAAGGATTAACTCGTAGGC — 

TGAC 2006 

Qy 308 CACGCAACTGATCTCTGGCAGAAC — TGTAGCACCTCTTCC 

TCAGGAAATGTCCA 36 0 



Db 2 0 05 CAC-TAAC— 

ATCCTTTGTGGTACGAGGGAGAAACATTCCCAAGTATCATTTTATTCACA 19 49 

Qy 361 CCACTGTTTC — TCATCATCACCAAACGAATGGCT-GCAGTCTGTCCA 

GGCCACCAT 414 



Db 1948 

CTTAATTTTCTATCCCATACCCCCAAAATAAGGCTAGCTATTTAATTAGTTGGCCGCTTT 18 89 



Qy 415 GATC- 

GTCTCT 447 

I I 



-CTGTCGATCATCT-TCAG- 
I I I I III II I I II 



-CATTCT — 
II II II 



Db 1888 

TCTCTTAATTTTTAGTGTTTCTGTTGATAATGTGTGAGTTTGGGAAAATGCTAAGTAGCT 182 9 

Qy 448 GTTC CTGTTCT 

TCTGCCAACTCTTCACCCTCACCAAGGGGGGCAGG 493 

III I I I I I I III I I I I I I I I I I 

Db 182 8 TTTCACTTAGAACACTGTTATTTTCTCTTTAAAGTTTTCTACCTTA 

CATT 1779 

Qy 49 4 TTTTACATCACTGGAATCTTCCAA — ATTCTTGCTG 

GTCTGTGCGTGATGAGTGCTG 548 



Db 1778 

TAT T AT AGC AT AGT T ATC T T T AT AGC AT AGATGCAGAAAGTAAGAGAGAGC T TGT T T T T T 1719 

Qy 549 C-GGCCATCTACACGGTGAGGCACCCGGAGTGGCATCTCAACTC 

GGATTACTCC 601 



Db 1718 CAAGAAAACAACCCTTTAAAATA 

CTTTCCAACCCATGAAGGGAAAAATCC 1669 

Qy 6 02 TACGGTTTCGCCTACATCCTGGCCTGGGTGGCCTTCCCCCTGGCCC- 

TTCTC 652 



Db 166 8 TCCTTTTT TCC CCCAAGT- 

GCATTCTACTTATTACTTTGCATTTTTCTC 1621 

Qy 653 AGCGGTGTC ATC TAT G TGATCTTGCGGAAACG 

CGAATGAGGCGC C 69 7 



Db 162 0 C-CAAAGTCCAAATTTATGCAAAGAAAAT — 

AG AAAC AAG T T C AAAT GC AATGC AT T AAC 156 4 

Qy 69 8 CAGA CGGTCTGTCTGAGGCTCTGAGCGTACATAGG- 

G AAG G G AG G AAG 744 



Db 1563 

CAAATAAAACAAGTCTCCTTCAAATTAGGAACCAACCTAAGCATTTGTAAAGTGTAGCAG 15 0 4 

Qy 7 45 GGAAAACAGAAAGCAGACAAA GAAAAAAGAGC T AGC 

CCAAAATCC 789 



Db 1503 

AAT C AG AAT T C T T T T AAAAAT T AGAT T TGGAACC TGAAC T AT AT AAT T C AT AAT T C T 14 4 7 

Qy 79 0 CA AAC TC AAAC C AAAC C AAAC AG AAAG— 

CAGTGG 822 



Db 1446 

CATTTTTCTGTGGAAAATTATTTTATCTTTCTCCTGTATACCTGAAAAAATGTCCATAGG 13 8 7 



Qy 



823 AGGTGGGGGT — TGCTGTTGATTGAAGATGTATATAATATCTCCGGTT- 



TATA 8 72 

I I I I I I I I I I I 

Db 1386 CTTAAAGGGTCATGCTTTT 

ACATTCCTTCCATAT C AC AGG T AC T ATGAAGT AAG 1333 

Qy 8 73 AAACCTATTTATAACACTTTTT 

ACATATATGTACATAGTATTGTTTG 919 



Db 1332 GAGACT- 

TTTAGGTTTCTTTTTGTCTTAAACTCAGACAGCTTTGTAAGCAGTAGTGTGTA 12 7 4 

Qy 92 0 CTTTTTATG TTGACCAT — CAGCCTCG — TGTTGAGCCT 

TAAAGAAGTAGCTAA 969 

II II I I I I I I I I I I I II II I I I 

Db 1273 

GATTACAAGAGTTAGACAAAAGCAGGCGCGACTGAGAAGAGTTGGTGGGGGAGAAGCTTG 1214 

Qy 9 70 GG AAC T T T AC AT C C T AAC AGT AT AATC C AGC T C AG T ATTTT 

TGTTT 1015 



Db 1213 

GGGCACTTCCTGTCACTCAACACATTCCAGATCACTAAAAAATTTCCACACCCTCTGCAT 115 4 

Qy 1016 TGTTTTTTG TTTGTT TGTTTTGTTTTACCCAGAAATAAGATAACT- 

CCA 1063 



Db 1153 

TCCCCCTTGCCCACTCCAGTTCCCGGTATTTTCTGATTCCATATGTTGTGGTATTTACCA 109 4 

Qy 106 4 TC — TCGCCCCTTCCCTTTCATCTGAAAGAAGATACCTCCCTCCCAG 

TCCACCTCA 1117 



Db 1093 TACTTCTCTCCCTCACTAGGCTCTGGC — AAGA CTGCTTCAGAGGGGATGCA- 

TTCC 1040 

Qy 1118 T T T AG AAAAC - C AAAG T G TGGGTAGAAACC CCAAAT 

GTC 1155 

I I I I I I I I I I I I I I II I I I I I III III 

Db 1039 

TTTAGATTGCACAAAGCGGAGCTGGGAAAATGGCTGGCAGTTTCAGAATCTAGTCACGAT 9 8 0 
Qy 1156 

CAAAAGCCCTTTTCTGGTGGGTGACCCAGTGCATCCAACAGAAACAGCCGCTGC 12 09 



Db 9 79 CGCACGCATGAGCACCTCACACATCCATGTCCC — TAC — CCGCC- 

CCCCCGCT-C 930 

Qy 1210 CCGAACCT CTGTGTGAAGCTTTACGCGCACACGG 

ACAAAATGCCCAAA 1257 



Db 929 CCGCCCCTGCAGCTGGCTGA — 

CCTGTCTCACCCACTGCTGGCCTATCGAACGGCCAGGA 872 

Qy 1258 CTG GAGCCCTTGCAAAAACACGGCTTGTGGCAT TGGC- 



ATACT 1299 



Db 871 

CTGTCTGGTTTTGGCTCGTGCCTTTGTCCATGTCTGGCTTAGTTCCTCTCTGTCTATGCT 812 

Qy 13 0 0 TGCCCTTA CAGGTGG AGTATCTTCGTCACAC — 

ATCTAAATGAG 13 41 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 811 TGCCTCTACCCCCACCGCCCCAGGCGGCACAAGTGT- 

TTGGCCACACAAAACTAGAGATA 753 

Qy 13 42 AAATCAGTGACAACAAGTCTTTGAAATGGTGCTATGGATTTACC 

ATTCCTTA 1393 

II III I I I I III Ml I I I I III II III 

I I I 

Db 752 GAAAAGGTGGTAAAAA — CTTCAAACT-TTTCTA — 

AATTCTCCAACAGTTTATTTCTTG 698 

Qy 139 4 T-TATCACTAATCATC-TAAACAACTCACT 

GGAAATCCAATTAACAATTTTACAAC 1447 

I II II I I I I I I I I I I I I I I I I I I II III I 

I 

Db 697 TGAATTTCTTC- 

CTTCTTTAAATACTCCATTTTAAGAAAACAAAAAAATTAATTATCTAA 639 

Qy 1448 A T AAG AT AGAATGGAGACC TGAAT AAT T C T G T G T AAT AT AAATGG - 

TTTATAA 1499 

I I I I I II I I I I I I III II I I I I I I I I I I I I 

Db 63 8 AGGCAAAGAATGGAAAGCAACCTTTGTGTTCCT 

TATAATAACTGACTTCATAACTCT 582 

Qy 1500 CTGCTTT 

TGTACCTAGCTAGGCTGCTATTAT 153 0 

I I I I I I III II I I I I I I I I I 

Db 581 CTCCAGCTGCGTTATGGGATGTGTATA- 

AAAAG CTTCTGTTCT G AG AAC AAAGG AGC AC G 523 

Qy 1531 TACTATAATGAG 

TAAATCATAAAGCCTTCATCACTCCCACATTTTTCTTAC 15 81 

II I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 522 

T GC AG AAAT GAG AC G AAAAAAT C C AC T G AC AGT AT TCC AT T AC AC AAAT T AC T TAAAAGA 463 

Qy 1582 GGTCGGAG C AT C AG AAC A 

AGCGTCTAG 16 0 8 

I I I I I I I I I I I I I I I I I 

Db 462 TTTTAGTC- 

AAGCCCCTCAACAGATTCAATTTTAAAATGGCTTTTAGTTAAAAAAAAAAA 4 0 4 

Qy 16 09 ACT— CCTTG GGACCG TGA GTTCC- 

TAGAGCTT- 163 8 

I I I I I I I I I I III I I I I I I 

I I I 

Db 403 

ATTGAAAGTGCTTACCCAGTAAAAGAACCGAAGTAGTCCTGAACTGTTACGTAAGACTTT 3 4 4 

Qy 1639 GGCTGGGTCT AGGCTGTTCTGTGC 

CTCCAAGGACTGT 1675 

I I I I I I I I I I I I II I I I I I I I 



I 

Db 343 

TTACAGTTGGATCTTTGTCAAAAGGGGATGGGGGTGATGGGAGAAAGCAGCAACGACAAT 2 8 4 

Qy 1676 CTG 

GCAATGACTTGTATTGGCCACCAACTG 

I I I I I I I I 

Db 2 83 CAAAAAAGTTCGAGCTGCTGTGGCTAG 

AGG AC AAC TTCTGTGTTTC C AG AT AGG AT T 22 7 

Qy 1706 -TAGATGTATATATGGTGC — 

CCTTCTGATGCTAAGACTCCAGACCT 1749 

I I I I I I I I I I I I II I II II I I I I I I 

Db 226 CTTGCTGTAGAAATGGAACTTCCAGC — 

C AGC AC AGC AT C C T G T C C C AG T AG AG AAATGA 169 

Qy 1750 -TTTGTTTTT GCTTTGC 

ATTTT- 1770 



Db 168 

GTTTGTCAGTTAAAACAAAAAAAAAATTAGATACTGGAACCCAGGCTAGACGAGGTATTG 109 

Qy 1771 CTGATTTTATACCAACTGTGTGGACTAAGAT-GCATTAAAATA- 

AACATCAG 182 0 



Db 108 

AACCGCGCCAGATTTCCTTGCAGCCCTGTCTGCTCAGCTCGCATTCCAGTAGGCCAACAG 4 9 

Qy 1821 AGTAACTC 1828 

I III 

Db 48 TG CTC 44 



1705 

I I I I I III 
Query Match 36.7%; Score 3982; DB 1; Length 1946; 

Best Local Similarity 44.6%; Pred. No. 3.3e-260; 
Matches 949; Conservative 0; Mismatches 680; Indels 498; 
Gaps 121; 



Qy 1 CAGTTACAGGG AGCACCACCAG 

GGAACATCTCGGGGAGCCTGGTTG 46 

I I I I I I I I I I I I I I I III I I I I 

I I I I 

Db 2 CCTGTACGGAAGTGTTACTTCTGCTCTAAAAGCTGCGGAATTCCTC 

GAGCACTGTTG 5 8 

Qy 4 7 GAAGCTGCAGGCTTAGTC-TGTC- 

GGCTGCGGGTCTCTGACTGCCCTGTGGGGAGGGTCT 104 

I I I I I I I I I I I I I I I I I II III I I I I I I I I I 

Db 59 GC — CTACTGGCTGAGGCTTATCTGACATC TCATTGTC — 

CCCTGGTGTGTGTGTGT 111 

Qy 105 - 

TGCCTTAACATCCCTTGCATTTGGCTGCAAAGAAATCTGCTTGGAAGAAGGGGTTACGC 163 

III I I I I I II I II II I I I I I I I I 

I I 

Db 112 GTGTGTGGTGTTTTTGTGTGTGTGG — GGTGTGAGTGTGGCAGAGAGGAAAGGG — 

GAGC 16 7 

Qy 164 

TGTTTGGCCGGGCAGAAACTCCGCTGAGCAGAACTTGCCGCCAGAATGCTCCTCCTGT-T 2 2 2 

I I I I I I I I I I I I I I I I I I III III 

Db 16 8 TGTTTGATT CTGTAGTTCC 

TTTTCTGCCTTCTTCCTTTCTCTGTAA 213 

Qy 223 GCTGAGTATCATCGTCC TCC — ACGTCGCGGTG CTGGTGCTGCTG 

TTCGT 270 

II III I III II I I I I II I I I I I I I I 

I I 

Db 214 

ACTTTGGATACTTATCCAAATTACTAACGGCAGATTGAGCCCTATGCAGATGGCATGTGT 2 73 

Qy 2 71 CTCCACGATCGTCAGC 

CAATGGATCGTGGGCAATGGACACGCAACTGATCTCTGGCA 32 7 

II I I I I I I II I I I I I I I II II III III 

I I 

Db 274 CTGTGACAACCTCTGCTCTCCACATCTCTTGGGC-CTGTTTAC CTGCGCTCC- 

CA 326 

Qy 328 

GAACTGTAGCACCTCTTCCTCAGGAAATGTCCACCACTGTTTCTCATCATCACCAAACGA 3 8 7 

III III I III III I I I I I I I I I I I I I I I I I III 
Db 32 7 GAGCCTCCGC-CAGCATCC-CAG — AATCTCCATCCCCATCTCTCA- 

C T TAT AC AC AC AT 381 

Qy 388 ATGGCTGCAGTCTGTCCA — GGC — CACCATGATCCT — 

GTCGATCATCTTCAGCATTCT 441 

II I I I I I I I I II III I I I I III I II I 

I I 

Db 3 82 CAGTCATCGGT-TATCCATTAGCTAAACCGCCTTCCTTAATAGCTTTAC- 

ACTGTTTGCT 439 

Qy 442 GTCTCTGTTCCTGTTCTTCTGCCAACTCTTCA CC 

CTCACCAAGGGGGGCA- 491 



Db 440 TTCTCTGGAAC — 

ATTTT TAGTTAAAAT T TC AT AATGCAGTTGCACACAAATGAAGAC AC 49 7 
Qy 492 GGTT — 

TTACATCACTGGAATCTTCCAAATTCTTGCTGGTCTGTGCGTGATGAGTGCTGC 5 49 



Db 49 8 AGATGGCTGCATCCTCCGTCTCTTCCCCTCGTTTACAGG 

AAGCTGC 543 

Qy 550 

GGCCATCTACACGGTGAGGCACCCGGAGTGGCATCTCAACTCGGATTACTCCTACGGTTT 6 09 



Db 544 GG ATCAGGGAGG GGTGT 

TAGGGTTACCCACATGGTAA 58 0 

Qy 610 

CGCCTACATCCTGGCCTGGGTGGCCTTCCCCCTGGCCCTTCTCAGCGGTGTCATCTATGT 669 



Db 581 GGGCAGAGACAAG AGGGGACC — 

CCAGTTTTCCATGCTGCACATGGTCATTGCTGG 634 

Qy 670 

GATCTTGCGGAAACGCGAATGAGGCGCCCAGACGGTCTGTCTGAGGCTCTGAGCGTACAT 729 
I I I I I I II II I I I I I I I I I I III 

Db 635 GGAC-TGAGGTTT-GCACAT CACCCTGCC CTGTT CTC 

CCTCCGC 676 

Qy 73 0 AGGGAAGGGAGGAAGGGA — AAACAGAAAGCAG 

ACAAAGAAAAAAGAGCTAGCCC 782 



Db 677 

TGGGGAGAAAGTCAGGGATGGAGCAAGCTGCAGCATCTTCTGAAAAAGAAAAGGTGGCCT 736 

Qy 783 AAAATCCCAAACTCAAACCAA- 

ACCAAACAGAAAGCAGTGGAGGTGGGGGTTGCTG 83 7 

III III I I I I I I I III II 

Db 73 7 TGTCTCCAGGTCTCCCCTCAAGTCCCACCTTCCCATAGT 

TTTCTGCCAC 785 

Qy 83 8 T T GAT T G AAG AT G T AT AT AAT AT CTCCGGTT T AT AAAAC C T AT T T 

ATAACAC 8 89 



Db 786 TTCTCTTGATTTTC-CTCTCTGCCACTTCTCTTGATTTT 

CCTCTCTGCCACTTCTC 840 

Qy 89 0 TT — TTTACATATATG TACATAGTATT — 

GTTTGCTTTTTATGTTGACCATCAGCC 9 41 



Db 8 41 TTGATTTTCCTCTCTGCAGCTGCTTTGAGGTGGGTTTTCTCCAGATGCACACTTTC— 

CC 898 



Qy 9 42 TCGTGTTGAG-CCTTAAAGAAGTAG- 

CATCC 9 83 



-CTAAGGAACTTTA- 



I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 

1 1 1 

Db 899 

CTGCTTTGCGTCCTTATTCTGGTAGAAGCACAATCTAAAGCTCATTAAGGGAACTAATCA 95 8 

Qy 984 TAACAGTATA ATCC AGCTCAGTAT T T T T 

GTTTTGTTTTTTGTTTGTTTG 1032 

I I I I I I I I I I I I I I I I I I I 

I 

Db 959 

ATTCTGTGCATGGCGCTAGCTCAGCAGATCACCACACAGGCAGCACTATTAGCAAGTCGG 1018 

Qy 1033 TTTTGTTTTACCCA — GAAATAAGATAACTCCATCT — 

CGCCC 1071 

I I I I I I I I III I I I I I I I I I I I I 

Db 1019 

TGCTTAACACATGGCACTTCCATGAATCGATATGGAGCCCGTGTAGAACAAGGCATG 10 75 

Qy 10 72 C T T C C C T T TC ATC T G AAAG A- AG AT AC C 

TCCCTCCCAGTCCAC 1113 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 10 76 GGTTTTTTTCTCTTCCCATTAA 

GAAAAACTGATGCCAAAAATAACTTCTCAGATATT 1132 

Qy 1114 CTCATTTAGAAAACCAAAGTGTGGGTAGAAACCCCAAATGTC — 

CAAAAGCCCTTTTCTG 1171 

III I I II II I I I I I I I III I I I I I I 

I 

Db 1133 TTCAAGT — 

ATGACTTTTATGAAGGGAAAAAGCATTTTTGTTTGCAAAATCATGCTTCAG 119 0 

Qy 1172 -GTGGGTGACCCAGT GC 

ATCCAACAGAAACAGCCGCTGCCC 1211 

III I I I I I II I I I I II I 

Db 1191 TGCAGG 

CCAGTTGTGAATTGTGATGGCTTTTATTTCTCCTGGGGCTGTAACTTTAA 12 46 

Qy 1212 GAACCTCTG 

TGTGAAGCTTTACGCGCACACGGACAAAATGCCC 1254 

I I I I I I I I I I I I I I I I I I I I III 

I 

Db 12 47 GGGTTTAGAATTTGGAACCACAGCCTAGCTAATCATGACACACACACACACA 

CAC 13 01 

Qy 1255 AAACTGG— AGCCCTTGCAAAAACAC— GGCTTGTGGCATTGGCATAC 

TTGCCC 1304 

III I I I I I I I I I I I III II I I I I I III 

Db 1302 

ACACATGCACACACATACACATACACAAAGCATCACGAAGAACCATACAAATTGTACATT 1361 
Qy 1305 — 

TTACAGGTGGAGTATCTTCGTCACACATCTAAATGAGAAATCAGTGACAACAAGTCT 1361 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 1362 ATTTTACACATGGA GGCTCAC- 

T C T AAAAT AG AT AC C AT T T T AAAT AT T AAC 1412 

Qy 1362 TTGAAATGGTGCT-ATGGATTTACCATTC CTTATT 

ATCACTAATC-A 1406 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1413 



TAAAACTTGTGCTCATTGTATGTTCATTCTATATGTACTGATTTGTTATTCACATTTCTT 14 72 



Qy 140 7 TCTAAA CAACTCACTGGAAATC — CAATTAACAATT 

TTACAACATAA 1451 

I I I I I I I I I III I I I I I I I I III I I I I 

Db 1473 

T CAAAAT C G T T CAAAT T T C T AGCCC AC ATC AAT T AAAT AT T AAT AGT AC TTTCTACA 1529 

Qy 1452 GAT AG AAT G - GAG AC C TGAAT AATTCTGTGTAATATAAATGGT-TTAT- 

AAC 1500 

I I I I I I I III I I I I I I I I I I I I I I I I I 

I I 

Db 153 0 AACATGAGCGCACAAAATAAATTCAAATCTAT- 

TTTTCCCACTGGTGTTATCAATAC 1585 

Qy 1501 TGCT TTTGTACCTAGCTAGG 

CTGCTATTATTACTATAATGAGTAAATCA 1549 

I I I I I I I I I I I I I I I I I I I I I II I I I I I 

I 

Db 1586 TGCTCATACTTTGTCAGTAACTAAGTATCACATGATCTTAAAAC — 

TAATGTCACATACT 16 43 

Qy 1550 TAAAGCCTTC ATCACTC — 

CCACATTTTTCTTACGGTCGGA 158 8 

I I I I I I I I I I I I I I I I I I I I I II 

Db 1644 

AAAAAGCTTCTGAGGCAAATTGTAGAAAGAACTCTCAACATCATTGTTCTACTGGAC 17 0 0 

Qy 1589 

GCATCAGAACAAGCGTCTAGACTCCTTGGGACCGTGAGTTCCTAGAGCTTGGCTGGGTCT 16 4 8 
I I I I I I I I I I I I I I I I I I I I 



Db 1701 AACATAC — ATAAAATTATT TTAC- 

AGTGATGGGAGAGAAAT 1739 

Qy 1649 AGGCTGTTCTGTGC CTCCAAGGACTGTCTGGCAATGACTTGTATTGGC 

CA 1698 

I I I I I II II I I I I I I I I III I I I I I I I I I I I 

I 

Db 1740 AGGCTCCTC-ATCCTAAAAGCTGCGAAGACAG — 

TAGCGGTGCCGTGTTTTGGCGTTACT 1796 

Qy 1699 CCAACTGTAGATGTATATATGGTGCCCTTCTGATGCTAAGA 

CTCCAGACCTTTTGTT 1755 

II I I I I I I I III I I I I II I I I I I I I I I 

Db 179 7 CC-CCTGTGGAT CCCAGC- 

GACGGGTGGATTTCTCCTGTGCTTTATCA 18 42 

Qy 1756 TTTG-CTTTGCATTTTCTGATTTTATACCAACT GTGT 

GGACTAAGATGC— 18 03 

III II III I I I I I I I I I I I I I I I I 

Db 1843 

TCAGACACAAAATGGACCAAAATGGACCAAAATGGACGAGTGTGAGGGACACAGAGGCTG 19 02 

Qy 1804 — ATTAAAATAAACATCAGAGTAACTC 182 8 

I I I I I I I I I II II 
Db 19 03 CTGTAAAAAAAAAAAAAAAAAGGCCAC 1929 
RESULT 1 

US-10-478-914-21 

; Sequence 21, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 21 

; LENGTH: 3589 

; TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-478-914-21 

Query Match 22.0%; Score 8423.5; DB 1; Length 3589; 

Best Local Similarity 37.0%; Pred. No. 0; 

Matches 2396; Conservative 0; Mismatches 1139; Indels 2937; 
Gaps 331; 

Qy 171 CCAAGTCCTGTGGTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAGTATATGC- 

CTGGA 229 

II I I I I I I I I I I I II I I I I I I I I I I 

Db 2 CC TACTG-GATTCGCCA TC-ATTTTGCA 

CACTG— 32 

Qy 23 0 ACGGTACAAGAATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGCCT- 

GC 288 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 



Db 33 
TGTCTGGC 6 8 



-CGAGGGGCT-CCGTGTGTGCGCCCTGTCT- 



Qy 289 

AGAACTACATTCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGCCGCCAGACCTCCA 3 4 8 

I I I I I I I I I I I I II II III I I I I I I I I 

I I I I 

Db 69 CGTCCT-CATCCCTTCCCCCA CC— CCC TGCCGCCA- 

CTCCG 106 

Qy 349 

TCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAAACCTGATGG 4 0 8 

I I I I I II I I I I I I I II I I 

I I 

Db 107 ACCCGCTCCCAAA GTGGCTTCACAATAG TCGGTC 



Qy 409 ACGTGCT- 

GCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGACAGTC 46 7 

III I I I I I I I I I I I I I I I I I I I III I I I I I I 

I 

Db 146 CGGTGTAGGCTGCG— CACCAGGT — CCACACTTAAGCG 

AAATCAAGGAGACC — C 195 

Qy 468 

ACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGATGGAA 52 7 
I I I I I I I I I I I I I I I 

Db 196 CCTTCT ACTTCTACCTT 

TGGG 216 

Qy 52 8 TTTTACTGCCAGTC- 

CTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACGC 586 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 217 -TTTGGTGC TCAATGCGAAGCTGC TGCA 

ACTC 2 47 

Qy 587 

AGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCAGGT 6 46 
I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 48 AGA-CACGCCTAAG-TCAACTCATG CAGAAAAAGG 

AGA 2 83 



Qy 647 

CCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTCATCTC 7 06 
III I I I I I I I I I I I I I I 

Db 284 AAAGTT — TTG — G TAT AC AAA 

TGCTC 306 

Qy 707 

TGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTCCAC 766 
I I I I I III III I 

I I I 

Db 307 TCAGT CCAGCCAGA 



Qy 767 

CTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATGGAATTGGA 82 6 
I III I I I I I II I I I I I 



Db 324 C- 

CGAAAGGTTGTGCTG- 



-AAGC 

343 



Qy 827 

GGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACTCTGCTCCAGGG 8 86 

II I I I I I I I I I I I I I I I I I I 

I I I 

Db 3 44 G C T G C AAC C G AAAG AT C AAGG AC C GGTA- 

TCTTCTAAAGGC 3 83 

Qy 887 

GCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAACCATGGCACGGA 9 46 

I I I I II I I I I I I I I I 

I 

Db 384 ACTGGA CAAATAC 

TGGCATGAA 405 



Qy 947 

GACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAGCTGGCCGACCA 10 06 
I I I I III I I I I I I I I II III 

Db 406 GACTG CCT GAAG TGTGC CT 

GCTG 428 



Qy 1007 

GGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTGGAAACCGAGGG 1066 
I I I I I I I I I I I I I 

I I 

Db 42 9 TGACTGTCGCTT GGGGGA 

GGTGG 451 



Qy 106 7 G C T G AAG AAG T C C AT C C AC AAC - 

CTCGGGACGATCTTAACCACCAACGCCGTTGCCTCAG 1125 

II I I I I I I I I I I I I I I I I I I I 

I I I 

Db 452 GC TCCACCCTGTACACTAAAGCTAATCTTATCCT- 

TTGTCGCAG 49 4 



Qy 1126 

AGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCCATGTCCGTCA 1185 
I I I I I I I I I I I I I I I I I 

Db 495 AGACTAT CTGAGGC TCTTTGG 

TGT 518 



Qy 1186 

CCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCCTACCTCACCG 12 45 
I I I I I I I I 

I I 

Db 519 AACGGGAAACTG 

CG 532 



Qy 1246 

CCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGACAACAAGAACG 13 05 
II I I I I 

Db 533 CTG 

CCTG 539 



Qy 1306 

GCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTACCCTGTCTCTGAGAC 1365 
I III 

Db 540 



TAGTA 544 

Qy 1366 

TCTATGACCAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCTGA-T 142 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 545 AGCTCATCC CTGCC — TTTGAGATG 

GTGATGCGTGCCAAGGACA 586 

Qy 1425 

GTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCCACGTC 14 8 4 
I I I I I I I I I II 

Db 58 7 ATGTTTACCACC 

TG 600 

Qy 1485 

AAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAGAGAAT 15 4 4 
II I I I I I I I I I I I I I I III 

Db 601 GACTGCTTT GCATGT- 

CAGCTTTGTA 625 

Qy 1545 

CCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATAAAGGAGAGTTT 16 0 4 

I I I I I I I I I I I I I I III I 

I I I 

Db 626 AT C AG AG A TTTTGTGTTGGAGACAA 

ATTTT 655 

Qy 16 05 ACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAATTG- 

CAGACAGTAA 1663 

I I I I I III I I I I I I I I I I I I I I 

I 

Db 656 TC CTAAAG AAT AAC AT GATCCT — 

TTGCCAGACGG — A 6 89 

Qy 1664 

CAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTTTTGGCATACG 1723 
III I II III I I I I I I III 

Db 69 0 CTAC GAGGAAGGTTT — 

AATGAAAGAAGGTT 718 

Qy 1724 

GGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACATCCCAGTGGGGA 17 83 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 719 ATGCACCCCAGGTTCGCTG-ATCTATCAACA 

TCACCCCATTAAGAA 763 

Qy 178 4 CATA- 

ATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCCGATGGGAAAT 18 42 

I I I I I I I I I I I II II I I I I I I I I I II 

I I 

Db 76 4 T AC AGAGC AC T AC — ATTCTTTTA TC 

TTTTTTGCTCCACATGTACAT 808 

Qy 1843 

TTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTGTGGACCGCA 19 02 

I I I I I II I I I I I I III III I I I I I 

Db 8 09 AAGAATTGACACAGGAA CCTACTGAATAGC- 

GTAGATATAG 8 48 



Qy 1903 

ATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCTTCCAGCCAAACG 1962 
I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 49 GAAGGCAGGATGGTTATATGGAATAAAAGGCGGACTGC 

ATCT 890 

Qy 1963 

GGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTGCAGGTCCCC 2 02 2 
I I I I I I I I I I I I 

I I I I 

Db 891 -GTATGTAGT GAAATTG 

CCCC 910 

Qy 2 023 ATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTCCATAAT 

CATT 2 0 79 

III III I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 911 AG T T — C AG AG T T G AAT 

GTTTATTATTAAAGAAAAAAGTAATGTACA-T 956 

Qy 2080 

CTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATGGAGAAG 2139 

III I I I I I I I I I I I III I I I I III III 

Db 957 ATGGCTGGATTTTTTTGCTTGC TATTC— GTT 

TTTGTGTCACTTGG 1000 

Qy 2140 GAAATGGGCAGTTTAATGCTCCAACAGGTGTA- 

GCAGTGGATTCAAATGGAAACATCATT 219 8 

I I I I I I I I I I I II I I I I I I I I I I I I I I I 

I I 

Db 1001 — CAT G AG ATG T T TAT T — TTGGACTATTGTATATAATGTATT GTAATAT — 

TT 1048 

Qy 2199 GTGGCCGACTGGGGAAACAGCAG- 

GATCCAGGTTTTTGATGGGAGTGGATCATTTTTGTC 2257 

III III I I I I I I I I I I I I I I I I I I 

I I I 

Db 10 49 GAAGC ACAAATGTAATACAGTTTTATTGTG 

TTACCATTTGTGTT 1092 

Qy 2258 

CTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCCCTAACTTCAGA 2317 
I I I I I I I I I I I I I 

Db 1093 C CATTTGCT 

TCTTTG 1107 

Qy 2318 TGGTCATGTTGTGGTTGCAGACTCTGG — 

AAATCACTGTTTCAAAGTCTATCGATACTTA 2375 

I I I I I I I I III I I I I I I I I I I I 

I I I I I 

Db 1108 TATTGTTGCA TTTAGTACAATCAGTGTTT 

AAACTTA 1143 



Qy 2376 

CAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGAATGGA 2 435 
I I I I II I I I I I 

I 

Db 1144 CTGTAT ATTTATGCTTTC 

TGTA 1165 



Qy 2436 

TTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCATTGGTGA 2 495 
III I I I I I III I I I I I 

I I I 

Db 1166 TTT ACCAG CTA TTTTA 

AATGA 1186 

Qy 2496 

ACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTATCCAATTTCTG 2 555 
I I I I I I I I I I I I I I I I I I I 

Db 1187 GC TGTAAC — TTTC — 

TAG T AAAG AAT T 1210 

Qy 2556 

TATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACTGCAGTTTTA 2 615 
I II I I I I II I I I I I I I 

Db 1211 GAAAAGCAAATCTC- 

ACTAATGATAC 1235 

Qy 2616 

CATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTTGCAAATCA 2 6 75 



Db 1236 ACAG ATAGATAAA- 

GCAAGTCT 1256 



Qy 2676 

AACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGTGAATGGTAGC 2 735 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

I 

Db 1257 AT C A- AC A- T T AAAAAT AC T A — 

AAAAATAAAGACACACACAGAGCATTTTAGTGACATC 1312 

Qy 2736 

TTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGTTATATACTTCA 2 795 

I I I I I I I I I I I I I I I III III 

Db 1313 CACTACTTATTGCCGCTATGAGTTAGAGTCTATCAGTGTT 

CTTGT 1357 

Qy 2 796 TTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAGTAGG- 

TATTA 2 854 

I I I I I I III I I I I I I I I I I 

I 

Db 135 8 TATAACC CCCT 

ATTTTCAGGGGGTTAAAA 1386 

Qy 2855 

CTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAAAAAGTAAG 2 914 
II I I I I I I I I I I I I I I I I I I I 

I I I 

Db 13 8 7 ATC AGC T T TAAAAAAAT AC AT AAAAAT T T C AT 

CTTAAA 1424 

Qy 2915 

GCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTCAGAACTTAAGT 2 9 7 4 

II I I I I I I I I I I I I III II I I I I I II I I I I I I I 
Db 1425 GC — ACTTTCATTTT-ATACCAACG-TGAAAAG-TGCCATTTTTAG — 

AAT AAC T T T A- A 14 76 



Qy 2975 



ATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTTGTGTGTTCTTT 3 03 4 

II II III I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1477 AGCTTAACAGGTTT CCTTT TAATATCCTTTTTT- 

TGTGTGCTCT 1519 

Qy 3035 

TCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAGGGAAACCTGA 3 09 4 

III II I I I I I I I I I I I I I I I I I I I I I I I 

Db 152 0 T T AC T T AC AC AATGGC T TTGTTT 

TGCTTTTTCAGCCACACC 1560 

Qy 3095 

AATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGGGAAACAAACT 315 4 

I I I I I I I I I I I I I I I I I I I I 

Db 1561 CCTT ATGTGAACTAGTGCC- 

TTTGGGTATCA 1590 

Qy 3155 

GCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGTCATGTTTTAT 3214 
I I I I I I I I I I I I I I I I 

I I I I 

Db 1591 CGTAA AATTTTTTCCAAAGGGTTA 

CTTTAA 1620 

Qy 3215 

GTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAAAGCTTTACCTC 32 7 4 

I I I I I I I I I I II II I I I I I I I III 

I I 

Db 1621 AAATC-TGTTACCACA 

AT TAT G AG ATG AT T T T T AAGTG AT AAAT T AAAC T T 1670 

Qy 3275 

CTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGACAGAGAGCCATAAGTAG 333 4 
II I I I I I I I I I I 

I 

Db 16 71 CT TCTTGTATAA 

ATTCT G 1688 

Qy 3335 

CCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATGATTCCCCCTTA 339 4 

I I I I I I I III 

Db 16 89 CCC AGATCT 

CTCC 1701 

Qy 3395 

GAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACAGGTAAGGGCATGTTAT 3 45 4 
II I II II I I I I I I I 

I I I I 

Db 1702 ACAAGAGCTGAG GGTTTCATAACT 

TTAT 1729 

Qy 3455 

GGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTATGTATACATGATTA 3514 

II I I I I I I I I I I I I I I II 

I I I I 

Db 173 0 GG — CTTAATAAATGTAT GACACTG 

AAAAGATTT 1761 



Qy 3515 



GGGTAAGATAGAATGTATTATATATATATATATATATACACACACACATATATATAGCTG 35 7 4 
I I I 

I I 

Db 1762 GAGTG 

TG 1768 



Qy 3575 

AATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTCGTCCAGCCACTC 363 4 
I I I I I I I I I I I 

I I I I 

Db 1769 AATCT ACTGAAA 

TCACTA 1786 



Qy 3635 

TTCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGCAAGCCCTAAAGCA 369 4 
I I I I I I I I I I I I I I I I 

Db 178 7 TAATGCACA 

TTGAAGCTATGATGGT 1811 



Qy 3695 

GATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAGTCAGAAAATGGCCT 3 75 4 
I I I I I I I I I I I I I I I I I I I I 

Db 1812 - AT T T GAG TAG T 

GAGGTTACTTTTGATCGGA 18 41 

Qy 3755 

TCTGTGCTTT C AAAAAAAAAAC AAAAAAAAAACC AC AC AC AC AC AT AAAAAAC C C AAC AG 3 814 
II I I I I I I I I I I I I I 

Db 18 42 GC AACATAAT 

GCTCATAGAA 1861 



Qy 3815 

G T C AAAAT AAAAG T T G AAC T T GAG TTACATTTAATT T AAAT AT AAAT GC AT T T T GAG AAA 3874 
II I II III 

I I I I I I 

Db 1862 -TC TTCTAGAAGAA 

GAGAAA 1880 



Qy 3875 

TGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGTACTGTAAAATAAGCTGTGGT 393 4 

II I I I I I I I I I 

Db 1881 CAAAGGGATTG- 

ATAAAATGCTGAGAA 19 06 



Qy 3935 CTATTTCCACTGTTTAATTTTCTACTCAGT- 

TCTACCAAATAGGATGTCATGTTTGACAT 3993 

I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 19 0 7 C TAG T G — AT TAT AT AT T T T TC T GTATTTACC- 

TGACAT 19 42 



Qy 3994 

TTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCTTAGAACTGTACTATAAGAA 4 053 



AATGTTCAAAAAGTA 196 4 



Qy 4054 

AACATTTCCCCTATGTATAATTATATGAATGTGATGTTTATTGCTTATTAATTTATAATT 4113 



1 1 1 1 1 1 I I I 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 I 

1965 AACACTT TAAGTTTGATGTGTTTTACTCTCTCATTGTT- 



TT 2004 



Qy 4114 

CAGTCATTCTC T AT AT AGG AC T T C T TAAAAT T T AGAAGGGAAAT C T AGC T AC T T CAAAT T 4173 

I I I I I I I I I I I I I I I I I I I 

Db 2005 AAGTAATT GCCAACTCAGAA

TACATCA 2031

Qy 4174 

GTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAAGGTTTTTCCAGGAAGATTT 4233 
I I I I I I I I I I I I I I I I I I I I 

Db 2032 TTCTTAGG CTGAAATTT- 

GTCTTTCCA 2057 

Qy 4234 

ACATTTAGGTTTAATATTTTTTTAGTTAGGTAGAGTTTTAAAAAATACTTGAGCCTGTCC 4293

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2058 TTTTTTA AGGTG AAATAGTACT ACCT-

TAC 2086 

Qy 4294 GTGATAAAGCTATAAAATTCAATAACTTTTTAGAATGTTAAATGA-

AGACACTGTTTCCT 4352 

I I I I I III III III I I I I I I I I I I I I I I I I I I I 

I 

Db 2087 GTGAT — AGCATACAAAGAAGAAAGC — 

TCTAGAAAGAGAAATTATGGAGAATGATTATT 2142

Qy 4353 AACATCAGTGAGATACATCTTTGAATTTAAACAT 

TCATATTTACTGAGTACCTACT 4408

III I I I I I I I I I I I II I I I I I I I I 

Db 2143 TAAAT 

TACAATTAAGGAAATGAGAATATGATCCCCTCTTCCGAGTTGC 2190

Qy 4409 AGGTACCAAGTACT — 

CTTTTAGGCACTGGAAATACAGTGATGGACAAAACAGGTAAAAA 4466
III III I I I I II I 

Db 2191 

CCACAAACTTGCTTCTTTGCTTT 2213 

Qy 4467 

ATCGCTGCCCCCTCAGAGCTGACATTCTGGGGTGGGAATTTCATTTTGCCACGTACTAAC 4526

I I I I I I I I I I I I I I I I I I III 

I I I I 

Db 2214 TGCTCC CTGTAAT AGAACTACTTTTCAACAAAT- 

CTAA- 2250 

Qy 4527 

GTTCTGCACAAAAGACAGGCTAGACTCTTGTCTAGATTGTTTAAAAGAAACTTTTCAAAT 4586

I I I I I I I III I I I I I I I I I I I I I 

Db 2251 -TTTTGCAC GGC ACCGTTAACCATA 

TTTTCAC — 2281 

Qy 4587 

TGGTTACATTAATTTTAGTTTATTTTCACAAGTAAAAATGGCTTTTTATTTAGATTCTTT 4646

I I I I II I I I I I I I I I I I I I III II 



Db 2282 TACAGCAAACTTAGT-

GCTATGGGTTTTCTTTTTCTTTGTTT 2322 



Qy 4647 

CTGTCCCAGGCTGTTGATCTTAAAACTAGTTGATTTAAAGAGTTTTTTTGCACAACATTT 4706



Db 2323 TTTTC TTGATC ACTTGT — ATAGGAA 

ACAACATTT 2355 

Qy 470 7 — C AAT TATATTTGT G AAC T T AGAAAT T AAC T T AC AAT C T AAC C AGC C AT CAT AT - 

CATA 4763 



Db 2356 TCCAGTGTTATTTG CATA 

TATATTTTGTCCTTCCAATATATGCATT 2401

Qy 4764 TCCTATCAGGCTAGATATCTCAATAGTAGA-CTGAATAC-

AAAGCTAATTTTTT 4815 



Db 2402 ACAGAT

GAAAATTAAATGTTATACCTGAATTCTTGGGTTGGGGCCAAAATATT 2454

Qy 4816 T T AC AT G T C AAT AT TGGC AC AAAC TGGAATGAAAGAAT AG T T T GAT T C AG AC C — 

TGCTC 4873 



Db 2455 AAGC-TGAAAATAATG CTGGTGTGGA 

TTTGTTTTAAAACAAAGCTT 2499 

Qy 4874 CACTATGTGTTGCTAAAACACATGCTATGAGCA-

CTCGAGGAAACACTATATTTTTTCCA 4932

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2500 TATTATG AACATGC-ATGTGAATCTGGATATTGC-

CTCTTATTTTTAAG 2546 

Qy 4933 

AAAAATATGTGATTATATATGTTAAAGTATAGATAACATTTCACACTTGGATACATATGT 4992 



Db 2547 AAAA TGGTTCTGTG AAAAGT 

GAATGATATGT 2577 

Qy 4993 GCATTTAC- 

TGTATTTCTTGGTAAGCATATTTTTGGGGGAAAGTGCTGCTGATATGATAC 5051



Db 2578 ATTTTTCCAAATGCTTCATGGTTAG GAGT-CTTC 

AAGTTCC 2617 

Qy 5052 

AAGTAGACAAAATTTAAATGAAATTTTGTCACATTCTATGGAAAATGGTTTCTGGTAAAC 5111 



Db 2618 ATGTTCCCCAGATTTGA- 

GATATAC 2641



Qy 5112 T- 

GAGAAGGATATTAAAATAAGTGGCTTTTTTCTGGGCTACCATTATTGTTTGATTTCTC 5170



Db 2 6 42 TAAAGAAAGAAATTCAA — AAGTAGCTATTT — GGGGC — 

CCACAAAAATAACTATTATT 2695 



Qy 5171 TTTGTCAAGTGTATAGAACCTGTCA- 

TACATTCATGATAAGTAGCACTGAAAAATTACTC 5229 

I I I I I I I I I I I III I I I I I I I III 

Db 2696 TTAGCC T T AG AGC C T T AC AC TTGTTTCAT G AAG AG AA — 

AGGACTTGCAT 2743



Qy 5230 ATTCAAATTTCCC 

CTGGGCACGTAAGGCAAAATATTGCCGGTTGGGATTTCAAGGT 5285

I I I I I I I I III II I I I I I I I I I I I I 

I 

Db 2 744 AACCAAAATAAACAAAGCAAGACAAATTA AAAATAT GTGGGGG- 

AGAGAT 2792



Qy 5286 

CAGTGACGACGCATTTCCTCCCAGTACAGACCCCCCAGCCCCCCTTGCTGGACATGGGGA 5345
I I I I I I I III II I I I I I I I I I I I I I I I 

Db 2 793 CAGTGAAAAGTGGTTT — TCTTAATGCAGCCCTGCTGGTCCCCATT 

AAC 2839 

Qy 5346 

GGCAGAGAGTCACTTGACCATCCAGAAATACATGACTACAAGTCCTTTATGACTGTTTGC 5405



Db 2 8 40 AATTGCTTGAAATTC ACATGGATGTAA AATTATAA 

TTGT 2878 

Qy 5406 

CATTTTTTTTAATGGTACTTAGTATTTTGATCAAACTTTAGTCTCCAGAACTAAACAAGT 5465 

II I I I I I I I I I I I I I I I I II III 

Db 2 8 79 CA GGATCTTATTCAGATGATCT TTTA 

AGGTTTAA 2912 



Qy 5466 

CCCTAAGTTTCCTTATTTTAATTTACTGTGACTAGATTTGAAGCAAATAAATACTCCAGA 5525
II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2913 — CTGGTTTTGCTTTTGTTTATCTA-TATGTCAAAAT 

ACTTGTAAAT 2956 



Qy 5526 

TCCATGCAGCTAGAACACACTTGCTTCCACTACTAAATATACAGGGTATGTCCTAACATG 5585
II I I I I I I I I I II I I I I 

I I 

Db 2957 TG GGAACAAACTTCTCTCAGCTTCT 

TG 2983 



Qy 5586 

GAGTTAACTGGAATAGCAGTACACTAGCAAGTATCTGTGAATCCTTAGCACTGACGGGTT 5645
I I I I III III I I I I I I I I I 

Db 29 8 4 AAGTT GTTC AACT 

ATCCTTGCCACTGGAAG 3013

Qy 5646 

AACAGAAATGCTTTGGTAATACCTACTTAGTTAATTGGAGGAAGTAGTAAATAAACATTA 5705



Db 3014 ACCAAACAAGGTTT TCACTGCTTTTTCTTTT- 

ACATAA 3050



Qy 5706 GGTAATCTGCAGATTACTTCAAATGGGAAAAATCTTTTTG- 

TAGACTCTATAGTACCCTC 5764

II III I I I I I I I III I I I I I I III I I I I I I I 

I 

Db 3 051 — TATGCTGAGAATTATTTCTTATG 

C T T T T T AC T AC AAAC AAAAT T AC T C AC 3100 

Qy 5765 TCTATTCACTAGCTTCTGAA — 

AAGGGAGGAGTATTTTTAGTTTGACAATTTAATAATTT 5822

I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 3101 CTGGATTAAAGATTAAGGCCTTAATCTGTTTAGATTATC 

TTT 3142 

Qy 5823 

AAAAACAAGACATCTCCAGGTAGGAAAAAATGAAAGCTATTTCATGCAAACATTATCTAA 5882
I I I I I I I I I I I I I I I I I 

I I 

Db 3143 A ATCTCC AT G AAA TCGTG 

AA 3162 

Qy 5883 

TTTAGCTTAAAAGTGAAAGTGGTAATACTGTTGGTTTCTGTAAATGTTGCAGGGTTTTAA 5942

III I I I I I I I II I I I I I I I 

Db 3163 ATAAG ACAAG- 

AATAGTGTTTCAGCTGTAGG 3192 

Qy 5943 

ACTTTATAATTACTTTAATATTTTTGATAACTAGAAATCTAGTATTGCCATAAAGGAAAC 6002

I I I I I I I I 

I 

Db 3193 CCATT TTAC 

AGC 3204 

Qy 6003 

TAAGTGCCCATCAAAGATTTGTTTGGTATAAATAAAGAATTATTTGTTTTGTTTTCAATG 6062

I I I I I I I I I I I I I I I I I I I I I III II II 

Db 32 05 TAATTGCCCATAAA TTGT AGCATTTATTGACCTG 

AAGT 3242 

Qy 6 063 ACAGTAAGCTACAAATCATGA- 

TGCTTAAAAACTTTCTAAAGATGAATTGTGTGGCAGTG 6121 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 32 43 AC — TAAGCTAATTGTCTTGACTACTCAAAGCC C CTGAATTGT- 

TGTCA 32 8 8 

Qy 6122 

ATTGGTCTGTTTGTGGAGAATGTATGAAAGCTATTAATATTCTAGAATAGATTAATAAAT 6181 

II I I I I I I I I I I I I I I I I II 

I 

Db 32 89 ACTTTCCCCTTTGTG TTGTGT AGCCCTAA 

CGTCATT 3324 



Qy 6182 

TGGCTATGTTGTTCCAATGAATGTACAGCACTTCCATTAACTTTTGAAAGCAACACAGCC 6241

I I I I I I I I I I I III I I I I I I I I I I I 



Db 3325 TAGCT-TGTTG-TCTGATG- 

GACACCTCC 336 0 



-CCTCCAGTAG- 



Qy 6242 

TTAAACTCAATGCTTTTGCTTTATGACATGGGAATGTTCTGTCATCAATGGAGTGTATTC 6301
I I I I I I I I I I I I I I I I I I 

Db 3361 GATGGAGC 

TTTGATTTCTGAGCAGCGAAAGCTC 3393 

Qy 6302 

TTGTAATAGAATTCTTTATATCGTTCTCAATTCTATAGACTTTCAAGCCTATGTATGAAT 6361 
I I I I I I I I I I I I I I I I I I I I I I 

Db 339 4 CCTTCCTAAGATGCATCTCGC ATAGGCT 

GCCTATG 3 42 8 

Qy 6362 

ATGAAGGGGTTTTTTTTTTTTTGCTTTGTTTTCTTTTTAGATTTTGTACATTCCATCTTT 6421
I I I I I I I III I III 

Db 3 42 9 ATGAAGG ACCGTG CACC 

TCCACTCCAAC 3 456 

Qy 6422 ATAGGTCTGT- 

TTCATATGTTTTATGTATAGAACACTAAGTCTTGCACTCTCTGACATTG 6480



Db 3457 AGAGTGCTGAGTTTAAAAGTT GAC-CTGTGT-TTGTAATTTC — 

ACTTTC 3502 

Qy 6 481 ATACTGATATATTCTCGTCATTTG — 

TTCTTTTATGAATCAAAATGTTGACTGCCTATTT 6538 

I I I I I I I II I I I I I I I I I I I I I I I I 

Db 3503 ATCTTGCTTAATAAATATCTGCTGGATTCTTTCAAAAAAAAAAA 

AAAA 3550 

Qy 6539 AAAGAAAAGAATGAACGCTGTGCATCAAAGTGTTTGTATGTTCGTAGCTACA 6590 

I I I I I I I I I III II II I I I I I I I I I I I I 

Db 3551 AAAAAAAAAAAAAAAAG GCCACA TGTGCTCG-AGCTGCA 3588 
Qy 639

CTCCAGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAG 698
I I I I I I I I I I III II II I III 

Db 3589 CTGCAGCTCGAGC ACATGTGG 

CCTTTTTTTTTTTTTTTTTTTTTT 3545 



Qy 699 TTCATCTCT G AAAT C AT T C AT C AG 

TTAACCAACCAAAAGGCCAGCATCGTGGATGAC 755 



Db 3544 TTTTTTTTT G AAAG AATCC AGC AGAT AT T T AT T AAGC AAG AT G AAAG 

TGAA 3 49 4 

Qy 756 

ATTCATTCCACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTT 815 



Db 3 493 AT T AC AAAC AC AGGTC AACTTTTAAACTCAGCA 

CTG 3456 

Qy 816 

ATGGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACT 875
I I I I I I I I I I I II I III I I I I I I I I 

Db 3 455 TTGGAG-TGGAGGTGCACGGT — CCTTCATCATAGG 

CAGCCTATGCGAGA 3 409 

Qy 876 

CTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAAC 935 



Db 3 40 8 -TGCATCTTAGGAAGG-GAGCTTT CGCTGCTCAGAAATCAAAGC- 

TC 3365 



Qy 936 

CATGGCACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAG 995 

III I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I 

I I 

Db 336 4 CAT CGGAGGT GTCCTACTGGAG — GCATCAGA 

CAACAAGCTAAATGA- 332 0 

Qy 996 

CTGGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTG 1055 

II I I I I I I I III I I I I I III I I I I 

Db 3319 CGT-TAGGGCTACAC — AAC AC AAAGGGG AAAG T T G AC AAC 

AATTC 3277 



Qy 1056 GAAACCGAGGGGC- 

TGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGC 1114 
I I I I I I I I I I I I I I 

Db 32 76 AGGGGCTTTGAGTAGTCAA 

GACAAT 32 52 



Qy 1115 

CGTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCC 1174

I I I I I I I I I I I I I I 

Db 3251 — TAGCTT AGTACTTCA — 

GGTCAATA 3229 

Qy 1175 

CATGTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGC 1234
III I II I I I I I I I I I I I I I I II 

I I 

Db 322 8 AATG — CTACAATTT ATGGGCAA TTAGCTGTAAAATG 

GC 3192 

Qy 1235 

CTACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGA 1294

I I I I I I I I I I I I I I I I 

I 

Db 3191 CTAC AGCTGA — AACACT 

ATTCTTGT 3168 

Qy 1295 

CAACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTACCCT 1354

I I I I I I I I I I I I I I I I I 

Db 316 7 C TTATTTCACGATTTC 

ATGGAGATTAAA 3140 

Qy 1355 

GTCTCTGAGACTCTATGACCAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCG 1414 

I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 3139 GATAATCTA AACAGATTAAGGCC— TTAATCT 

TTAATCC- 3103 

Qy 1415 

ATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAG 1474

I I I I I I II II 

Db 3102 AGGTGAGTA 

ATTTTGTT 3086 

Qy 1475 CGGCCACGTCAAGCAGAAAGC-

TGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAAC 1533

I I I I I I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 3 0 85 T G TAG T AAAAAGC AT AAGAAAT AAT T C T C - 

AGCATATTATGTAAAAGAAAAA 3 035 

Qy 1534 

GAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATA 1593 
I I I I I I III I I I I I I I I I I I I I I 

Db 3 03 4 G — CAGTGAAAACCTT GTTTGGTCTTCC- 

AGTGG C 3 0 03 

Qy 1594 

AAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAATTG 1653 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 3 0 02 AAG G AT AG T T G A- AC AAC T T C A AGAAGCT 

GAGAGAAGTT 2965 



Qy 1654 

CAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTT 1713 



1 1 1 I 1 1 1 1 1 I 1 1 1 1 

1 1 

Db 296 4 TGTTC CCAATTTACAAGT- 

ATT 29 44 

Qy 1714 

TTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACATC 1773

I I I I I I I I I II III I I I I I 

I 

Db 2 9 43 TTGACATATAGATA AACAAAAGCAAAAC 

CAGTTAAAC 29 0 7 

Qy 1774 

CCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCCG 1833 

I II I I I I I I I I I I I I I II II I I I I I I 

I I I 

Db 29 06 CTT AAAAG AT CAT — C TGAAT AAGATC C T G AC — AAT T AT AAT T T T - 

ACATCC- 2 859 

Qy 1834 

ATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTG 1893 
I I I I I I I I I I I I I I I I I I I I I I III 

Db 2 858 ATGTGAATTTCAAG CAATT 

GTTAATGGGGACCA 2 826 

Qy 189 4 TGGACCGCAATGGGC- 

ACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCTTC 1952 

III I I I I I I I I I I I I I I I 

I I I 

Db 2 825 GCA — GGGCTGCATTA AGAAAACCA 

CTTT 2799 



Qy 1953 

CAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTT 2012
III I I I I 

Db 2 79 8 TCACTGATC 

TCTCCC 2784 

Qy 2013 

GCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTCCAT 2072

I I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 2783 CCACATATT TTTAATTTGTCTTGCTTTGTTTATTTT 

GGTTATGCAA 2 73 8 

Qy 2073 

AATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAAT 2132 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 73 7 GTCCTTTCTCT TCATGAA ACAAGT 

GTAAGGCTC 2 705 

Qy 2133 

GGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAAAC 2192 
I I I I I I II II III II 

Db 2 70 4 TAAGGCTAAAA 

TAATAGTTATTT 26 82 

Qy 2193 

ATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCATTT 2252
I I I I I I I I I I I I I I II I I I I I I 



1 1 1 1 

Db 26 81 TTGTGGGCCCC AAATAGC TAC TTTTGA 

ATTT 2651 

Qy 2253 TTGTCCTACAT 

TAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCCCTA 2309

I I I I I I I I I I I I I III I I I I I I I I I 

I I I I 

Db 2650 CTTTCTTTAGTATATCTCAAATCTGGGGAAC AT G G AAC T T G AAG AC T 

CCTA 2600 

Qy 2310 ACTTCAGATGGTCATGTTG TGGTTGCAGACTCTGGAAATCACTGTTTCAAAG- 

TCT 2364 

II I I I I I III III I I I I I I I I I I I I I I 

I 

Db 2 599 AC CATGAAGCATTTGGAAAAATAC-ATATCATTCACT- 

T T T C AC AG AAC C 2 552 

Qy 2365 ATCGATACTTACAGTAATGGTGGGCAGGTGGATA-CCCGCTTC- 

CATGGTCTTGCACTAT 2422 

II I I I I I I III I I I I I I I I I I I I I I I I I III I 

I I 

Db 2 551 AT — TTTCT T AAA — AAT AAG AGGC A ATATCCAGATTCACATG — CATGTTC- 

AT 2504 

Qy 2423 

AAACTGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAG 2482

II I I I I I I II I III I I I I I I I I I III 

Db 2503 AATAAAGCTTTGTTTT AAAAC AAAT CCA CACCAGC AT TAT T TT — 

CAGCTT 2455 

Qy 2483 

GAATCATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACT 2542 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 454 AATATTTTGGC-CCCAACCCAAG AATTCAG-GTATAAC AT T T 

AAT 2412 

Qy 2543 TATCCAATTTCTGTATTTCACCTTTAGG — 

GTTAAAAAAAACTCTTCTACTGAATCTATA 26 00 



Db 2411 TTTC 

ATCTGTAATGCATATATTGGAAGGACAAAATATATATGCAAATAACACTGGA 2356 

Qy 26 01 AAAACTGCAGTTT — TACATCTGTGAAC 

T ATGGC T T AAGGGAC AGGAT T T ATGT AG 2 65 4 

III II I I I I III I I I I I I III I I I I 

I I I 

Db 2355 AAA— 

TGTTGTTTCC TAT AC AAG T G AT C AAGAAAAAAAC AAAGAAAAAGAAAAC CCATAG 2298 
Qy 2655 CTAAACTAATTTTGC — 

AAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTAT 2 712 

I I I I I I I I I II I I I I I II I I I I I II 

I I 

Db 229 7 C ACTAAGTTTGCTGTAGTGAAAATATGGTTAACGGTGCCGTGCAAAATTAG- 



Qy 



2 713 TCGTTAATCCTGTGAATGGTAG — 



CTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACA 2770

III I I II I I I I I I II I I II I I I I I I I I I III 

I I 

Db 22 44 TTGT T G AAAAG TAG T T C T AT T AC AGG G AG CAAAAGCAA 

AGAAGCA 22 0 0 

Qy 2771 

AAATCTATTGTAGTTATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTA 2830

I I I I I I II I I I I I I I I I I I I I 

I 

Db 2199 AG TTTGTGG GCAAC TCGGAAG 

AGGGGATCATAT- 216 7 

Qy 2 831 ACCTCACTTTTACAGTAGGTATTACTCTTGTGACATTTTTTTGGTTATCA- 

ACAACTAAA 2 8 89 

I I I I I I I III I I I I I III I I I I I I I 

I I 

Db 2166 TCTCATTTCC TTA — ATTGTAA 

TTTAAATAATCATTCTCCATAA 2125 

Qy 2890 

TATAAATTACTTTGGAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCA 2 9 49 

II III I I I I I I I I I I I I I I 

Db 2124 TTTCTCTTTC TAGAGCTTTCTT 

CTTTGTAT 2095 

Qy 2950 GCCCTCTGGAGTTCAGAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTG 

GGTTA 3 0 06 

II II I I I I I I I I I I I I I I I I I I I I II 

I 

Db 2 094 GCTATC ACGTAAGGT — AGTAC-TATTTC 

ACCTTAAAAAATGGAAA 2 052 

Qy 3007 

GACAAAGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCA 3066

I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 2 051 G AC AAAT T T C AGC C T AAG AAT GAT GTATTCTGAG TTGGC 

AATTACTTAA 2 0 03 

Qy 3 06 7 AACAAAGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTAC- 

AA 3125 

I I I I I I I I I I I I I I I I I II 

I 

Db 2 0 02 AACAATG AGAGAG 

TAAAACACATCAAACTTA 19 72 

Qy 3126 AAGTAATGCTACCTTTTGGGAAACAAACTGCC 

CCGTTAACTCCAGATCATTGCACTG 3182 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1971 AAGTGTT — 

T AC x T T T T G AAC AT T AAAAT AAAT G T C AGGT AAAT AC AGAAAAAT 192 0 

Qy 3183 GAATGTAATCAAGAAAGTTAGTCATGTTTTATGTACCATGTTTTCACAC 

GTGTCT 3237 

I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1919 — ATATAATCA CTAGTTCTCAG 

CATTTTATCAATCCCTTTGTTTCT 18 76 



Qy 3238 CTTCTCTTCGAC-TTCCTGAAAGCGAAAGCTTTACCTCC- 

TGCAAATGTCAGCACATGTA 3295 

I I I I I III III III I II I I I I I I I I I I I I I I 

I I 

Db 18 75 C T T C T T C T AG AAG AT T C T AT G AGC AT T A — TGTTGCTCCGATCAAAAGTAACCTCAC- 

TA 1819 

Qy 3296 GTAGGACACCAGTATCCTAG- 

GACAGAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGT 335 4 

I I I I I I I I I I I I I I I I I I III 

I 

Db 1818 CTCAAATACC ATCATAGCTTCAATGTGCATTA 

TAG T GAT 178 0 

Qy 3355 

GTCAACCAAAGGCATGTGATTGATTAATGATTCCCCCTTAGAAAGCAAGTGTTACCAAAG 3414

III II I I I I I I I I I I I I I I I I 

Db 17 79 TTCA GT AGATTCACACT 

CAAATCTTTTCA 1751 

Qy 3415 

TTGTGTTATCTTGAAAGCATTACAGGTAAGGGCATGTTATGGTTATTTATCATTGTTTAA 3474

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1750 — GTGTCAT ACATTT ATTAAGCCA TAAAGTTATGAAACC 

CTCA 1710 

Qy 3475 

TGAATAGTAGAGGTGTCAAGGGACTATGTATACATGATTAGGGTAAGATAGAATGTATTA 353 4 

I I I I I I II III I I I I I I I I I I I I I I I I I 

I 

Db 1709 GCTCTTGTG GAG AG AT C T GGGC AG AAT T TAT AC AAG A AGAA- 

GTTTAA 1663 

Qy 3535 

TATATATATATATATATACACACACACATATATATAGCTGAATCTTTGGTGTATTGAAAT 359 4 
I I I I I I I I I I I I I I I I 

Db 1662 TTTAT C AC T T AAAAAT 

CATCT 1642 

Qy 3595 

AGGCAGCACTCTGAAAGACAGAAGCTTCGTCCAGCCACTCTTCAGCACATTCCTTTACTA 365 4 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 16 41 CATAATTGTGGTA-ACAGA TTTTTAAAGTAAC 

CCTTTGGAA 1602 

Qy 3655 

AGCAGTTTAAAGCCGTCCTAGTGGAGCAAGCCCTAAAGCAGATTTAATTTTTGCCATTTT 3 714 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 16 01 AAAATTTTA CGTGATA CCCAAAGGCA 

CTAGTT 1570 

Qy 3715 

CCAAGAATGACGGTGGTGGCTTTTAGTCAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAA 3 7 7 4 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1569 CACATAAGG— GGT-GTGGCT GAAAAAG 

CAAAACAAAG 1535 



Qy 3775 

ACAAAAAAAAAACCACACACACACATAAAAAACCCAACAGGTCAAAATAAAAGTTGAACT 3 83 4 

II II I I I I I I I I I I I III I I I I I I I I 

I I 

Db 153 4 CCATTGTG T AAG T AAAGAGC AC AC AAAAAA AGG— 

ATATTAAAAGGAAACCT 1485 

Qy 3 835 TGAGTTA — 

CAT T T AAT T T AAAT AT AAATGC AT T T TGAGAAATGTTAAGAACAAT T T AGT 3 892 

I I I I I I I I I I I I II I I I I I I II 

Db 148 4 GTTAAGCTTTAAAGTTA TTCTAAAAATG 

GCACTTT — T 1449 

Qy 3893 CAATCGTTCATCTGTCAT — TGGTACTGTAAAATAAGCTGT 

GGTCTATTTCCACTGT 3947 

II I I I I II II I I I I I I I I I I II I I I I I I 

I 

Db 1448 CA— 

CGTTGGTATAAAATGAAAGTGCTTTAAGATGAAATTTTTATGTATTTTTTTAAAGC 1391 
Qy 3948 

TTAATTTTCTACTCAGTTCTACCAAATAGGATGTCATGTTTGACATTTTTGATAGTGACT 4 0 0 7 

I I I I I I II II I I I I I I I II I I I I I I 

Db 139 0 TGATTTTTAACCCC CTG-AAAATAGGGGGT TATAACA 

AG 1353 

Qy 4008 

TTGGGGTCTTCTTCACTGAAAGCACCTTAGAACTGTACTATAAGAAAACATTTCCCCTAT 4 06 7 

I I I I I I I I II I I I I I I I I 

I I 

Db 1352 AAC AC T GAT AG AC T C T AACT 

CATAGCGGCAAT 1321 

Qy 4068 

GTATAATTATATGAATGTGATGTTTATTGCTTATTAATTTATAATTCAGTCATTCTCTAT 412 7 

I I I I I I I I I I II 

I 

Db 132 0 AAGTAGTGGATGTCACTAA 

AATGCTCTGT 1292 



Qy 4128 

ATAGGACTTCTTAAAATTTAGAAGGGAAATCTAGCTACTTCAAATTGTCTGTTAAATTTA 418 7 

I I I I I I I I I I I I I II I I I I I I I I 

I I 

Db 1291 GTGTGTCT — TTATTTTTTAGTA TTTTTAATGT- 

T GAT AG AC T T G 1250 

Qy 4188 TTATGCCCAAATCAACCTCTGAAAAAAGGTTTTTCC — 

AGGAAGAT T T AC AT T T AGGT T T 42 45 

II I I I I I I I I I I I II I I I I I I I I I I 

Db 1249 CTTT ATCTATCT 

GTGTATCATTAGTGAGATTTGCTTTTC 1211 

Qy 42 46 AATATTTTTTTAGTTAGGTAGAGTT — 

TTAAAAAATACTTGAGCCTGTCCGTGATAAAGC 4303 

III III III I I I I I I I II I I I I III I II 

I I I I I 

Db 1210 AATTCTTTACTAGAAAGTTACAGCTCATTTAAAATAGCTGGTAAAT-ACAG 



AAAGC 1156 



Qy 43 0 4 T AT AAAAT T C AAT AAC T T T T T AG AAT G T T AAAT G AAG AC AC T G T T — 

TCCTAACATCAGT 4361 

I I I I I I I I I I I I I I I I I I I I III 

Db 1155 AT AAAT AT AC AG T AAGT T T 

AAAC AC T GAT T G T AC T AA 1119 

Qy 4362 

GAGATACATCTTTGAATTTAAACATTCATATTTACTGAGTACCTACTAGGTACCA A 4 417 



Db 1118 ATGCAAC AAT AC AAAGAAGC AAAT - 

GGAACACAAATGGTAACACAATA 10 72 



Qy 4418 GT AC TC T T T T AGGCACTGGAAATACAGTGATGGAC AAAAC A- 

GGTAAAAAATCGCTGCCC 4476 

I I I I I I I I I I I III II I I I I I I I 
Db 10 71 AAACTGTATTA — CATTTGTGCTTCAAATAT- 
T AC AAT AC AT TAT AT AC AAT AG 1021 



Qy 4477 

CCTCAGAGCTGACATTCTGGGGTGGGAATTTCATTTTGCCACGTACTAACGTTCTGCACA 4536 
I I I I I II III I I I I I I III 

I 

Db 102 0 —TCC AAAAT AAAC AT C TCA TGCCAAG TGAC — 

ACAAA 987 

Qy 4537 

AAAGACAGGCTAGACTCTTGTCTAGATTGTTTAAAAGAAACTTTTCAAATTGGTTACATT 4596 
I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 9 86 AACGAATAGCAAGC AAAAAAATCCAGCCATAT — 

GTACATT 9 48 



Qy 4597 

AATTTTAGTTTATTTTCACAAGTAAAAATGGCTTTTTATTTAGATTCTTTCTGTCCCAGG 4656 
III I I I I I I I I I I I I I I I I 

Db 947 ACTT TTTTC 

TTTAATAATAAACATTC 922 



Qy 4657 

CTGTTGATCTTAAAACTAGTTGATTTAAAGAGTTTTTTTGCACAACATTTCAATTATATT 4 716 
III I I I I I I III I I I I I I I I 

Db 921 AACTCTGAACTGG GGCAATTT CACTACA- 

TACAG 8 89 



Qy 4717 

TGTGAACTTAGAAATTAACTTACAATCTAACCAGCCATCATATCATATCCTATCAGGCTA 4 7 76 

I I I I I III I I I I I I I I I I I I II I I I I I I 

I I I 

Db 888 -ATGCAGTCCGCCTTTTA — TTCCATATAACCATCCTGCCT TCCTATAT- 



Qy 4777 - 

GATATCTCAATAGTAGACTGAATACAAAGCTAATTTTTTTTACATGTCAATATTGGCAC 4 835 
I I I I I I I I I I III I I I I I I I I I I I I I 

I 

Db 839 CGCTAT-TCAGTAGGTTCCTGTGT CAATTCTTATGTACATG 

TGGAGC 79 4 



Qy 4836 

AAACTGGAATGAAAGAATAGTTTGATTCAGACCTGCTCCACTATGTGTTGCTAAAACACA 4 895 

III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 793 AAAAAAG - AT AAAAG AAT GTAGTGCTC TGTATT- 

CTTA 758 

Qy 4896 

TGCTATGAGCACTCGAGGAAACACTATATTTTTTCCAAAAAATATGTGATTATATATGTT 4955 
I I I I III 

I I I I I 

Db 757 ATGGG GTG 

ATGTT 745 

Qy 4956 

AAAGTATAGATAACATTTCACACTTGGATACATATGTGCATTTACTGTATTTCTTGGTAA 5 015 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 744 G ATAGATCA GCGAACCTGGG GTGCATAACCT-TCTTTCAT — 

TAA 703 

Qy 5016 

GCATATTTTTGGGGGAAAGTGCTGCTGATATGATACAAGTAGACAAAATTTAAATGAAAT 5 0 75 

I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 702 AC— CTTCCTCGTAGTCCGT-CTG — GCAAAGGATCATGT 

TATTCTTTAGGAAAAA 652 

Qy 50 76 TTTGTCACATTCTATGGAAAATGGTTTCTGGT 

AAAC T G AG AAGG AT AT T AAAAT AA 5131 

I I I I I I I I II I I I I I I I I I I I I I I I I I 

I 

Db 651 TTTGTCTCCAAC 

ACAAAATCTCTGATTACAAAGCTGACATGCAAAGCAGTCCAG 59 8 

Qy 5132 GTGG 

CTTTTTTCTGGGCTACCATTATTGTTTGATTTCTCTTTGTCAAGTGTATAGA 518 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 59 7 GTGGTAAACATTGTCCTTGGCACGCATCA CCATCTCAAAGGCAGGGAT 

GA 548 

Qy 5188 

ACCTGTCATACATTCAT G AT AAG T AGC AC TGAAAAAT T AC TC AT TC AAAT TTCCCCTG— 5245 

I I I I I I III II I II I I I I I I I I I I I I I 

Db 547 GCT — TACTACAGGCA-GCGCAGTTTCCC GTTACAC 

CAAAGAGCCTCAGAT 50 0 

Qy 52 46 -GGCACGTAAGGCAAAATATTGCCGGTTGGGATTTCAAGGTCAGTGACGA- 

CGCATTTCC 5303 

I I I I I I I I I II II I I I I I I I I I I I I I I 

I I I 

Db 499 AGTCTC-TGCGACAAAGGAT 

AAGATTAGCTTTAGTGTACAGGGTGGAGCCCACCTCC 444 

Qy 53 0 4 TCCCAGT- 

ACAGACCCCCCAGCCCCCCTTGCTGGACATGGGGAGGCAGAGAGTCACTTGA 5362 

I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 4 43 CCCAAGCGACAGTCACAGCAGGCACACTT C AGGCAG 

TCTT— 404 



Qy 5363 

CCATCCAGAAATACATGACTACAAGTCCTTTATGACTGTTTGCCATTTTTTTTAATGGTA 5 42 2 
I I I I I I I III II I I I I I I I III 

I I I 

Db 4 03 CATGCCAGTATT TGTC — CAGTGCCTTTAGAA GATACC 

GGTC 364 

Qy 5 423 C T TAG T AT T T T GAT C AAAC T T T AGTC TC C AG AAC T AAAC AAG T C - 

CCTAAGTTTCCTTAT 5481 

I I I I I I I I II I I I I I I III II II I I I I 

Db 363 C TTGATCTTTCGGTTGCAGCCAGCAC — 

AACCTTTCGGCTTGGTGTC 319 

Qy 5482 

TTTAATTTACTGTGACTAGATTTGAAGCAAATAAATACTCCAGATCCATGCAGCTAGAAC 55 41 
I I I I I I I I I I I I III I I I I I I 

Db 318 TGGCTGGACTGAGAGCATTTGTATA — 

CCAAAACTTT 284 

Qy 5542 

ACACTTGCTTCCACTACTAAATATACAGGGTATGTCCTAACATGGAGTTAACTGGAATAG 56 01 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 83 TCTCCTTTTTCTGCATGAGTTGACTTA GGCG-TGTCT- 

GAGTTG 2 42 

Qy 5602 

CAGTACACTAGCAAGTATCTGTGAATCCTTAGCACTGACGGGTTAACAGAAATGCTTTGG 5661 

I I I I I I I I III I I I I I I I I I I I I 

I I 

Db 2 41 CAGCAGCTTCGCAT TGAGCACCAAACCCAAAGGTAGAAGTAGAAGGG 

GG 193 

Qy 5662 

TAATACCTACTTAGTTAATTGGAGGAAGTAGTAAATAAACATTAGGTAATCTGCAGATTA 5721 

I I I I I I I III I I I I I III III I I I I 

I I 

Db 192 T CTCCTTGAT TTCGCTTAAGT GTGGACCT — GGTGC 

GCAGCCTA 151 

Qy 5722 

CTTCAAATGGGAAAAATCTTTTTGTAGACTCTATAGTACCCTCTCTATTCACTAGCTTCT 5 7 81 

I I III I I I I I I I I I I I I I I II I I I I 

I 

Db 150 CACCGCCGAGGACCGACTATTGTGAAGCCACTTTGGGAGCGGGTCGG 

AGTGGCGGCA 9 4 

Qy 5782 

G AAAAGGG AGG AG T AT T T T TAG T T T G AC AAT T T AAT AAT T T AAAAAC AAG AC AT C T C C AG 5 8 41 
I I I I I I I I II I I I I I I I I 

Db 93 GGGGGTGGGGGAAGGGATGAGGACGGCC 

AG AC AAG AC 5 7 

Qy 5842 

GTAGGAAAAAATGAAAGCTATTTCATGCAAACATTATCTAATTTAGCTTAAAAGTGAAAG 59 01 

III II I I I I I I I I I I I I I I I I I 

I 

Db 56 —AGG GCGCACACACGGAGCCC — CTCGCAGTGTGC 

AAAATGATGG 15 



Qy 5902 TGGTAATACTGTTGG 5916 

I I I I I I I I I 
Db 14 CG — AATCCAGTAGG 2 



RESULT 3 
US-10-478-914-4 

; Sequence 4, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 4 

; LENGTH: 3210

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-4 

Query Match 20.8%; Score 7966.5; DB 1; Length 3210; 

Best Local Similarity 38.3%; Pred. No. 0; 

Matches 2028; Conservative 0; Mismatches 1136; Indels 2131; 
Gaps 280; 

Qy 1 CCCAGTTGTCTGCGGGCTGCGGGGAGCTAAGTCCCCAGATTGGAGGAGGCTG — 

GCTCTG 58 

I I I I I I I I I I I I I I I I I I I I I I 

Db 2 CACTGTTGGC CTACTGG 

CCCAGGCCTTGCACACGCAGGTGCTC — 4 4 

Qy 59 

GTCTTCGATGCACAGGAGTGGCCGTTATGGAACGCAGCAGCAGCGTGCAGGGTCAAAGAC 118 

I I I I I I I I I I I I I I I I I I I I I I I III 

Db 45 -TCTTAG-TGCTC CCTGGCAAGGATGGGAAG 

GCATGTTCAGTCCTAGG- 9 0 

Qy 119 

AGCCGGCCCCCCATGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCCAAGTCC 17 8 

II II I I I I I I I I I I I I I I I I I I I 

Db 91 AGTAGG AAGGGGCAGAGGTGT 

TGATGGCCCCTACA 125 

Qy 179 TGTGGTGC-GCCAGATTGACAAGCAGTTTCTGAT — 

TTGCAGTATATGCCTGGAACGGTA 235 



Db 126 GAGCGGCCAAG — GACAAGGAGCTGCTGTTCGAAACAGCCTT — 

CCTGCTCCCCAA 177 



Qy 236 CAAGAATCCCAAGGTTCTCCCCTG — TCTGCACACTTTCTGC 

GAGAGGTGCCTGCA 2 89 

I I I I I I I II II I I I I I I I I I I I I I I I I 

I 

Db 178 CCTGCCTCCCA 

C C C AAC AGG T TTTGCATATACTC T AC TGGGAAGAGGGAC AC ACC 232 

Qy 290 

GAACTACATTCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGCCGCCAGACCTCCA- 3 4 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 233 CGACTGCATCACTGCCCTCCAAGT — CTCTC-CCTGCCCTGT CCAG- 

CATCCAG 2 82 

Qy 349 - 

TCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAAACCTGATG 4 0 7 

I I I I I I I I I I I I I I I I II II 

Db 2 83 GAGCACCCCTAG-TTGGGGAAGC TTCTGTGACTC 

CC 317 

Qy 40 8 GACGTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATC 

CTGGAGA 462 

I I I I I I I I I I I I I I I I I I I II II 

I I 

Db 318 — CCTACAACAGC CTAGGATGGA 

GTGGGGTTTGTGAACAAATGCAGAAGG 365 

Qy 463 

CAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGT-G 521 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 366 CAGT CTTAGGGAGGTCAGCTGACATGCCCCT 

GGCCTGTGG 405 

Qy 522 

ATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAG 581 

I I I I I II II II I I I I I I III III I I I I I 

Db 406 CTGGGAAGTAGCAG — AGG- 
CTAAGGTTCTTCCCCGCTCTGGGGTTGCCAGGAGTAG 459 

Qy 582 

CACGCAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTC 6 41 

I I I I I I I I I I I I I I III I I I I 

I I I 

Db 46 0 CACTGGATCAGTCA GGTGACAGGGCTCT 

CCTCTCTC 495 

Qy 642 

CAGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCA 69 7 
I I I I I I I I I I I II II I I I I I I I 

I 

Db 496 TGAGCAGGTCCGG-TGGCAGCCTTC AAGGTCGCCACG 

CC 533 

Qy 698 

GTTCATCTCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACAT 75 7 

II I I I I I I I I I I II I I I I II I I I I 

Db 53 4 G-TATTCCCTG TATGTCTGT CCCGAGGGGCAG- 

AACGT 569 



Qy 758 

TCATTCCACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTAT 817 

I I I I I I I I I I I I I I I I I I 

I 

Db 570 CACCCT CACCTGCAGGCTCT TGGGC- 

CCTGT 599 

Qy 818 

GGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACTCT 8 7 7 

III I I I I I I I I I I I I I I I I I III 

I 

Db 600 GGACAAAGGGCACGA-TGTGACCT TCTACAAGACG 

TGGTACCGCA 6 43 

Qy 878 

GCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAACCA 93 7 

I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I 

I I 

Db 6 44 GCTCGAGGGGC-GAGGTGC AGACCTGC TCAGAGCGCCGGCC — 

CATCC- 688 

Qy 938 

TGGCACGGAGACCGAGGTCCTACTGGT G AAG AAGC AG AT GAG C G AG AAG C T G AAC G AG C T 99 7 

I I I I I I I I I 

Db 6 89 — GCA ACC TC 

ACGT 70 0 

Qy 998 

GGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTGGA 1057 
I I I I I I I I I I I I I I I 

I I I I 

Db 701 TCCAGGACCTTCACCTGCAC 

CATGGA 726 

Qy 1058 

AACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCCGT 1117 

II I I I I I I I I I I I I I I I III 

Db 72 7 GGCCACCAGGCTG CCAACACCAGCCACG 

ACC 757 

Qy 1118 

TGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCCAT 117 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 75 8 TGGCTCAGC GCCACGGGCTGGAGTCGGCCTCCGACCACCAT — 

GGCAACT 8 05 

Qy 1178 

GTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCCTA 123 7 
I I I I I I I I I I I III I I I I I 

I 

Db 8 06 TCTCCATCACCA 

TGCGCAACCTG A 829 

Qy 1238 

CCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGACAA 12 9 7 

II I I I I I III I I I I I I I I I I I I I I I I I I I 

Db 83 0 CC CTGCTGGAT — AGCGGCCTCTACTGCTGCCTGGTGGT 

GGAGATC 8 73 



Qy 129 8 C AAG AAC GGC AC C T AT GAG T T T T T G T AC AC T G T C C AG AAGG AAG GGGACTT — 



TACCCTG 1355 

I I I I I I I I I I III I I I I I I I I I 



CGGAGCACAGGGTCCATGGTGCCATG 912 
Qy 1356 

TCTCTGAGACTCTATGACCAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCGA 1415 

III I I I I I I I I I I I I I I II 

Db 913 GAGCTG CAGGTGCAGACAGGCAAAGATGCA 

CCAT 9 46 

Qy 1416 TCCGC— 

TGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGA 14 73 

I I I I I I I I I I I I I I I I I I I 

I I 

Db 9 47 CCAACTGTGTGGTGTACCCATCCTCC 

TCCCAGGAGA 9 82 

Qy 1474 

GCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAAC 1533 

I I I I I I I I I I I I I I I I I 

Db 9 83 G TGAAAGT AAGGGACCAAC 

CTCTTG 1007 

Qy 1534 

GAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATA 1593 
III I I I I I I I I I I 

Db 1008 CCC CTTT 

TGGGTTCTC 1023 

Qy 1594 

AAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAATTG 1653 
I I I I I I I I I I I I I I I 

I I 

Db 1024 TGTTT TCTTCTG TCCTCATC 

CTG 1046 

Qy 1654 

CAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTT 1713 

I I I I I I I I I I I I I I I I I I I I II 

I 

Db 10 47 CACCCAG ACC-CTGTTTGGAACT CTGGCC 

TCAT- 10 78 

Qy 1714 

TTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACATC 17 73 

I I I I I I I I I I I I 

Db 10 79 CACCCC AAGCCCTCAG 

AAC 1097 



Qy 1774 

CCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCCG 1833 
II II I I I I 

I I I I I I I 

Db 1098 CCC CC GGTC 

CTCCTCC- 1113 



Qy 1834 



ATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTG 1893 



Db 1114 
TTTTCTG 1120 



Qy 1894 

TGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCTTCC 1953 
III I I I I I I I I I I I I 

I I I I 

Db 1121 CTGCT GCACAT C CCTTCTG 

CTTC- 1143 



Qy 1954 

AGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTG 2 013 
III I I I I 

I I 

Db 1144 CTCC TTGG 



Qy 2014 

CAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTCCATA 2 0 73 



Db 1154 CA-ATCCCCA GAAGC- 

CCACT 1172 



Qy 2074 

ATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATG 2133 

I I I I I I I I I I I I I II I I I I I 

I 

Db 1173 CTCCTTCCATC TGCTCTGGAG TC 

TCTGCTCCTCTT 1207 



Qy 2134 

GAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAAACA 2193 
II III I I I I I I I I I I I 

Db 1208 GA CTC — TCTGGAGTGGCTGTGCCT 

TGGCAGTG 123 8 



Qy 2194 TCATTGTGGCCGACTGGGGAAACAGCAGGAT-

CCAGGTTTTTGATGGGAGTGGATCATTT 2252

I I I I I I I I I I I II II II I I I I 

Db 1239 ACCTT-TGGCC AGGGCAAGTGCCTCATGACAGG 

TACTGGG 12 77 



Qy 2253 TTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCC-

AAGGCCTGGCCCTAAC 2311 

I I I I II I I I I I I I I II I I I I I I I I I I I I I 

Db 12 78 -TGCCC— CAGGCAGCTAAGTGCCGCCCTGCCCACCAGCCCCCTATGGCTTGG- 

GAA 1330 

Qy 2312 

TTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGATA 23 71 

II I I I I I I I I II II I I I I I I I I I III 

I 

Db 1331 GGCTGGGGGTCCTCTTG GCCAACAGGGTGAAACCCCATCTCTA 

CTAAAAACA 13 82 



Qy 2372 CTTACAGTAATGGTGGGCA-

GGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGA 2430

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 13 83 CAAAAATTA — GCCAGGCATGGTGG-TGCGCGC — CTGTAGTC- 

CCAGCTACTCAA 1432 



Qy 2431 

ATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCATT 2 49 0 

III I I I I I I II I I I I III I I I I I I 

Db 1433 GAGGCT GAGGCAAGAGAATCGCTTGA ACCCG— 

GGAGTCAGA 14 72 



Qy 2491 

GGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTATCCAAT 2 55 0 
II I I I I I I I I I I I 

Db 14 73 GG TTGCAG — TGAGCCGAGATC GC 

GCCA— 149 8 



Qy 2551 

TTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACTGC— 2 6 0 8 
I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1499 — CTGCACTCCAGCCTGCTGATAGAGCAAGACTC— CATC 

T C AAAAAAAAC AAC C C 155 0 



Qy 2609 - 

AGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTT 2667
I I I I I I I I I I I I I I I I III II 

Db 1551 AAAATTTGC CTGGCAT-GGTGGCAGGC-ATCTG 

TAATCCCA 1589 



Qy 266 8 GCAAATC 

AAAC AGAC AC T T AAAAAAAC T AGCATATGTAAAGGTATTCGTTAATCC TG 2 72 4 
I I I I I I I I I I I I I I I I I I 

I I I I 

Db 159 0 GCTACTCGGGAGGCTGAGACAT — GAGAGCT-GC 

TTGAACCTG 1629 



Qy 2725 

TGAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGT 2784
II I I I I I I I I I II I I I I I I I I I 

Db 163 0 GGA — GGCAGAAGTTGCAGTGAG 

C C GAG AT C AC AC C A 1664 



Qy 2785 

TATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACA 2 8 4 4 
I I I I I I I I I I I I III I I I I I I I I II 

Db 1665 -CTGCACTCC AGCCTGGGT GACAGAGCGAGAC- 

TCTGTCC 1702 

Qy 2845 

GTAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGG 2 9 0 4 

III II I II I I I I I I I I I I 

I 

Db 17 03 CAAAAAATCAA- 

AAAAATCACTTTTG 172 7 

Qy 2905 

AAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTCA 2 96 4 

III I I I I I I I I I I I I I I I I I I I 



Db 172 8 G TAG AG ATGCA CTCTC-GCTATGTT GCC— CAGG 

CT 1760 

Qy 2965 

GAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTTG 3 02 4 

I I I I I III I I I I I I III 

Db 1761 GGTCTTGA ACTC CTGGGCT 

CAAT 1783 

Qy 3025 

TGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAG 3 0 8 4 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1784 — TGATCTT CCCACCTTG 

ACCTCCAAAGTGCT G 1814 

Qy 3085 

GGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGG 314 4 

II I I I I I I I I I I I I I I 

Db 1815 GG ATTACAGGTGTGA- 

GCCACC 1835 

Qy 3145 

GAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGT 32 0 4 

I I I I III III III II I I I I I I III 

I I I 

Db 1836 ATGCC TAGCCTCAGGGAATT C T T AT AAG AAC T C T ATG AAG - 

TAGG 18 79 

Qy 32 05 CATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCT- 

GAAAGCGAA 3263 

III I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1880 CAT CACCA TCTTCTCT — 

G T AT C C AT GG AAAG AG AG 1913 

Qy 3264 AGCTTTACCTCCTGCAAATGTCAGC--

ACATGTAGTAGGACACCAGTATCCTAGGACAGA 3321

III I I I I I II I I I I I I I I I 



Db 1914 G CCTAGAGATGTATGCTAACTTG 

CCCAAGCTCACA 19 48 

Qy 3322 

GAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAA 33 81 

I I I I I I I I I I III I I I I II I I I I II I 

I I 

Db 19 49 TAGCCCAGGGTAGC ATAGCTGG- 

GATGTTGAGCTGAGGCCGGAG GAG AAG TAG 2 0 00 

Qy 3382 TGATTCCCCCTTAGAAAGCAAGTGTTAC-

CAAAGTTGTGTTATCTTGAAAGCATTACAGG 3440

I I I I III II I I I I I I I I I I I I I I I 

I I I 

Db 2001 C-AGTCGCTGGCAGAGCACACAGGCTGCTCTGGG — GGATGAGCTGG — 

TGCGTTTAAGG 2055 

Qy 3441 

TAAGGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTA 35 0 0 

I III I III II II I III I I I I I I I I I I I 



I 

Db 2 056 AACAGGCCAGCACTGG— 

CATTCGCAAGCAGTGGGGAAGGGGAGAGATGCCGAGGTGGTC 2113 



Qy 3501 

TGTATACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACACA 356 0 

I I I I I I I I I III II I I I I I I I I I I I I I 

I 

Db 2114 AGTAT-CCTGACTT TCAG — 

AGGCCTTTTTTTGTTTGTTTTAATTTTTGCT A 2162 

Qy 3561 

CATATATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCT 362 0 

II I I I I III III II I I I II I I I I I 

Db 2163 GATTGATATTAAAAACTC ATGT GGAGGAACTC — 

AAGGAATGTTTAG 22 0 7 

Qy 3621 

TCGTCCAGCCACTCTTCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAG 36 8 0 

I I I I I I I I I I I I I I 

Db 22 0 8 AAGACCA AAAGTC — 

CCCAATG — A 222 8 

Qy 3681 

CAAGCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAG 3 7 4 0 

III I I I I I I I I I I I I I I I I I I I II III 

Db 2229 CAGGAACAAAAGCAAC— 

CAATTTTTAACTTTCTCTTCTCATTCCTGT 22 74 

Qy 3741 

TCAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACACACAT 3800



Db 2275 TTTCATTGATTTC- 

CCACATGTAGTCCT 23 01 



Qy 3801 

AAAAAACCCAACAGGTCAAAATAAAAGTTGAACTTGAGTTACATTTAATTTAAATATAAA 3 86 0 
III I I I I I I I I I I I I I I I I I 

Db 2302 — TTTGCTCAGGAAGTC 

TTTGGGGAAATTAAGGATCTTTGAA 23 41 

Qy 3861 

TGCATTTTGAGAAATGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGTACTGTA 392 0 
II I I I I I I I I I I I I I I I I I II I I I I I I I 

I I I I 

Db 23 42 -GC — TCTGAAATAGGT GATCAGGTTAGT GGT-GTCTGTCA 

GCTGTC 2384 

Qy 3921 

AAATAAGCTGTGGTCTATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAATAGGATG 39 8 0 
II I I I I II III I I I I I I I 

I 

Db 23 85 TAA GAGGT TGGAAAATGAACTACTCA 

AGATAG 2 416 

Qy 3981 

TCATGTTTGACATTTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCTTAGAAC 4 0 4 0 

III I I I I I II I I I I I I I I I I I 

Db 2 417 TCA CG AAAAT AC TGAAAG TTTG 



ATTTTTC- 



2445 



Qy 4041 

TGTACTATAAGAAAACATTTCCCCTATGTATAATTATATGAATGTGATGTTTATTGCTTA 410 0 

I I I I I I I I I I I 

I 

Db 2446 TTTCC 

ATATTTGA A 2 459 

Qy 4101 

TTAATTTATAATTCAGTCATTCTCTATATAGGACTTCTTAAAATTTAGAAGGGAAATCTA 4160

I I I I I I I I I I I I I III I I I I I I I 

Db 2 46 0 TTAATTTTT TCTGTTT — GAC 

TGGAAGGG 2 486 

Qy 4161 

GCTACTTCAAATTGTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAAGGTTTT 42 2 0 

II III III III I I I I I I I II 

Db 2 48 7 GT TTTTGTATAACTAA 

AACCTCAGCGCATA 2516 

Qy 4221 TCCAGGAAGATTTACATTTAGGTTTA-

ATATTTTTTTAGTTAGGTAGAGTTTTAAAAAAT 4279

I I I I I I I I I I III III III I I I I I I 



Db 2517 AAGGAGAT T T AAA AGGAGCACATGATTTAGTGGGTGGGCCATG 

AA 2561 

Qy 4280 ACTTGAGCCTGTCCGTGATAAAGCTATAAAAT--

TCAATAACTTTTTAGAATGTTAAATG 4337

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2562 ACTAGAG ATGGGATTTGGGGGTGAATTTGTCAATATCT 

GGATTTTAA 26 0 8 

Qy 4338 

AAGACACTGTTTCCTAACATCAGTGAGATACATCTTTGAATTTAAACATTCATATTTACT 4397
II IIII II II III I

I 

Db 26 09 TCCAGACATCTCTG CT AAC G AGC— 

CTTTGGT 2638 

Qy 4398 

GAGTACCTACTAGGTACCAAGTACTCTTTTAGGCACTGGAAATACAGTGATGGACAAAAC 4 45 7 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 639 AAGTCACTTC-AGATACTTT-TCCTCCTTTTTAC AAAGAGAGGGCTGG- 

CTTA — 2688 

Qy 4458 AGGTAAAAAATCGCTGCCCCCT- 

CAGAGCTGACATTCTGGGGTGGGAATTTCATTTTGCC 4516 

III I I I I I I I I I I I I I I II II I I I I I 

Db 26 89 — GTTATTTGCCAAAGCCCCTTCCAGGCCTGAATTCCACAAGTACGATTTAC — 

TGTA 2742 

Qy 4517 

ACGTACTAACGTTCTGCACAAAAGACAGGCTAGACTCTTGTCTAGATTGTTTAAAAGAAA 45 76 
I I I I I I I I I I I I I I I I I I 

I 

Db 2 7 43 GTGTCTTATCACTCT TTCATGTCACAATAGCGT 

GGAG 2779 



Qy 4577 

CTTTTCAAATTGGTTACATTAATTTTAGTTTATTTTCACAAGTAAAAATGGCTTTTTATT 4636 

III II I I I I I I I I I I I I I I I I I I I I I III 

Db 2 78 0 CAT T AG AG AAAAG — CCTAGACTTTTAGTTGA 

TAGCCAGTTGAAATATCATT 2 829 

Qy 4637 

TAGATTCTTTCTGTCCCAGGCTGTTGATCTTAAAACTAGTTGATTTAAAGAGTTTTTTTG 4696 

III II III I I I I I I I I I I I I III 

I 

Db 2 83 0 —GAT AGAATTTTAGTTTTAGGAAAAATTGGTTT 

GATTTCTAG 2 8 70 

Qy 4697 

CACAACATTTCAATTATATTTGTGAACTTAGAAATTAACTTACAATCTAACCAGCCATCA 4756

I I I I I I I I I I I I I I I I I I I II 

I 

Db 2 8 71 — CTTTATTACTATTAGGTATGTGAGCTTGG 

GCAA— A 29 0 4 

Qy 4757 

TATCATATCCTATCAGGCTAGATATCTCAATAGTAGACTGAATACAAAGCTAATTTTTTT 4 816 

I III I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I I 

Db 29 05 TCGCTTAATCTTTGAGTCTAGTTTTCTCTCAAAATGA — 

GAACATTAGGCTAAATGATTT 2962 

Qy 4817 

TACATGTCAATATTGGCACAAACTGGAATGAAAGAATAGTTTGATTCAGACCTGCTCCAC 4 8 76 

II I I I I I I I I I 

Db 2963 CCG AGTTT 

CCAGCT 2976 

Qy 4877 

TATGTGTTGCTAAAACACATGCTATGAGCACTCGAGGAAACACTATATTTTTTCCAAAAA 4936 

II I I I I I I I I I I I I I I I 

Db 29 77 AGTCCTAGAG 

TTCTATATTTCTACATA 3 0 03 

Qy 4937 

ATATGTGATTATATATGTTAAAGTATAGATAACATTTCACACTTGGATACATATGTGCAT 4996 
I I I I I I I I III II 

Db 3004 GTTGAATTAT TTTATCA 

TGCTG 3025 

Qy 4997 

TTACTGTATTTCTTGGTAAGCATATTTTTGGGGGAAAGTGCTGCTGATATGATACAAGTA 5 056 

I I I I I II II III III I I I I I I I I I I 

Db 3 026 TTGCTGGGGAATATGACTAACCCTTTT GAA 

GCTACTAATTTTAT 3 069 

Qy 5057 

GACAAAATTTAAATGAAATTTTGTCACATTCTATGGAAAATGGTTTCTGGTAAACTGAGA 5116 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 3 0 70 GTCGAGCTTTAAA GTC-CAT AATTGTTATCT 

TCAGAAA 3106 



Qy 5117 

AGGATATTAAAATAAGTGGCTTTTTTCTGGGCTACCATTATTGTTTGATTTCTCTTTGTC 5176 



I 

I I

Db 3107 A 

TCCAAATC 3136 

Qy 5177 

AAGTGTATAGAACCTGTCATACATTCATGATAAGTAGCACTGAAAAATTACTCATTCAAA 5236



Db 313 7 AATTTAATAAAATC-- 

GCTTTATAACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 3194

Qy 5237 TTTCCCCTGGGCACG 5251 

I I II II II 
Db 3195 GGCCACATGTGCTCG 32 09 
Qy 13 CGGGCTGCGGGGAGC--

TAAGTCCCCAGATTGGAGGA 47



Db 32 09 CGAGC- 

ACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCTGTTATAAAGC 3151 

Qy 4 8 GGCTGGCT CTGGTCTTCGATGCACAGGAGTGGCCGTTATGGAACGCAGCAGC- 

AGCG 103 



Db 3150 GATTTTATTAAATTGATTTGGACATACTGTAG- 

GTCAAATAATATTTTCTGAAGATAACA 3 092 



II I II I I I I I I I I 

-TATTATTTGACCTAC AGTATG- 



Score 7277; DB 1; Length 3210;
Pred. No. 0;
Conservative 0; Mismatches 1130; Indels 2434;



Qy 104 TGCAGGGTCAAAGACAGCCGGCCCCCCATGTCAGTGGT — CT — 

AGGATGGCCAGTGAAG 159 

I I I I I I I I I I I I I I I I I 

Db 3 091 ATTATGGACTTTAA 

AGCTCGACATAAAATTAGTAGCTTCAAAAGGGTTAGT 3 0 41 



Qy 160

GCACCAACATCCCAAGTCCTGTGGTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAG-- 217
II I I I I I II III I I I I I

Db 3040 CATATTCCCCA

GCAACAGCATGATAAAATAATTCAACTATGTAGAA 2995

Qy 218 -TATATGCCTGGAACGGTACAAGAATCCCAAGGTTC

TCCCCTGTCTGCACACTT 270

Db 2994 ATATAGAACTCTAGGACTAG -- CTGGAAACTCGGAAATCATTTAGCCT-

AATGTTCTCAT 2938

III I I I I I 



Qy 271 

TCTGCGAGAGGTGCCTGCAGAACTACATTCCTGCCCACAGTTTAACCCTCTCCTGCCCAG 33 0 
I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 293 7 TTTGAGAG AAAACTAGA C T C AAAG AT T AAGC G AT T — 

TGCCCA- 2 89 7 

Qy 331 

TGTGCCGCCAGACCTCCATCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAATTTCT 39 0 

I I I I III I I I I I I I I I I I I I I I 

I 

Db 2 896 AGCTCACATACCTAATAGTAATAAAGCTAG 

AAATCAAACCAATTT-T 2 851 



Qy 391 TCATCACAAACCTGATGGACGTGCTG- 

CAGCGAACTCCAGGCAGCAACGCTGAGGAGTCT 449 

III I I I I I I I I I I I I I I I I I I 

I I 

Db 2 850 TCCT AAAACT AAAATTCTATCAATGATAT TTCAAC- 

TGGCT 2811 



Qy 450 

TCCATCCTGGAGACAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCAC 5 09 

III I I I I I I I I I I I I I I I I I I I 

I 

Db 2810 ATCAAC TAAAAGTCTAGGCTTT TCTCTAATGCTCCA- 

CGC 2772 



Qy 510 GATGGGAATGTGATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTG 

TCGGGA 565 

III I III III II I I I I I I I I I I I I I I 
Db 2 771 T AT T G T G AC AT G AAAG AG T G AT A- 

AGACACTACAGTAAATCGTACTTGTGGAATTCAG — 2 715 



Qy 566 GTGCACGGAGGGGGAGCACGCAGAGCACCCCACAGTTCCACTCAAGG — 

ATGTGGTGGAA 623 

II I I I I I I I I I I I II I I I I I I I I II 

I I I I 

Db 2714 — 

GCCTGGAAGGGGCTTTGGCAAATAACTAAGCCAGCCCTCTCTTTGTAAAAAGGAGGAA 2 65 7 



Qy 62 4 CAG CACAAG-GCCTCGCTCCAGGTCC 

AGCTGGATGCTGTCAACAAAAGGCTCCC 6 76 



Db 2 656 AAGTATCTGAAGTGACTTACCAAAGGCTCGTTAGCAG — AGATGTCTGGATTAAAAT- 

CC 2600 

Qy 677 AGAAATAG

ATTCTGCTCTTCAGTTCATCTCTGAAATCATTCATCAGTTAACCAACC 732



Db 2 599 AGATATTGACAAATTCACCCCCAAATCCCATCTCTAGTTTCA 

TGGCCCACC 2549 

Qy 733 

AAAAGGCCAGCATCGTGGATGACATTCATTCCACCTTTGATGAGCTCCAGAAGACTTTAA 792 



Db 2548 CA CTAAATCATGTGCTCCTTTTAAATCTCCTTT-ATGCGCT 

GAGGTTTTAG 2 499 

Qy 793 ATGTGCGCAAG 

AGTGTGCTGCTTATGGAATTGGAGGTCAACTATGGCCTCAAAC 8 46 



Db 2 49 8 TTATACAAAAACCCCTTCCAGTCAAACAGAAAAAATT — 

AATTCAAATATGGAAAGAAAA 2 441 

Qy 847 A- 

CAAAGTCCTCCAGTCGCAGCTGGATACTCTGCTCCAGGGGCAGGAGAGCATTAAGAGC 9 05 
I I I I I I I I I I I I I III II I I I I I 

Db 2440 ATCAAA— CTTTCAGTATTTTC — GTGACT ATCTTG 

AGTAGTTCATTT 239 7 

Qy 9 06 TGCAGCAACTTC-ACAGC — GCAGGC- 

CCTCAACCATGGCACGGAGACCGAGGTCCTACT 961 



Db 2396 T C C AAC C T C T T AG AC AGC TGAC AGAC ACC AC T AAC 

CTGATCACCTATT 23 49 

Qy 962 

GGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAGCTGGCCGACCAGGACTTCCCCTTGCA 1021 



Db 23 4 8 TCAG AGCTTCA-AAGAT 

CCTTAATTTCCCCAAAGA 2315 

Qy 1022 C — CC GCGGGAGAACGACCAGCTGGATTTCATCGTGGAAACCG — 

AGGGGCTGAAGA 10 74 



Db 2314 CTTCCTGAGCAAAAGGACTACATGTGGGAAATCA — 

AT G AAAAC AGG AAT G AG AAG AG AA 2257 

Qy 1075 

AGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCCGTTGCCTCAGAGACAGTGG 113 4 



Db 2256 AGT TAAAAATT — GGTTGCTTTT GTT-CCT 



GTCATTG- 2223 



Qy 1135 CCACGGGCGAGGGGCTGCGGCAGACCATCATC- 

GGGCAGCCCATGTCCGTCACCATCACC 1193 

I I I I I I I I I I I I I I I I I 

I I 

Db 2222 GGGACTTTTG GTCTTCTAAACATTCCTTG 

AGTTCC 2188 

Qy 119 4 ACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCCTACCT 

CACCGCCGAA 1250 

II III I I I I I I I I I I I I I I I II I 

I I 

Db 2187 TCC ACA TGAGTTTTTAATATCAATCTA- 

GC AAAAAT T AAAAC AAAC AAA 2140 

Qy 1251 

CTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGACAACAAGAACGGCACC 1310 

II III II I I I I I I I I I I III I I I I I 

I 

Db 2139 AAAAG-GCCTC TGAAAGTCAG GAT AC T G AC C AC C 

TCGGCATC 2 099 

Qy 1311 TATGAGTTTTTGTACACTGTCCAGAAGGAAGGG GACTTTAC 

CCTGTCTCTGAGA 136 4 

I I I I I I I I II I I I I I I I I I I I I 

I 

Db 2098 TC 

TCCCCTTCCCCACTGCTTGCGAATGCCAGTGCTGGCCTGTTCCTTAAA 2 0 49 

Qy 1365 C TC T ATGACCAGCACAT 

CCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCT 1421 

II I I I I I I I I I II I I I I I I I I I III II I I I I 

I I I 

Db 2 0 48 CGC ACCAGCTCATCCCCCAGAGCAGCCTGTGTGCTCTGCCAG CGA- 

CTGCT 1999 

Qy 1422 

GATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCCAC 14 81 

I I I I I I I I I II I I I I I I I I I I 

Db 1998 ACTTCTCCTCCGGCCTCA 

GCTCAACATCCCAGCTATGCTACCCT 1955 

Qy 1482 GTCAAGC--

AGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAG 1539

I II I I I I I I I II I I I I I I I I I I I I I 

Db 1954 G GGCTATGTGAGCT-TG GGCAAGTTAGCA — 

TACATCTCTAGG 1915 

Qy 1540 

AGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATAAAGGAG 1599 

III I I I I I I I I I I I I I I I I I III 

I 

Db 1914 CCTC TCTTTCCA-TGGATAC 

AGAGAAGATGGTG 18 83 

Qy 1600 

AGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAATTGCAGACA 1659 

I I I I I I I I I I I I III I I I I I I I 

Db 18 82 A TGCCTA-CTTCATAGAGT TCTT 



ATAAGAATT- 



1852 



Qy 1660 

GTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTTTTGGCA 1719 

II III III I 

I I I I 

Db 1851 CC CTGA GGC 

TAGGCA 183 7 

Qy 1720 

TACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACATCCCAGTG 17 79 

I I I I I I I I I I I I I I I I I I I I I I 

Db 1836 T GGTGGCT CACAC CTGTA- 

ATCCCAG— 1812 

Qy 1780 

GGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCCGATGGGA 1839 

II I I I I II I I I I I I 

I I I I I 

Db 1811 CA CTTTG GA GGTCA 

AGGTGGGA 179 0 

Qy 1840 

AATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTGTGGACC 1899 
I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1789 A GATCA ATTGAGCCC — AGGAG-TTC — 

AAGACC 1761 

Qy 190 0 GCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGC- 

TGCGTGTTTATCTTCCAGCCA 1958 

II I I I I I I I I I I I I III I I I I I 

I I I 

Db 176 0 AGCC TGGGCAACATAGCGAGAGTGC ATCT — 

CTACCA 1726 

Qy 1959 AACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGT-TTGC- 

AG 2016 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1725 AAAGTGATTTT TTTGAT TTTTTGGGACAG — 

AGTCTCGCTCT 1686 

Qy 2017 GTC-CCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTC — 

CATA 2 0 73 

I I I I I I I I I I I I I I I I III II III I II 

Db 16 85 GTCACCCA GGCTG — GAGTGCAGTGGTGTG 

ATCTCGGCTCACTGCAAC 16 40 

Qy 2074 

ATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATG 2133 

II II I I I I I II I I I I I I I I I I I I I I I 

Db 1639 TTCTGCCT-CCCAGGT TC AAGCAGCTCTCATGT 

CTCA 16 0 4 



Qy 2134 

GAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAAACA 2193 

II I II I I I I I I I I I I I I I I I 

I 

Db 1603 GC CTCCC 

GAGTAGCTGGGATTACAGATGCCTGCC 1570 



Qy 2194 

TCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCATTTT 2 2 53 
I II I I I I I I III I I I I I I I I I I I I 

Db 1569 ACCATG CCAGGCAAATTTTGGGTT GTTTTTTTTGAGA- 

TGGA 152 9 



Qy 2254 TGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCC- 

TGGCCCTAACT 2312 

I I I I I I I I I I I I I III I I I I I I I I I 

I 

Db 1528 -GTCTTGCTCT ATCAGCAGGC TGG 

AGTGCAGTGGCGCGATC- 1489 



Qy 2313 

TCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGATAC 23 72 
III III I I I I I I I I I I I I I I I I I I I 

I 

Db 148 8 TCGGC TCA CTGCAACCTCTG — ACTCCCGGGTTCAAG 

CGATTC 1448 

Qy 2373 

TTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGAAT 2 432 
I II I I I I I I I I I I II 

I 

Db 1447 T CTTG CCTCAGCC TCTTG 

AGTAGCTGGGA- 1419 



Qy 2 433 GGATTTCTCAATGCG- 

GGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCATTG 2 491 
II I I II I I I I I 

Db 1418 CTACAGGCGCGCACCA 

CCA 1400 



Qy 2492 

GTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTATCCAATT 2 551 
III I I I I I I I I III III I I I I I I 

Db 1399 TGCCTGGCTAATTTTTG — TGT 

T T T TAG TAG AG AT G 1366 



Qy 2552 

TCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACTGCAGT 2 611 
I I I I I I I I I II III III 

Db 1365 GGGTTTCACCCTGTTGGCCAAGAGGACCCC 

CAGC— 1332 



Qy 2612 

TTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTTGCAA 2 6 71 
III I I I I I I I I I I I III 

I I I 

Db 1331 CTTC CCAAGCCATAGGGGGCTG GTGG 

GCAG 1302 

Qy 2672 

ATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGTGAATGG 2 731 

I I I I I I I I I I I I I III I II I I I I I I 

I 

Db 13 01 GGC GGCACTT AGCT-GCCT GGGGCACCCAGTA — 

CCTGTCAT — G 1262 



Qy 2 732 TAGCTTTTGCACAGAACTTCCAAAAG- 

CAAAACAAAAACAAAATCTATTGTAGTTATATA 2 790 

II I I I I I I I I I I I I I I III III I 

I 

Db 1261 AGGCACTTGCCCTGG 

CCAAAGGTCACTGCCAAGGCACAGCC A 122 0 

Qy 2791 

CTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAGTAGGT 2 85 0 
I I I I I I I I I II I I I I I I I I I 



Db 1219 CTCCAGAGA G TC AAGAGGAGC AG AG AC T C — C AG A 

GCAGATGGA 1178 

Qy 2851 

ATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAAAAAG 2 910 

II III I I I I I I I I I I I I I 

I I 

Db 1177 AGGA GAGTGG-GCTTCTGGGGATTGCACCAAGGA 

GGAAGCAG 113 7 

Qy 2911 TAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTC 

AGAAC 2968 

I I I I I I III III I I I I I I I I I I I I 

I 

Db 1136 AAGGGATGTGCAGCAG CAGAAAAGG — 

AGGAGGACCGGGGGGTTCTGAGGGC 1087 



Qy 2969 TTAAG — TATCAGTGCAAATTTCTCAACCT — 

TTCTGGGTTAGACAAAGATCCTTTTTTG 3024 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 10 86 TTGGGGTGATGAGGCCAGAGTTC-CAAACAGGGTCTGGGT 

GCAGG A 10 42 



Qy 3025 

TGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAG 3 0 8 4 
I I I I I I I I I I I I I I I 

Db 10 41 TGAGGAC 

AGAAGAAAACAGAGAAC 1018 



Qy 3085 

GGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGG 314 4 
II I III I I I I I I I 

Db 1017 CCAAAAGGGGCA AGAGGT — 

TGGTCCCTT 991 



Qy 3145 

GAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGT 32 0 4 

III I I I I I I II I I I I III 

I 

Db 99 0 ACT TTCACTC TC CTGG 

GAGGAGGATGG- 96 4 

Qy 3205 

CATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAAA 32 6 4 

I I I I I I I I I I I III 

I 

Db 963 GTACAC C AC AC -AG T 

TGGATGGTGCA 939 



Qy 3265 GCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCA- 

GTATCCTAGGACAGAGA 3323

I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

I 

Db 93 8 TCTTTGCC TGTCTGCACCTGCAG 

CTCCATGGCACCATGGACCCTGT 893 

Qy 3324 

GCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATG 33 83 

II II I I I I I I I I I II I I I I 

Db 892 GC TCC GAGTGGTGGTG CC 

TGAT 8 71 

Qy 3384 

ATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACAGGTAA 3 4 43 
I I I I I I I I I I I I I I 

I I I 

Db 8 70 -CTCCACC ACCA GGCAGCA 

GTAG 8 49 

Qy 3444 

GGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTATGT 35 03 
III I I I I I I 

Db 8 48 AGGC 

CGCTATCCAG 835 

Qy 3504 

ATACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACACACAT 3563 

I I I I I I I I I I 

I I 

Db 83 4 CAGGGTCAGGTTG 

CGCA- 818 

Qy 356 4 ATATATAGCTGAATCTTTGGTG- 

TATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTC 362 2 

I I I I I I I I I I I I I II I I I I I I 

Db 817 TGGTGATGGAGAAGTTGCCA 

TGGTGGTCGGAGGCCGA 781 

Qy 3623 GTCCAGCCACTCTTCAGCAC- 

ATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGC 36 81 

I I I I I I I II III II I I I I I I I I I I I I I I 

I I 

Db 78 0 CTCCAGCC CGT-GGCGCTGAGCCAGGTCGTGGCTGGTGTTGGCAG-CCTGGTG- 



Qy 3682 

AAGCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAGT 3 7 41 

III I I I I I I I I I I I I I I I I I I I I 

I 

Db 72 7 CT-CCATGGTGCAGGTGAAG GTCCTGGAACGT — 

GAGGTTGCGGATGGG 6 82 

Qy 3742 

CAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACACACATA 3801

I I I I I I I I I I I I I I I I I I 

Db 681 CC GGCGCTCTGAGC AGGTCTGCAC- 

CTCGC— 653 



Qy 



3802 



AAAAACCCAACAGGTCAAAATAAAAGTTGAACTTGAGTTACATTTAATTTAAATATAAAT 3 861 
III I I I I I I I I I 

Db 652 CCC 

TCGAGCTGCGGTACCA 634 



Qy 3862 

GCATTTTGAGAAATGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGTACTGTAA 3921 
I I I I I I I I I I I I I I I I I 

Db 633 -CGTCTTG TAGAA GGTCA 

CATC 613 



Qy 3922 

AATAAGCTGTGGTCTATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAATAGGATGT 39 81 
I I I I I I I I I I I I I I I I 

I I 

Db 612 GTGCCCTTTGTCCACAG 

GGCCCAAGAGCCTG- 582 



Qy 3982 

CATGTTTGACATTTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCTTAGAACT 4 0 41 
III III I I I I I 

I I 

Db 581 CAGG TGAGGGTGACGTT 

CT 563 



Qy 4042 

GTACTATAAGAAAACATTTCCCCTATGTATAATTATATGAATGTGATGTTTATTGCTTAT 4101 
I I I I I I I I 

Db 562 G 

CCCCTCGGGA 552 



Qy 4102 

TAATTTATAATTCAGTCATTCTCTATATAGGACTTCTTAAAATTTAGAAGGGAAATCTAG 4161
I I I I I I I I I I I 

Db 551 CAGACATAC 

AGGGAATACGG- 532 



Qy 4162 

CTACTTCAAATTGTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAAGGTTTTT 42 21 

I I I I I I I I I I 

Db 531 CGTGGCGACCT- 

TGAAGGCTG 512 



Qy 4222 

CCAGGAAGATTTACATTTAGGTTTAATATTTTTTTAGTTAGGTAGAGTTTTAAAAAATAC 42 81 



Qy 4282 

TTGAGCCTGTCCGTGATAAAGCTATAAAATTCAATAACTTTTTAGAATGTTAAATGAAGA 4341
I I I I I II III I 

Db 50 8 CCGGACC TGC TCA 

GAGAGAGGAGAG 48 4 



Qy 4342 

CACTGTTTCCTAACATCAGTGAGATACATCTTTGAATTTAAACATTCATATTTACTGAGT 4401
I I I I I I I I I I I I I I 

Db 483 CCCTGTCACCTGAC 

TGATCCA 463 



Qy 4402 

ACCTACTAGGTACCAAGTACTCTTTTAGGCACTGGAAATACAGTGATGGACAAAACAGGT 4 461 
III I I I I I III 

Db 462 GTGC TACTC 

CTG 451 



Qy 4462 

AAAAAATCGCTGCCCCCTCAGAGCTGACATTCTGGGGTGGGAATTTCATTTTGCCACGTA 4521 

II I I I I I I I I I I I I II I I I I I I 

I 

Db 45 0 GCAACCCCAGAGC GGGGAAGAA CCTTAGCCTC- 

TG 417 



Qy 4522 

CTAACGTTCTGCACAAAAGACAGGCTAGACTCTTGTCTAGATTGTTTAAAAGAAACTTTT 45 81 
III I I I I I I I I I I I I I I I I I I I I I 

Db 416 CTA CTTCCCAGCC-ACAGGCCAGGGGCATGTC-AGCT 

GACCTCC 3 75 



Qy 4582 

CAAATTGGTTACATTAATTTTAGTTTATTTTCACAAGTAAAAATGGCTTTTTATTTAGAT 46 41 
III I I I I I I I I I I I I I I I 

Db 374 CTAA — GACTGCCTTC 

TGCATTTGTTCACAAACCC 3 42 



Qy 4642 

TCTTTCTGTCCCAGGCTGTTGATCTTAAAACTAGTTGATTTAAAGAGTTTTTTTGCACAA 4 7 01 
I II I I I I I I I I I I I I III I I I I I 

I I I I 

Db 3 41 -CACTCCATCCTAGGCTGTTG TAGGGG GAGT 

CACAG 307 



Qy 4702 

CATTTCAATTATATTTGTGAACTTAGAAATTAACTTACAATCTAACCAGCCATCATATCA 4 761 

I I I I I II 

Db 306 AAGCTTCC 

CCAAC 29 4 



Qy 4762 

TATCCTATCAGGCTAGATATCTCAATAGTAGACTGAATACAAAGCTAATTTTTTTTACAT 4 821 
I I I I III I I I I I I 

I I I 

Db 293 TAGGGGTG CTC CTGGATGC 

TGGACAG 26 8 



Qy 4822 

GTCAATATTGGCACAAACTGGAATGAAAGAATAGTTTGATTCAGACCTGCTCCACTATGT 4 8 81 
III I I I I I I I I I I II I I I I I I I I I 

I I I 

Db 26 7 GGCA GGGAGAGACTTGGAGGGCAG TGATGCAGTCGGG 

TGT 228 



Qy 4882 

GTTGCTAAAACACATGCTATGAGCACTCGAGGAAACACTATATTTTTTCCAAAAAATATG 4941
II I I I I I I I I 

Db 227 GT CCCTC 

TTCC 217 



Qy 4942 



TGATTATATATGTTAAAGTATAGATAACATTTCACACTTGGATACATATGTGCATTTACT 5001

II I I I I I I I I I I I 



TATATGCAAAACCT 195 
Qy 5002 

GTATTTCTTGGTAAGCATATTTTTGGGGGAAAGTGCTGCTGATATGATACAAGTAGACAA 5 061 
I I I I I I I I I I I I I 

Db 19 4 G TTGGGTGGGAG 

GCAGGT 177 

Qy 5062 

AATTTAAATGAAATTTTGTCACATTCTATGGAAAATGGTTTCTGGTAAACTGAGAAGGAT 5121 

III I I I I I I I I 

I 

Db 176 TGGGGAGCAG- 

GAAGGCT 16 0 

Qy 5122 

ATTAAAATAAGTGGCTTTTTTCTGGGCTACCATTATTGTTTGATTTCTCTTTGTCAAGTG 5181 

II I I I I I I II I I I I I I I I 

I 

Db 159 GTTTCGAACAGCAGCT CC 

TTGTCCTTGGCCGCTC 126 

Qy 5182 

TATAGAACCTGTCATACATTCATGATAAGTAGCACTGAAAAATTACTCATTCAAATTTCC 52 41 

I I I I I I I I I I I III I 

I I 

Db 125 TGTAGGGGCCATC AACAC CTC 

TGCC 101 

Qy 5242 

CCTGGGCACGTAAGGCAAAATATTGCCGGTTGGGATTTCAAGGTCAGTGACGACGCATTT 53 01 

II III 

I 

Db 100 CC 

TTCC T 94 



Qy 53 02 CCTCCCAGTACAGACCCC-CCAGCCCC- 

CCTTGCTGGACATGGGGAGGCAGAGAGTCACT 5359 

I I I I I I I I I I I II III I I I I I I I I I I I I 

I I I I 

Db 93 ACTCCTAGGACTGAACATGCCTTCCCATCCTTGC CAGGGAG- 



Qy 5360 — 

TGACCATCCAGAAATACATGACTACAAGTCCTTTATGACTGTTTGCCATTTTTTTTA 5 416 
I I I I I I I III I I I I I I I I I I I I II 

I 

Db 4 8 AAGAGAGCA-CCTG CGTG — TGCAAGGCC — 

TGGGCCAGTAGGCC A 8 

Qy 5417 ATGGTAC 5423 

I III 
Db 7 ACAGTGC 1 
Qy 5 GTTG -- TCTGC

GGGCTGCGGGGAGCTAAGTCCCCAGATTGGAGGAGGCTGGCTCT 57 

III I I I I I I I I I I I I I I I I I 

Db 1 GTTACTTCTGCTCTAAAAGCTGCGG 

AATTCC 31 

Qy 58 

GGTCTTCGATGCACAGGAGTGGCCGTTATGGAACGCAGCAGCAGCGTGCAGGGTCAAAGA 117 



Db 32 TCGA-GCACTG — TTGGCCTACTGGGTGTACTGTGGAGAC 

TGTCAAAGT 7 7 

Qy 118 

CAGCCGGCCCCCCATGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCCAAGTC 17 7 



Db 7 8 CTCCCGGAGCCCAATTTC CGGAAG — CGGTG- 

AGTT 110 



Qy 178 

CTGTGGTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAGTATATGCCTGGAACGGTACA 23 7 
III I I I I I I I I I I I 

Db 111 CTG AAAGAAGTT 

CCTGCACCG T 132 



Qy 238 

AGAATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGCCTGCAGAACTACA 2 9 7 
I I I I I I I I I I I I I I I 

I 

Db 133 AGTTTCCCAAG 

TCTGCG A 150 



Qy 298 

TTCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGCCGCCAGACCTCCATCCTGCCCG 35 7 
I I I I I I I I I I I I III I I I I I 

Db 151 ATCC — CCAACCATGAGCGCCTC 

GGGCGTACTGTCCTTTACCCAGC 19 4 



Qy 358 

AGAAAGGGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAAACCTGATGGACGTGCTGC 417 
I I I I I I I I I I III III III 

I I I I 

Db 195 AAGGATGG GAGCAGGTGCTG GCCAAAGTGA — 

AACGGGCTGT 23 4 



Qy 418 

AGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGACAGTCACTGCTGTGG 4 7 7 
I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 235 GGTTTA — CCTGGACGCCGC-CTGCG CCGAGAGCCTG-CA — 

CTGGGG 276 



Qy 478 CTGCGGGAAAGCCTCTC- 

TCTTGCCCAAACCACGATGGGAATGTGATGGAATTTTACTGC 536 

I I I I I I I I I I I I I I II I I I I I I 

Db 2 77 CTGCGG ATCCACCCGTCT CCTGGAGGCGGTGGG- 

GG 311 



Qy 537 

CAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACGCAGAGCACCCC 596 

I I I I I I I I I I I II II IIII I I I I I I

I 

Db 312 — GCCCT GACTGTCACCTG-CGAGAGTTC 

GAGCCCGAC 3 46 

Qy 597 

ACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCAGGTCCAGCTGGAT 656 

I I I I I I I I I II I I I I I I I I I I 

Db 3 47 GCAATT GGTGGTGGAGC CAAG 

CAGC 3 71 

Qy 657 

GCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTCATCTCTGAAATCATT 716 
I I I I I I I I I I I I II 

Db 372 

CCAAGGCAGTGTTTGTGCTGAGCTG 396 

Qy 717 

CATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTCCACCTTTGATGAG 7 76 

II I I I I I I I I I I I I I I I I III I I I I 

Db 39 7 CCTGCTGAAAGGCCGG- 

AC C G T GG AG AT C C T AC GGG AC AT CAT 438 

Qy 777 CTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATGGAATTGGAGGT — 

CAACT 83 4 

I I I I I I I I II I I I I I I I I I I I 

I I I I 

Db 439 CTGCCGCAGTCACTTC CAGTATTGTG 

TGGTGGTCACAACC 478 



Qy 



835 ATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACTC- 



TGCTCCAGGGGCAGGAG 893 

II I III I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 79 GTGAGC CACGCTGTCCACC — TCACAGC 

TAATCATGTCCCAGCGGC 522 

Qy 894 

AGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAACCATGGCACGGAGACCGAG 953 

I I I I I I I I II III III I I I I 

I I I I 

Db 523 GGCAGCGGC — CGAGATGGAG — GGGCAGCAGC 

CGGTGTTCGAG 562 

Qy 954 

GTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAGCTGGCCGACCAGGACTTC 1013 

I I I I I I I I I I I I III I I I I I I I I I I I I III 

I 

Db 563 CAGCTGGAGGAGAAGC TGTG TGAATG- 

GATGGGCAACATGAACTAC 607 

Qy 1014 

CCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTGGAAACCGAGGGGCTGAAG 10 73 

I I I I I I I I I I 

Db 608 AC GG— 

CCGAGGTG 619 

Qy 1074 

AAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCCGTTGCCTCAGAGACAGTG 1133 
I I I I I I I I I I I II I I I I I I I 

Db 62 0 TTCCATGTCC CGTTATTGCTTGCCC 

CTGTTGCTCC 654 

Qy 1134 

GCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCCATGTCCGTCACCATCACC 1193 

I I I I II II I I I I I I I I I I I I I I I III 

I I 

Db 655 -CCAC TTTGCCTTGAC — TCCAGCTTT-TGCATCCCTTTTCC — 

CACTGCTAC- 701 

Qy 1194 

ACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCCTACCTCACCGCCGAACTG 12 53 
I I I I I I I I I I I I I I I I I I I 

Db 702 -CCCAGGA TGTGCA 

CCTCCTTAATAG 726 

Qy 1254 

AGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGACAACAAGAACGGCACCTAT 1313 
I I I I I I I I I I I I II I I I I III 

Db 72 7 CGCCCGAC CGGACAAGAGGAAGCTGGG 

AAG 756 

Qy 1314 

GAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTACCCTGTCTCTGAGACTCTATGAC 13 73 

III I I I I I I I I I I I I I I I I 

I I 

Db 75 7 CCTGGGTGATGTGGAC — TCCACT ACGCTA — 



Qy 1374 

CAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCTGATGTGTCTCCC 1433 
I II I I I I I I I I I I I I I I I I 



Db 78 7 C CCAGAGCTGC TGCTG 

C AG AT C AG AT G 813 

Qy 1434 

ACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCCACGTCAAGCAGAAA 1493 

I I I I I I I I III 

Db 814 CCTAGTGTCAGGCCT 

CAGTTC 83 4 

Qy 1494 

GCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAGAGAATCCCATCGAA 1553 
I I I I I II I I I I I III III I I I I 

Db 835 TCTGTGTGAA CATTTA 

GGAGTACGGGAGGAGT 866 

Qy 1554 

GACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATAAAGGAGAGTTTACAAATCTT 1613

I I I I I I I I I I I I I I I I I I I 

I 

Db 867 GTTTTGCT GTAGGTTCCTTAAG 

TCAGGTCAT 89 7 

Qy 1614 CAGGGGGTAGCTGC — 

ATCTACAAATGGAAAGATATTAATTGCAGACAGTAACAACCAAT 16 71 

I I I I I I I I I I I I I I I III I I I I I I I 

Db 898 C GCTGCGGATC TGGCCAATTA TGCC- 

CCTGCAAAGAACA — 935 

Qy 1672 

GTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTTTTGGCATACGGGGACGCT 1731 

I I I I I I II I I I I I II I I I I I I I I I I I I I 

Db 936 GGAAGAAGA CTGCTGCAGG-CAGGGCATCAGTGGTTTT 

TGTGGACAGA 9 82 

Qy 1732 

CTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACATCCCAGTGGGGACATAATCA 1791 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 9 83 ACCCTGGAT CTCACAGGAGCAGTTGGACATC ATGGAGAC 

AACT 1025 

Qy 1792 TTGCCGATTATGATAATAAATGGGTCAGCATTTTCTC — 

CTCCGATGGGAAATTTAAGAC 18 49 

II II II I I I I I I I I I I I I I I I I I I I II I 

I I 

Db 1026 TAGTAGA GAAGATCATT TCAGCACTTCCCCAGCTCCCA— GGCCAC 

AC 1071 

Qy 1850 AAAAATTG-GATCAGGAAAGC-TGATGGGACCCAAAGGAGT 

TTCTGTGGACCGCAAT 19 0 4 

III I I I I I II I I I I I I I III III I I I I I I I 
Db 10 72 AAATGATGTGAT — 

GGTTAACATGATAGCGCTCACTGCACTCCATACTGAGGA 112 2 

Qy 19 05 GGGCACATTAT — 

TGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCTTCCAGCCAAACG 1962 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1123 -GGAAAATTATAATGTGGTTG — CACCAGGC TGTCTTTCA- 

CAATTCA 1166 



Qy 1963 

GGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTGCAGGTCCCC 2 02 2 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 116 7 G TGACACCA CAGCCAAAGCCCTATGGG-AAGC — 

T T T AC T G AAC AC T 1210 

Qy 2023 

ATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTCCATAATCATTCTG 2082

I III II I I I I I I I I I I I I I I I 

Db 1211 A AGCACAAAGAGGCAGTGATG 

G AAG T T C GG AG AC AT C TAG 1250 

Qy 2083 

TCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATGGAGAAGGAA 2142 

I III I I I I I I I I I III I I I I I I I I 

Db 1251 TGGAAGCG GCAAGCA-GAGAAAACCTG 

C C AAT C AAG AT GAG T 1291 

Qy 2143 

ATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAAACATCATTGTGG 2 2 02 

I I I I I I I I I I I I I III I I I I I III 

I 

Db 1292 ATGGGGAGAG TCACACCGGGACAGC TCATGT 

CCTATATTCAGC 133 4 

Qy 22 03 CCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCATTTTTG 

TC 2257 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1335 TCTTCAAG AACAAC CTCAAAGCTCT 

AATGAATCATTGTGGCCTCCTC 13 81 

Qy 2258 CT AC ATT AAC— 

ACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCCCTAACTTCA 2315 

I II II I I I I I I II I I I I I I I I I I II 

I 

Db 13 82 CAGCTTGGACTGGCCACAGCT — C AAAC G T T G AAAC AC C C AC AG AC T G — CC 

AA 1431 

Qy 2316 

GATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGATACTTA 23 75 

I I I I I II I I I I I I I I I I I I I I I II I I 

I 

Db 1432 GTGGGACAACTT TCTGG CT-TTTGAAAGGC — 

TCCTTCTTCA 1470 

Qy 23 76 CAGTAATGGTGGG- 

CAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGAATGG 2 43 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1471 GAGCATTGGGGAGTCAG CAATGTCCGTTGTGTT 

AAATCAGCTG 1513 

Qy 2 435 ATTTCTCAATGCGGGACCA- 

GATTATGACTAGAGTTTTTATGCCAGAAGGAATCATTGGT 2493

II I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1514 CT GCCCATGATTAAGCCTG TAACCCAGA — 

G AAC C AAC GAG 1552 



Qy 2 49 4 GA — ACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCT — 

TAAAAATGACTTATCCAA 2 5 49 

I I I I II I I I I I I I I I I I 

Db 1553 GACTACAGCCCTGAGGAACTGCTGA 

TCCTTCTCATATATATTTATTCTG 16 01 



Qy 2550 

TTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACTGCA 2 6 09 

I I I I I II I I I I I I I I I I I I I I I I I I 

I 

Db 16 02 TCACTGGAGAGCTC ACGGTAGACAAAGAC CTGTGT 

GAAGCAGAA 16 45 



Qy 2610 

GTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTTGC 2 669 
I I I I I I I I I I I I I I I I I I I III 

Db 16 46 GAAAAAGTCAAGAAAGCATTGGCT 

CAGGTCTTCTGTGAGGAATCTGGATT 1695 



Qy 2670 

AAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGTGAAT 2 72 9 

I I I I I III I I I I I III I I I I I 

I 

Db 1696 GTCACCT TTGC TGCAAAAAATTACG 

GACTGGGACT 173 0 



Qy 2730 

GGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGTTATAT 2 7 89 

I I II III III I I I I I I I I I I I I I I I I 

I 

Db 1731 CTT — CAATTAATCTGACATTTC AC AAAT C C AAAAT - 

TGCCGTGGATGAAC 1778 

Qy 2 79 0 ACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCT- 

AACCTCACTTTTACAGTAG 2 8 48 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 1779 TCTTTACTTCACTT 

CGGGATATTGCTGGAGCTC G 1812 

Qy 2849 

GTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAAAA 2 9 0 8 
I I I I I I I I I I I I I III I I I I III 

I I I I I I 

Db 1813 GAGT — CTCCTGAAACAGTTT AAGTCTGTATATGTTCC — 

TGGAAA- 1854 

Qy 29 09 AGTAAGGCTGTCTTGCAAAATGATCCCAGC- 

TCTGATTAGCAGCCCTCTGGAGTTCAGAA 2967 

I I I I I II I I I I I I I I I II II I 

I 

Db 1855 TCATACCCA CCAGGCATCTTATAAG CCATTG 

TTGAAG 1891 

Qy 296 8 CTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCC- 

TTTTTTGTG 3026 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

I I 

Db 1892 C — AAG TTGTGGAG G AAAT AT T T C AT C C C 



GAGAGGCCAGATTCCGTTGATATTG 19 43 



Qy 3027 TGTTCTTTTCACCA — 

CCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAG 3 0 8 4 

I I I I II III I I I I I I I I I I I I I I I I I I 

I 

Db 19 44 AACACATGTCTTCAGGCCTCACT-GATCTCCTTAAAACTGGATTTAGCATG- 

TTCATGAA 2001 

Qy 3085 

GGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGG 314 4 

I I I I I I I I I I I I I 

I I 

Db 2 0 02 GGTGAGCCG GCCT 

CATCCTAGTG- 2 02 4 

Qy 3145 

GAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGT 32 0 4 
I I I I I I I I I I I I I I I II 

Db 2 025 ACTACCCC CTCCTGATC 

CTC 2044 

Qy 3205 

CATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAAA 32 6 4 

I I I I I II I I I I I I I I I I I II I I I 

I I I 

Db 2 0 45 TTTGTGGTAGGTGGGGTCACA GTCTCT 

GAAGTGAAAATGGTCAAA 2 0 89 

Qy 3265 

GCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGACAGAGAG 332 4 

II I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 09 0 GAT CTTGTGGCATC GTTGAAG 

CCAGGAACCCAGGTAATCGTG 2131 

Qy 3325 

CCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATGA 33 8 4 

I I I I I I I I 

I I 

Db 2132 C TGTCCAC 

ACGA 2143 

Qy 3385 

TTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACAGGTAAG 3 4 4 4 

III II III III I I I I I I 

I 

Db 2144 CTCC TGAAGCCA CTT AACATTCC 

TG 2168 

Qy 3445 

GGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTATGTA 35 0 4 

II I I I I I I I III I I I I I I I I I I I I I 

Db 2169 AGCT GTTATTTGCAACTG ACCGACT GCATCCA — 

GACCTTGGC 22 09 

Qy 3505 

TACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACACACATA 356 4 

II I I I I I I I I I I I I I I I I I I I 
Db 2210 TTC TGAGCATCCGCTAAGAAG ATA- 



AG AC C T AC T C A 22 44 



Qy 3565 

TATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTCGT 362 4 

I I I I I I I I I I I I I I 

I 

Db 2245 AGCTG 

GAAATGCCGATGCAATTT 226 7 

Qy 3625 

CCAGCCACTCTTCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGCAAG 36 8 4 

I I I I I I II I I I I I I I I I I I I I I I I I I II 

Db 226 8 TCTGCCACCACTCCAAATACTCCTCCAC-AACCAG 

CGTCCCTGT 2310 

Qy 36 85 CCCTAA-AGC-AGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTT- 

TTAGT 3 741 

I I I I I I I I I I I I II I I I I I I I I I III II I 

I I I 

Db 2311 C AC T AAT T GC G AG AAT G A 

TGGAATTCTGCCTGAAGGGTCTTGATACCTACTCAGT 2365 

Qy 3 7 42 CAGAAA 

ATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACA 3 795 

II I III I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2366 

GAGGTACTTTGCTTGGATTGCTGTGATTCTTAAAAAAAAAAAAAAAAAAGGCCACATGTG 2 42 5 

Qy 3 796 CACATAAAAAACCCAACAG 3814 

II I I I I 

Db 2 426 CTCGAG CTGCAG 2437 
Qy 17

CTGCGGGGAGCTAAGTCCCCAGATTGGAGGAGGCTGGCTCTGGTCTTCGATGCACAGGAG 76 

I I I I I I I I II I I I I I I I I I I I I 

Db 2 43 7 CTGC AGCTCGAGCAC ATGTGGC 

CTTTTTTTTTTTTTTTT 2399 

Qy 77 

TGGCCGTTATGGAACGCAGCAGCAGCGTGCAGGGTCAAAGACAGCCGGCCCCCCATGTCA 136 

I I I I I I I I I I I I III II II III 

Db 239 8 TTAAGAATCACAGCAATC-CAAGCAAAGT AC 

C T C AC T GAG TAG 235 7 

Qy 13 7 GTGGTCTAGGATGGCCAGTGAAGGCA CCAACATCC CAAGTCCTG — 

TGGTGC 186 

II II I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 2356 GT-ATC AAGACCC- 

TTCAGGCAGAATTCCATCATTCTCGCAATTAGTGACAGGGAC 23 03 

Qy 187 GCCAGATTGACAAGCAGTTTCTGAT TTGCAGTATATGCCTGGAACGG 

TACAAG 239 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 23 02 G-CTGGTTGTGGAGGAGTATTTGGAGTGGTGGCAGAAAAT 

TGCATCGGCATTTCCAG 22 47 

Qy 2 40 AATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGC- 

CTGCAGAACTACAT 29 8 

I I I I I I I I I I I I III I I I I I I I I 

Db 22 46 CTTGAGTAGGT CTTATCTTC TTAGCG 

GATGCTCAGAAG 22 09 

Qy 299 TCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGCCGCCAGAC 

CTCCATCCTG 353 

I I I I I I I I I I I I I I I 

I 

Db 22 0 8 CCAAGGT 

CTGGATGCAGTCGGTCAGTTGCAAATAACAG 2171 

Qy 354 CCCGAGAAAG — GGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAA — 

ACCTGATGGA 409 

II I I I I I I I I I II II II I I I I I I I I I I I 

I I 

Db 2170 CTCAGGAATGTTAAGTGGC TTCAGGAGTCGTGTGGACAGCACGATTACCTG 

GGT 2117 

Qy 410 CGTGCTGCAGCGAACTCCAGGCAGCAACGCTG AGGAGTCTT 

CCATCCTGGAGA 462 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 2116 TCCTGGCTTCAACGATGCCACAAGA- 

TCTTTGACCAT 2081 

Qy 463 

CAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGA 52 2 
I I I I I I I I I I I I I I I I I I I 

Db 2080 -TTTCACTTCAGAGACTG TGACC 

CCAC 2 055 



Qy 523 

TGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGC 5 82 

I I I I I I I I I I I I I I I I I I I I 

Db 2 054 CTACCA CAAAGAGGAT CA- 

GGAGGGGGT — 2 02 8 

Qy 583 

ACGCAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCC 6 42 

III I I I I I I I I I I I I I I I III 

I I 

Db 2 02 7 AGT CACTAGGATGAGG CCGGCTCA CCT 

TCA 1998 

Qy 643 

AGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTCA 7 02 

I I I I I I I I I I III I I I I I 

Db 199 7 TGAACATGCTAAATCCAGTTTTAAGGAG 

ATCAGTGAG 1961 

Qy 7 03 TCTCTGAAATCAT-TCATCAGTTAACCAACCAAA — 

AGGCCAGCATCGTGGATGACATTC 759 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 196 0 GC-CTGAAGACATGTGTTCAAT — ATCAACGGAATCTGGCCT — CTCG-GGATGA- 



Qy 76 0 ATTCCACCTTTGATGAGCTCCA — 

GAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTAT 817 

I I I I I I I I I I I I I I I I I II III 

I 

Db 19 0 7 TTTCCTCCACAACT-TGCTTCAACAATGGCTTATAAGATGC- 

CTGGT 1863 



Qy 818 

GGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACTCT 8 7 7 
II II I I I I I I I I I I I I I I 

I I 

Db 1862 GGGTATG ATTTCCAGGAACATA TACAGAC 

TTAAACT 182 7 

Qy 8 78 GCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCAC— AGCGCAG- 

GCCCT 931 

I I I I I I I II I I I I I I I I I I I III I I I I I 

I 

Db 1826 GTTTCAGGAG 

ACTCCGAGCTCCAGCAATATCCCGAAGTGAAGTAAAGAGTT 17 76 

Qy 932 

CAACCATGGCACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAA 991 
I I I I I I I I I I II I I I I I I I I I I I I 

I I I I 

Db 17 75 CATCCACGGCA ATTTTGG ATTTGTGAAATGTCAGAT 

TAATTGAA 1732 

Qy 992 

CGAGCTGGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCAT 1051 
I I I I I I I I III I I I I I I I I I I I I I I 

I 

Db 1731 -GAG-TCCCAGTCCGTAATTT TTTGCA-GCAAAGGTGACAATCCAG 

ATTCCT 1683 



Qy 1052 

CGTGGAAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAA 1111 

I I I I I I I I I I III II I I I I I I 

Db 16 82 C ACAGAAGACCTGAG CCAATGCTT 

TCTTGACTT 1650 

Qy 1112 

CGCCGTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCA 1171 

I I I I I I II I I I I I III 

Db 1649 TTTCTTCTGCTTCA CACAGG 

TCTT T 1625 

Qy 1172 GCCCATGTCCGTCACCATCACCACCAAGGACAAAGACGGTG- 

AGCTGTGCAAAACCGGCA 1230 

III I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 162 4 GTCTA CCGTGAGCT 

C T C C AG T G AC AG AAT AAAT AT AT AT G AGAAGGATC AGC A 15 72 

Qy 1231 

ACGCCTACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCC 12 9 0 

I I I I I I I I I I I I I I I II III I I I I 

I 

Db 1571 G TTCCTCAGGGCTGTAGT 

CCTCGTTGGTTCTCTGGGTTACAGG C 1528 

Qy 1291 

TGGACAACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTA 1350

I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 152 7 T — TAATCATGGGCAGCAGCT GATTT — AACAC 

AACGGACATTG 148 8 

Qy 1351 

CCCTGTCTCTGAGACTCTATGACCAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGA 1410 

I I I I I I I I I I I III I I I I I I I I I I I I I I I 

Db 148 7 — CTGACTC CCCAATGCTCTGAA GAAGGAGCC — TTTCA 

AAAG 1449 

Qy 1411 

TCCGATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGG 14 7 0 

II III I I I I I I III I I I I 

I 

Db 1448 -CCAGAAAGTTG TCCCAC TTGGC 

AGTCTGT-G 1419 

Qy 1471 

GGAGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAA 153 0 
III I I I I I I I I I I I I III I I I I I I 

I I I 

Db 1418 GGTGTTTCAACGTT TGAGCTGTGGCCAGTC CAAGCTGG 

AGGAG 13 76 



Qy 1531 

AACGAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAA 159 0 
I I I I I II I I I I I I I I I 

Db 13 75 GCCACAATGA TTC 

ATTAGAGCTTT 1352 



Qy 1591 

ATAAAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAA 165 0 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1351 GAGGTT GTTCTTGAAG AGCTGAATAT 

AGGACATGA- 1317 

Qy 1651 

TTGCAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTC 1710 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1316 — GCTGTC CCGGTGTG ACTCTCC CCA-TAC 

TC 1288 

Qy 1711 

GTTTTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTAC 17 7 0 

I I I I III I I I I I I I I I I II II II I 

I I 

Db 12 8 7 ATCTTGATTGGCAGGTTTTCTCTGC 

TTGCCGCTTCCACTAGATGTCTCCGAAC 1235 

Qy 1771 

ATCCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCT 183 0 

III I I I I I I I I II III 

I I 

Db 123 4 TTCC ATCACTGCC TC 

TTTGTGCT 1212 



Qy 1831 

CCGATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTT 189 0 

I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1211 TAGT GTTCAGTAAAGCT 

TCCCATAGGGCTTT 1181 

Qy 1891 — CTGTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCG- 

TGCTGCGTGTTTA 1947 

I I I I I I I I II I I I I I I I I I I III I II 

I 

Db 1180 GGCTGTGGT 

GTCACTGAATTGTGAAAGACAGCCTGGTGCAACCACATTATAA 1129
Qy 1948 

TCTTCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGC 2 0 0 7 

I I I I I I I I I I I I I I I 

I I 

Db 112 8 TTTTCC TCCTCAG TATGGAGT- 

GC 1106 

Qy 2008 

AGTTTGCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATT 2 06 7 
I I I I I I I I I I I 

I 

Db 1105 AGTGAGC GCTATCAT 

GTTA 1087 

Qy 2068 TCCAT — 

AATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGG 212 5 

I I I I I I I I I I I I I I I I I I I II I I I I I 



Db 10 86 ACCATCACATCATT- 
CTGGGGAAGT G 1047 



-TGTGTGGCCTGGGAG- 



Qy 2126 

CTCAAATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAA 2185 

II I I I I I I I I I I I I I I I I I 

Db 10 46 CT GAAATGATC TTCTCTACTA 

AGTTGTCTC 1017 

Qy 2186 

TGGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGG 2 2 45 
III I I I I I I I I II I I I I I I I 

I I 

Db 1016 CATGATGTCCAACT GCTCCTGTGAGATCCAG 



Qy 2246 

ATCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGC 23 05 
II I I I I I I I I I I I I I I I I 

I I 

Db 9 83 TTC TGTCC-ACA — AAAACCACTGAT 

GC 959 

Qy 2306 

CCTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTA 2365 
I I I I I I I I I I I I I 

Db 958 CCTGCCTGCA 

GCAGTCTT 941 

Qy 2366 

TCGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAA 2 425 

I I I I I II I I I I 

Db 940 CTTCC- 

TGTTCTT 929 

Qy 2426 

CTGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAA 2 4 85 

I I I I I I I I I I I I I I I 

Db 92 8 TGCAGGGGCA 

TAATTGGCCAGA 907 

Qy 2486 

TCATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTAT 2 5 45 
III I I I I I I I 

I I I I I I I 

Db 9 06 TCC GCAGCGATGACC 

TGACTTA- 8 85 



Qy 2546 

CCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAAC 2 6 05 
II I I I I II I I I I I I I I I I I I I I 

I 

Db 884 AGGA AC CT AC AG CAAAACACTCCTCCCGTA-CT- 

CC 851 

Qy 2606 

TGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATT 2665

I I I I I I I I I I I I I I III 
Db 850 TAAATGTTCACA- 

CAGAGAACTGAGGC 825 



Qy 2666 

TTGCAAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGT 2 72 5 

I I I I I I I I I I I I I III 

Db 82 4 CTGACACT AGGCATCTG 

ATCT 804 

Qy 2726 

GAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGTT 2 7 85 
I I I I I I I I II III 

I I I I I 

Db 8 03 GCAGCAGCAGCTCTGGGGTT AGC 

GTAGTG 7 75 

Qy 2 786 ATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCT — 

CACTTTTAC 2 8 43 

II I I I I I I I I I I I I I I I III I I I I I 

I 

Db 774 G AG T C C AC AT C AC C C AG G 

CTTCCCAGCTTCCTCTTGTCC 736 

Qy 2844 

AGTAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTG 2 9 03 

I I I I III III III I I I I I II 

I I 

Db 735 GGTCGG GCGCT ATTAAGGAGGTGCACATC— CTG 

GGGTAGCAGTG 693 

Qy 2904 

GAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTC 2963 

I I I I I I II I I I I I I I I I I III II I I I I 

Db 692 GGAAAAG GG ATGCAAA AGCT- 

GGAGTCAAGGCAAAGTGGGG 653 

Qy 2964 AGAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGA —

TCCTTTT 3021 

I I I I I I I I I I I I II I I I I II III 

Db 652 AGCAACAGGG GCAAGCAATAACGG 

GACATGGAACACCTCGG 612 

Qy 3022 TTGTGT- 

GTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCT 3 0 8 0 

I I I I I I I I I I II III I I I I I I I I III 

I I 

Db 611 CCGTGTAGTTCATGTTGCCCATCCAT TCAC ACAGC 

TTCTCCT 570 

Qy 3081 

TCAGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTT 314 0 

III III I I I I I I II I I I I I I 

Db 569 CCAG CTGC TCGAACACC GGC 

TGCTGCCCC 541 

Qy 3141 

TTGGGAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGT 32 0 0 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 540 T CCATCTCGGCCGCTGCCGC CGCTGGGA 

CATGA T 50 7 



Qy 32 01 TAGTCATGTTTTATGTACCATGTTTTCACACG — 

TGTCTCTTCTCTTCGACTTCCTGAAA 3258 

III III I I I I II I I I I I III I II I II 

I I 

Db 506 TAG — CTGTGAGGTGGACAGCG-TGGCTCACGGTTGTGAC — 

CACCACACAATACTGGAA 452 

Qy 3259 

GCGAAAGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGAC 3318 

III I I I I I I I I I II I I I I I I I I I I 

I 

Db 451 GTGA CTGCGGCAGATGATGTCCC GTAGGA 

TCTCCACGGTC 412 

Qy 3319 

AGAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGAT 33 7 8 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 411 CG GCCTTTCAGCAGGCAGCTCAGCACAAACACTGCCTTG- 

GGCTGCT 366 

Qy 3379 

TAATGATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACA 3 43 8 
II I I I I I I I I I I I I I I I 

Db 365 TGGCTCCACC ACCAA — 

TTGCGT 345 

Qy 3439 

GGTAAGGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGAC 3 49 8 

I I I I I I I I I I I I I I I I 



Qy 3499 

TATGTATACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACA 355 8 
I I I I 

I 

Db 316 AGGG 

CCC 310 

Qy 3559 

CACATATATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAG 3618
III I I I I I I I 

I 

Db 3 09 CCCA CCGC- 

CTCCAGGAGACGGGTG 2 86 

Qy 3619 CTTCGTC-CAGCCACTCTTCA- 

GCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGT 3676 

I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 2 85 GATC— CGCAGCCCCAGTGCAGGCTC TCGGCGCAG 

GCGGCGTCCAGGT 2 40 

Qy 36 77 GGAGCA- 

AGCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCT 3 735 
I I I I I I I I I I I I I I I I I I I 

I I 

Db 239 AAACCACAGCCC GTTTCACTTTGGCCA 



GCA 210 



Qy 3736 

TTTAGTCAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACA 3 795 

III I I I I I I I I I I I I I I I I I 

I 

Db 2 09 CCTGCTC CCATCCTTGCT GGGTAAAGGAC 

AGTACGCCCGAGGCG 166 

Qy 3 796 CACATAAAAAACCCAACAGGTC — 

AAAATAAAAGTTGAACTTGAGTTACATTTAATTTAA 3853

I I I I III II II I I I I I I I I 

Db 165 CTCAT GGTTGGGGATTCGCAG 

ACTTGGGAAAC 13 4 

Qy 3854 

ATATAAATGCATTTTGAGAAATGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGG 3913 
III I I I I I I I I I I I 

I 

Db 133 — TACGGTGC AGGAAC 

TTCTTTCAGAACTCACCGC 101 

Qy 3914 

TACTGTAAAATAAGCTGTGGTCTATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAA 39 73 

I I I I I I I III II III III I I I I I I I I I 

I 

Db 100 TTCCGGAAATTGGGCTCCGGGAGACTTTGACAG T C T C C AC AG T AC 

ACCCAG 50 

Qy 3974 

TAGGATGTCATGTTTGACATTTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACC 4 033 
I I I I I I I I I I I I I I I I I I 

Db 49 TAGGCCAAC AGTGC — TCGAGGAATTC — CGC 

AGCT-T 17 

Qy 4034 TTAGAACTGTACTA 4047

I I I I I I I I II 
Db 16 TTAGAGCAGAAGTA 3 
Qy 326

CCCAGTGTGCCGCCAGACCTCCATCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAA 3 85 
I I I I III III I I I I I I I I I I I 

Db 1 CACTGTTGGCCTACTGGCTACTATCAGGATC 

GTGGC 36 



Qy 3 86 TTTCTTCATCACAAACCTGATGGACGTGCTGCAGCGAACTCCA — 

GGCAGCAACGCTG 441 

I I I I I I I I I I I I I I I I 

I I I 

Db 3 7 TATTTT GAAGAGCT 

GATCACCATGTTGGAAGCAGCACTG 75 



Qy 442 AG 

GAGTCTTCCATCCTGGAGACAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCT 49 7 

I III I I I I I I I I I I I I I I I I I I I I I 

I 

Db 76 GGACTTGAGCGAGCTCACATGGGAATGTTTACTGAATTAGCTATTCTATA — 

CTCTAAAT 133 

Qy 498 

TGCCCAAACCACGATGGGAATGTGATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATG 557 

I I I I I I III I I I I I I II I I I I I I I I I I 

I I 

Db 13 4 T TAAGCCTC AGAAAATGAGGGA GC ACCTG-GAG- 

CTGTTCTG 173 



Qy 558 

TGTCGGGAGTGCACGGAGGGGGAGCACGCAGAGCACCCCACAGTTCCACTCAAGGATGTG 617 
I I I I I I I I I I I I I I I II 

Db 174 - G T C TAG AG T G AAT ATTCCCAAGGT 

GCTAAGAGCTG — 208 



Qy 618 GTGGAACAGCACAAGGCCTCG- 

CTCCAGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCC 676 

I I I I I I I I III II II I 
Db 2 09 CAGAACAAG — 

CTCATCTTTGGGCAGAACTGGTGTTTTTGTATGACAAGTATGA 2 6 0 



Qy 677 

AGAAATAGATTCTGCTCTTCAGTTCATCTCTGAAATCATTCATCAGTTAACCAACCAAAA 736
I I I I III I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 261 AG AAT AT GAT AAT GC - CAT AAT T AC CAT GAT G - 

AAT CATC C AAC T G AT GC C T GG AAAG AA 318 



Qy 73 7 GGCCAGCATCGTGGATGACATTCATTCCACCTTTGATGAGCTCCA— GAAG- 

ACTTTAAA 793 

I I I I II III I I I I III III III I I 

Db 319 GGGCA-ATTCAAAGATATCATT ACCAAGGTTG 

CCAATGTGGAACTATACT 36 7 



Qy 794 

TGTGCGCAAGAGTGTGCTGCTTATGGAATTGGAGGTCAACTATGGCCTCAAACACAAAGT 853 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 36 8 AC AG AGC AAT AC AG T T C T AC T T A — GAAT TCAA 

GCCTCTGTTGTTAAAT 414 

Qy 854 

CCTCCAGTCGCAGCTGGATACTCTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAA 913 
I I I I I I I I I I I I I I I I I I I I II 

Db 415 GAT TTGCTGATGG-TGCTGT-CTCCACGG 

TTGGA 4 46 

Qy 914 CTTCACAGCGCAGGCCCTCAACCAT 

GGCACGGAGACCGAGGTCCTACTGGTGAAGAA 970

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 447 — TCACA- 

CTCGTGCAGTCAATTATTTCAGCAAGGTTAAACAGCTACCACTGGTGAAACC 5 03 

Qy 971 

GCAGATGAGCGAGAAGCTGAACGAGCTGGCCGACCAGGACTTCCCCTTGCACCCGCGGGA 103 0 

I I I I I I I I I III I I I I 

I 

Db 504 GTATTTG CGTTCAG TTC 

AGAACC A 52 7 

Qy 1031 

GAACGACCAGCTGGATTTCATCGTGGAAACCGAGGGGCTGAAGAAGTCCATCCACAACCT 109 0 

I I I I I I I I I I I I I I I III I I I I I 

I I 

Db 52 8 TAACAACAA ATC-TG 

TGAATGAATCATTGAACAATCT 563 

Qy 1091 

CGGGACGATCTTAACCACCAACGCCGTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCT 115 0 
I III 

I 

Db 564 

TTTTAT T 570 

Qy 1151 

GCGGCAGACCATCATCGGGCAGCCCATGTCCGTCACCATCACCACCAAGGACAAAGACGG 1210 

I I I I I I I I I I I I I III I I I I I I I I I I 

I 

Db 571 ACAGAAGAAGATTATCAGGC TCTGCGAACATCA 

ATAGATGC 611 

Qy 1211 TGAGCTGTG- 

CAAAACCGGCAACGCCTACCTCACCGCCGAACTGAGCACCCCCGACGGGA 12 69 

I I I I I I I I I I I I I I I I I III 

I 

Db 612 T TATGACAACTTTGACAA TATCTCGCT 

TGCTC A 644 

Qy 1270 GCGTGGCAGACGGGGAGATCCTGG-

ACAACAAGAACGGCACCTATGAGTTTTTGTACACT 1328

I I I I I I I I I I I I I I I I I I I I I I I 

Db 6 45 GCGT TTGGAAAAACATGAAC 

TCATTGAGTT 6 74 



Qy 1329 GTCCAGAAGGAAGGGGACTTTAC —

CCTGTCTCTGAGACTCTATGACCAGCACATCCGAG 1386

I I I I I I I I I I I I I I I I II 

I I 

Db 6 75 CAGGAGAA TTGCTGCTTATCTCT 

TCAAAG 7 03 

Qy 1387 

GCAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCTGATGTGTCTCCCACCACAGAAGGCG 14 46 

I I I I I I I I I I I I I I I 

Db 70 4 GCAAC AATCGCTG 

GAAACAGA G 72 5 

Qy 1447 

TGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGAC 15 06 

I I I I I I I I I I I I I I II I I I I I I I 

Db 72 6 TGTAGAG C T G TGC AAGAAAGAC AGC C T T T AC A 

AGGATGCAATG 76 8 

Qy 1507 

CCGCAAGCATGTACAGCACTGGAAAACGAAAAGAGAATCCCATCGAAGACGATTTGATCT 1566 
I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 769 CAGTATGCTTCTGAATCTAAAGATACTGAATTG GCTGAAGA 

ACTCC 814 

Qy 1567 

TTCGAGTGGGTACCAAAGGAAGAAATAAAGGAGAGTTTACAAATCTTCAGGGGGTAGCTG 1626 

I I I I I I I I I I I I I I I I I I I I I I I III II I

I I I 

Db 815 TGC-AGTGGTTTTTGCAGGAAGAAAAAAGAGAGTG 

CTTTGGAGCTTGTCTG 864 

Qy 1627 

CATCTACAAATGGAAAGATATTAATTGCAGACAGTAACAACCAATGTGTGCAGATATTTT 16 86 

I I I I I I I I I III 

I I I I 

Db 865 — TTTAC CTGTTAC 

GATCTTTT 884 

Qy 16 8 7 CCAATGATGGCCAGTTCAAAAGTCGTTTTGGCA — TACGGGGACGCTCTC 

CGGGGCA 1741 

I I I I I I I I I I I I I III II I I I I I I I II 

Db 8 85 AAGGCCAGAT 

GTCGTCCTAGAAACTGCATGGAGGCACAATATCATGGAT 933 

Qy 1742 

GCTGCAGCGGCCCACAGGAGTGGCTGTACATCCCAGTGGGGACATAATCATTGCCGATTA 18 01 

III I I I I I I I I I I I I II I I I I I I I 

Db 93 4 TTTGCCAT-GCCCTAT 

TTCATCCAGGTCAT G AAGG AG T AC T T G AC AAAGG 982 

Qy 1802 TG— 

ATAATAAATGGGTCAGCATTTTCTCCTCCGATGGGAAATTTAAGACAAAAATTGG 185 8 

II I I I I I I I I I I I I I II I I I I I I I I 

I 

Db 9 83 T G G AT AAAT T AG AT GC T T C AG AAT CAC 

T G AG AAAAG AAG AAG 102 4 



Qy 1859 

ATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTGTGGACCGCAATGGGCACATTATTGT 1918 

II I I I I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1025 AAC AAG C T AC AG AG AC AC AAC CCATTGTTTATGGTCAGC CCCA- 

GTTGA 1072 



Qy 1919 TGTGGACAACAAGGC 

GTGCTGCGTGTTTATCTTCCAGCCAAACGGGAAAATAGTCAC 19 75 

II I I I I I I I I I I I I I I I I I I II III 

I I I 

Db 10 73 TGCTGACAGCAGGACCCAGTGTTGC CGTCCCTCC — CCAGG- 

CAC 1114 



Qy 1976 

CAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTGCAGGTCCCCATTTTGCAGCTGT 2 035 
I I I I I I I I I I I I I I I I I I I I 

Db 1115 CT — TTTGGT TATGG 

TTATACCGCACCACCGTATG 1147 



Qy 2036 

AAATAGCAATAATGAGATTATTATTACAGATTTCCATAATCATTCTGTCAAGGTGTTTAA 2 095 

I I I I III II I I I I 

Db 1148 GACAG CCA 

CAGCCTGGCTTTGGGT 1171 



Qy 2096 

TCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATGGAGAAGGAAATGGGCAGTTTAA 2155 

III I I I I I I I I I I I I I I 

Db 1172 ACAG CATGT GAGATG 

AAGCGC 1192 



Qy 2156 TGCTCCAACAGGTGTAG-CA- 

GTGGATTCAAATGGAAACATCATTGTGGCCGACTGGGGA 2 213 

I I I I I I I I I I I I I III I I I I I I 

Db 1193 TGATCC 

TGTAGTCACCTATTTTCGTACTGAAACATCGTCTTTACCCACT 12 41 



Qy 2214 

AACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCATTTTTGTCCTACATTAACACATCT 2 2 73 

II I I I I I I I I I I I II III 

Db 12 42 TCTCAGTTTATAATGGGGG 

AAAACA 1266 



Qy 22 74 GCTGACCCACTCTATGGCCCCCAAGGCCTG — GCCCTAACTTCA- 

GATGGTCATGTTGTG 233 0 

III I III I I I I I I I I I I I I I I I I 

Db 1267 

GGCAACGTGTTCTTGTAACCTTTATTTCATGAAGGACTTCTTTT- 1310 



Qy 2331 

GTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGATACTTACAGTAATGGTGGGCA 239 0 
I I I I I I I I I I I I I I I I 

Db 1311 TGTTTCTAA — CTAT — 

AAACT 1328 



Qy 2391 

GGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGAATGGATTTCTCAATGCGGGA 2 45 0 



I I I I I II I II I I I I I I I I I I I I I I

Db 1329 — TGGATCACCTAT GT TAAAACCT 

TATTTCACATT 1361 



Qy 2451 

CCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCATTGGTGAACTTTCCAAGGTTAT 2 510 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 1362 C C AC AT C AT - T T T AGAAT T TAT T T TC — GAAGG 

GGAATAGTTTCAATGTTTTA 1411 



Qy 2511 

TTCTGAATGTAACAATTTCCTTAAAAATGACTTATCCAATTTCTGTATTTCACCTTTAGG 2 5 7 0 
III III I I I I I I I II III 

Db 1412 TTC ACTTG GGCTTTT TTTC 

TTCCCC 1436 



Qy 2571 GTTAAAAAAAACTCTTCTACTGAATCTATAAA-AACTG — 

CAGTTTTACATCTGTGAACT 262 7 

I I I I I I I I I I I I I I I I I 

Db 1437 CTCTT 

TCTTTAAAGAACTGCTCAATATTCAATCTGT 1472 



Qy 2628 

ATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTTGCAAATCAAACAGACACTTA 2 6 8 7 

II I I I I I I I I I I II III III I I I I I I 

I 

Db 1473 -TG TGAAGAACCTGATTT GC ACTC 

TGTAGTGTTTAAAGAAAC — A 1514 



Qy 2688 

AAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGTGAATGGTAGCTTTTGCACAGAA 2 7 4 7 
I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1515 AAG AAAC T C T AAT AT TGAATCTCTTAAATTTAGTGTATG 

TAAACAG — 156 0 



Qy 2748 

CTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGTTATATACTTCATTTAACCTAGGT 2 8 0 7 
I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1561 CTTACAAATACG TATTGTC — TAAATG 

CATTTA 1591 



Qy 2808 

CACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAGTAGGTATTACTCTTGTGACATT 2867
I I I I I I I I I I I III II 

Db 1592 AATCTGTT TTATTCA — 

AAGAAAAGC T AAAG 162 0 



Qy 2868 

TTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAAAAAGTAAGGCTGTCTTGCAAA 2 92 7 

I I I I I I II II II I I I I I I I I I I 

I 

Db 1621 CAAAAAC 

ACTGGCATATGACCATGCAAGACTGTC A 1655 



Qy 2928 

ATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTCAGAACTTAAGTATCAGTGCAAATT 2 9 8 7 
II I I I I II I I I I I I I I I I I I I I I 

Db 1656 GTG CCAACAAAGA CAACACT AATCAGCAC ATC- 

GTACA 1692 



Qy 2988 

TCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTT 3 0 4 7 

I I I I I I I I I I I III 

I I 

Db 1693 CTGGATT — GCAGTGCTTC 

CCAGATTATT 1719 

Qy 3048 

GGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCC 310 7 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 172 0 G AAAAATGTTACAG ACAACTT GCCTGATTTTT- 

AAATG — 1756 

Qy 3108 

TTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGGGAAACAAACTGCCCCGTTAACTC 316 7 

I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 175 7 AGCGT AAAAGGCCCTCTAACCTATG 

CAGGTTTCCCCATTA 1796 

Qy 316 8 CAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGTCATGTTTTATGTAC — 

CATGTTT 3225 

I I I I II I I I I I I I I I I I I I I I II III 

I 

Db 179 7 TGCA TATAGAAAATGCTAGT-ATGTTTTGCTCACTTCAT- 

ATG 1837 

Qy 3226 TCACACGTGTCTCTTCTCTTCGACT 

TCCTGAAAGCGAAAGCTTTACCTCCTGCAAAT 3282 

I I I I I I I I I I I I I I II I I I I I I I I I I II 

Db 183 8 TAACAGGTG-CCCTTATGTTGTGCTGTATCCTG 

TGCTTTTTCT 1879 

Qy 3283 

GTCAGCACATGTAGTAGGACACCAGTATCCTAGGACAGAGAGCCATAAGTAGCCCTTTGG 33 42 
I I I I I I III I I I I I I I I 

Db 1880 

GTGGGACCATTCCATTCAGGAGCAAAGAGC 19 09 

Qy 3343 

AGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATGATTCCCCCTTAGAAAGCAA 3 4 02 

III I I I I I I I I III 

Db 1910 ACC ATGATTCC 

AATCTT 1926 

Qy 3 4 03 GTGTTACCAAAGTTGTGTTATCTTGAAAGCATTAC 

AGGTAAGGGCATGTTATGGTTA 3 459 

III I I I I I I II I I I I I I I I I I I I I I I I III 

I I 

Db 1927 GTG TGTGTTTACT AACCCTTCCCTGAGGTTTGTGTATGT — 

TGGATA 19 71 

Qy 3 46 0 TTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGG — 

ACTATGTATACATGATTAGGG 3517 

II I I I I I I I I I I I I I I I I 

Db 1972 TT 

GTGGTGTTTTAGATCACTGAGTGTACA 2 000 

Qy 3518 

TAAGATAGAATGTATTATATATATATATATATATACACACACACATATATATAGCTGAAT 3577

I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2001 GAAGAGAGAA- 

AAATATTGCTGT — 2 031 



-ATTCAAACA- 



Qy 3578 

CTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTCGTCCAGCCACTCTTC 363 7 



TCTTC 2036 
Qy 3638 

AGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGCAAGCCCTAAAGCAGAT 369 7 

II III I I I I I I I I I I III 

I I 

Db 2037 AG TTT TGTTT 

GTGGAATTTGAAATTACTCAAAT 2 069 

Qy 3698 

TTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAGTCAGAAAATGGCCTTCT 3 75 7 

I I I I I I III III I I I I II I I I I I I I 

Db 2 0 70 TTAAAAT AAATTACTGGACT GTGGAAATAA — 

CATAGAATTG 2109 

Qy 3758 

GTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACACACATAAAAAACCCAACAGGTC 3817
I I I I II I I I I I I II I I I I 

Db 2110 AAGTTTT 

AATTAAATACCACTCAAACG 2136 

Qy 3818 AAAATAAAAGTTGAACTTGAGTTACATTTAATTTAAATATAAATGCATTT-

TGAGAAATG 3876

I I I I I I I I I I III II I I I I II III I I I I I I III 

I I I 

Db 213 7 AAAAGAACAGTAGTTTTTG 

TAGTTTTATAT T GG AT AC TGAGGC AT T AGGGAGGC ATG 2193 

Qy 3877 

TTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGTACTGTAAAATAAGCTGTGGTCT 3936 
I I I I I I I I I I 

Db 219 4 AAAGGAA 

GAGGAATGAG 2210 

Qy 3937 ATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAATAGGATGTCATGT- 

TTGACATTT 3995 

I I I I I I I I I I 

I I I I I I 

Db 2211 GATTGA 

G AC AT G T G AAG AC AT T G 2 233 

Qy 3996 

TTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCTTAGAACTGTACTATAAGAAAA 4 055 
I II I I III II III II I I I I I I I I I 

Db 2234 TGCATTATATCAATGTG CATTCCTG- 

TAGTTCATTAACAAGGTAC 22 7 7 

Qy 4056 

CATTTCCCCTATGTATAATTATATGAATGTGATGTTTATTGCTTATTAATTTATAATTC- 4114 

III I I I I I I I 



AAT AG T C T AAAG AAC C 229 7 



Qy 4115 — 

AGTCATTCTCTATATAGGACTTCTTAAAATTTAGAAGGGAAATCTAGCTACTTCAAAT 4172

I I I I I I I I I I I II I I I I I I I I I I I I I I III 

Db 229 8 AGAGTCACT-ACTATAGTGG CTTAACATTT 

AATCTGTCTCC 2337 

Qy 4173 

TGTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAAGGTTTTTCCAGGAAGATT 4232 

I I I I I I I I I I II II I I I I I I I I I I I 

Db 233 8 AAT AT T T T AACC AAG T - G AC AC C GG GGTTTTTATC- 

GAAGCAT 23 7 8 

Qy 4233 

TACATTTAGGTTTAATATTTTTTTAGTTAGGTAGAGTTTTAAAAAATACTTGAGCCTGTC 42 92 

I I I I I I I I I I 

I I 

Db 2379 TTCACT 

TAAATGAACAAATC 239 8 

Qy 4293 

CGTGATAAAGCTATAAAATTCAATAACTTTTTAGAATGTTAAATGAAGACACTGTTTCCT 4352 
I I I I I I I I I I I I I 

Db 2399 ATGGCTGTTATAT 

TAACT 2416 

Qy 4353 

AACATCAGTGAGATACATCTTTGAATTTAAACATTCATATTTACTGAGTACCTACTAGGT 4 412 

I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2 417 TGAAATAAA AT AT AT T T AAA 

CATGTAAAAAAAAAAAAAAAAAGGC 2 461 

Qy 4413 

ACCAAGTACTCTTTTAGGCACTGGAAATACAGTGATGGACAAAACAGGTAAAAAATCGCT 4 4 72 
I I I I I I I I I I I I I I I 

I I I I 

Db 2 462 CACATGTGCTC GAGCTG CAGG 

TCGCG 2487 

Qy 4473 GCCCC 4477 

III I 

Db 2488 GCCGC 2492 
Qy 1949 CTTCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGC-

CGAGGAAATGGGGACAGGC 2007 

I I I I I I II III I I I I 

Db 2495 CTAGCGGCC — GCG ACCT 

GCAGCTCGAGCACATGTGG 2 461 



Qy 2008 

AGTTTGCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATT 2 06 7 

I I I I I I II I I I I I I I I I 



TTTTTTTTACATGTT 2435 
Qy 2068 

TCCATAATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCT 212 7 

I I I I I I I I I I I I I I I I I I III I I I I 

I 

Db 2 43 4 T AAAT AT AT T T TAT T TC AAGT T AAT AT AAC AG CCAT 

GATTTGTT 2391 



Qy 2128 CAAATGGAGAAG-

GAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAAT 2186 
II I I I I I I I I I I I I I I III 

I 

Db 2390 CATTT AAGTGAAATGCTTCGATAAAAACCCC

TCACTT 2350 



Qy 2187 

GGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGA 2246
II II III I I I I I I I 

Db 2349 GGTTAAAATATT

GGAGACAG 2330 



Qy 2247 

TCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCC 2306
I I I I I I I I I I I I I I I I 

Db 2329 ATTAA

ATGTTAAGCCACTATA 2309 



Qy 2307 

CTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTAT 2366 
II II I I I I I I I I 

I 

Db 2308 GTAGT GACTCTGGT



TCTTT 2290 



Qy 2367 

CGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAAC 2426



Db 2289 AGACTATTGCA TG TACCTTGTTAATGAACT

AC 2258 

Qy 2427 

TGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAAT 2486
I I I I I I I I I I I I I I I I I 

Db 2257 AGGAATG CACATT 

GATATAATGC 2235

Qy 2487 CATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACT-

TAT 2545 

I I I I I I I I I I I I I I 

I I 

Db 2234 ACAATGTC — 

TTCACATGTCTCAAT 2212 

Qy 2546 

CCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAAC 2605

I I I I I I I I I I I I I I I 

I 

Db 2211 CCTCATTC CTCTTC 

CTTTCATGCC 2188 



Qy 2606 

TGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATT 2665 



Db 2187 TCC CTAATGCCT- 

CCAAT- 2165 



Qy 2666 

TTGCAAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGT 2725
I I I I I I I I I I I I I I I I III I I I I I I I 

I 

Db 2164 ATAAAAC TACAAAAACTA CTGTTC

TTTTCGTT T 2132 

Qy 2726 

GAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTAGTT 2785

I I I I I I I III I I I I I I I I I I I I I I 

I I I 

Db 2131 GAGTGGTA TTTAATTAAAACTTC AATTCTAT 

GTT 2098 

Qy 2786 

ATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAG 2845
III II III 

I I I 

Db 2097 ATTT CCACAGTC

CAG 2083

Qy 2846 

TAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGA 2905

II I I I I I I I I I I I I I 



Db 2082 TA- 

TAAATTTG — 2065 



-ATTTATTT- 



Qy 29 06 AAAAGTAAGGCTGTCTTGCAAAATGATC — 

CCAGCTCTGATTAGCAGCCCTCTGGAGTTC 2963 

I I I I I I I I I I I I I II I I I I I I I I I II 

I I I 

Db 2 06 4 AGTAA 

T T T C AAAT T C C AC AAAC AAAAC TGAAGAAC AGC AAT AT T T T G T T - 2 016 

Qy 2964 

AGAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTT 3023

II I I I I I I I I I I I I I I I I I II 

Db 2015 TG — AATTTCTCTC TTCTG TACAC 

TC 1992 

Qy 3024 

GTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCA 3083

I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1991 -AG T GAT C — T AAAAC AC C AC AAT ATC C AAC AT AC AC AAAC C 

TCA 1950 

Qy 3084 

GGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTG 3143 

I I I I I 

I 

Db 1949 

GGGAA G 1944

Qy 3144 

GGAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAG 3203

II I I I I I I I III I I I I I I I I I I 

Db 19 43 GGTTA GT AAAC AC 

ACACAAGATTGGAATCATG 1912 

Qy 3204 

TCATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAA 3263

I I I I I I I I I I I I I I I I I 

I I I 

Db 1911 GTG-CTCTTTGC TCCTGAATG- 

GAA 1889 

Qy 3264 

AGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGACAGAGA 3323 
II II I I I I I I I I I I I I I I I I 

Db 1888 TGGT C C C AC AGAAAA AGCAC 

AGGATAC 1862 

Qy 3324 

GCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATG 3383
III I I I I I I I I 

Db 1861 AGC 

ACAACATAAGG 1848

Qy 3384 

ATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACAGGTAA 3443

III I I I I I I II III I I I I III 

I I 

Db 18 47 GCACCTGTTACATATGAAGTG AGCAAAACA— 



TAC 1815 



Qy 3444 

GGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTATGT 3503

I I I I I I I I I I I I I I I III 
Db 1814 TAGCAT TTTCTATATGCATAATG 

GGG 1789 

Qy 3504 

AT AC AT GAT T AGGG T AAG AT AG AAT G T AT TAT AT AT AT AT AT AT AT AT AC AC AC AC AC AT 3563 

I I I I I I I I I I I I I I I I I III 

I I I 

Db 178 8 AAACCTGCATAG GTTAGAGGGCCTTTT 

ACGCTCAT 1754

Qy 3564 

ATATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTCG 3623 

I I I I I I I I I I I I II II I I I I I I I I I I 

I 

Db 1753 TTAAAAATCAGGCAAGTT-GTCTGTAACATTTTTCAATAATCTG GGAAGC- 

ACT 1702 



Qy 3624 

TCCAGCCACTCTTCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGCAA 3683
I I I I I I I I I I I I I I I I I I I I 

Db 1701 GCAATCCAGTGTAC 

GATGTGCTGATTAGTGTTGTCTTTGTTG 1660 



Qy 3684 

GCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAGTCA 3743
I I I I I III I I I I I I I II I I I I I I I 

Db 1659 GCACTGA — CAG TCTTG-CATGGTC ATATGCCAGTG 

TTTTT 1622 



Qy 3744 GAAAATGGCCTT — 

CTGTGCTTT C AAAAAAAAAAC AAAAAAAAAAC C AC AC AC AC AC AT A 3 8 01 

I I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1621 GCTTTAGCTTTTCTTT G AAT AAAAC AG AT T T AAAT - 

GCATT 1582 



Qy 3802 

AAAAACCCAACAGGTCAAAATAAAAGTTGAACTTGAGTTACATTTAATTTAAATATAAAT 3861

II I I I I I I I I I I I I I III I I I I I I I I I I I I I 

I I 

Db 1581 TAGA CAATACGT — ATTTGTAAGCTG T T TAC AT AC AC T AAAT T T AAG 

AGAT 1533 

Qy 3862 

GCATTTTGAGAAATGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGTACTGTAA 3921 

I I I I I I I III I I I I I I I I 

I I I 

Db 1532 T C AAT AT TAGA GTT 

TCTTTGTTTCTTTAA 1504 

Qy 3922 

AATAAGCTGTGGTCTATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAATAGGATGT 3981

I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 15 03 AC AC TAC AG AG T GC AAAT CAG GTTCTTCAC 



AACAGATTGAATAT 1460



Qy 3982 

CATGTTTGACATTTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCTTAGAACT 4041

I III I I I I I I I I I I I I I I I I II 

Db 1459 TGAGCAGTTCTT T AAAG AAAG AG GGGGAAG 

AAAAAAAGCCCAAGTGAATA 1410 

Qy 4042 

GTACTATAAGAAAACATTTCCCCTATGTATAATTATATGAATGTGATGTTTATTGCTTAT 4101 

II I I I I I I I I I I I I III II II I I I I I I III 

Db 1409 AAACATT 

GAAACTATTCCCCTTCGAAAATAAATTCTAAAATGATGTGGAATGTGAAA 1353 
Qy 4102 

T AAT T T AT AAT T C AG TCATTCTC T AT AT AGGAC T TC T T AAAAT T T AGAAGGG AAAT C TAG 4161 

II III I I I I I I I I I I I I 

Db 1352 TA AGG TTTTAACAT 

AGGTGATC 1331 

Qy 4162 

CTACTTCAAATTGTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAAGGTTTTT 4221

I I I I I I I I I I III I I I I I I I 

I I 

Db 133 0 CAAGTTTATAGTTAGA AAC 

AAAAAGAAGTCCTT 1298

Qy 4222 

C C AGG AAG AT T T AC AT T T AGG T T T AAT AT T T T T T T AGT T AGGT AGAGT T T T AAAAAAT AC 42 81 

I I I I I I I I I I I I I I 

I I 

Db 129 7 CATGAAA 

TAAAGGTTACAAGAACAC 1273

Qy 4282 

TTGAGCCTGTCCGTGATAAAGCTATAAAATTCAATAACTTTTTAGAATGTTAAATG 4337

I I I I I I I III I I I I I I I I I I I I 

Db 1272 GT-TGCCTGT TTTCC — 

CCCATTATAAACTGAGAAGTGGGTA 1234

Qy 4338 

AAG AC AC T G T T T C C T AAC AT C AG T G AGAT AC ATC T TTGAAT T T AAAC AT TCATATTTACT 4397 

I I I I I I I I I I I II II I II I III I I I I I I I 

I 

Db 1233 AAGACGATGTTTCAGTACGAAAATAGGTGACTAC— AGGATCAGCGC- 

TTCATCTCACAT 1177 

Qy 4398 G- 

AGTACCTACTAGGTACCAAGTACTCTTTTAGGCACTGGAAATACAGTGATGGACAAAA 4456

I I I I I I III I I I I III II III I I I I I I I I I 

Db 1176 GCTGTACC — CAAAG — CCAGG CTGT — GG — 

CTGTCCATACGGTGGTG 1136 

Qy 4457 

CAGGTAAAAAATCGCTGCCCCCTCAGAGCTGACATTCTGGGGTGGGAATTTCATTTTGCC 4516 
I I I I I I I I II III I I I I I I I I I I I 

I I 

Db 1135 -CGGTATAA CCATAACCAAAAGGTGC CTGGGGAGGGA CG 

GCA 1095 



Qy 4517 

ACGTACTAACGTTCTGCACAAAAGACAGGCTAGACTCTTGTCTAGATTGTTTAAAAGAAA 4576

I I I I I I I I I I I I I I I I I I I I 

Db 109 4 AC— ACTGG-GTCCTGC TGTCAGCATCAACT 

GGGG 1063 

Qy 4577 

CTTTTCAAATTGGTTACATTAATTTTAGTTTATTTTCACAAGTAAAAATGGCTTTTTATT 4636 

II III I I I I I I I I I I I I I I I I I 

I I 

Db 1062 CTGACCATA AACAATGGGTTGTGTCTCT 

GTAGCTTGTTCTT 1022 

Qy 4637 

TAGATTCTTTCTGTCCCAGGCTGTTGATCTTAAAACTAGTTGATTTAAAGAGTTTTTTTG 4696 

I I I I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 1021 CTTCTTT TCTCAG TGATTCTGAAGC ATCTAA 

TTTATC 985 

Qy 4697 

CACAACATTTCAATTATATTTGTGAACTTAGAAATTAACTTACAATCTAACCAGCCATCA 4756

III I I I I I I I I II 

I I I 

Db 984 CAC CTTTGTCAAGTACT 

CCTTCA 962 

Qy 4 757 TATCAT— ATCCTATCAGGC- 

T AG AT AT C T C AAT AG TAG AC T G AAT AC AAAGC TAAT T T T 4813 

I II II II I I I I I I I I I I II II I I I I I 
Db 961 TGACCTGGATGAAATAGGGCATGGCAAAATCCATGATATTGTGCCT- 
CCATGCAG 908 

Qy 4814 TTTTACATGTCAATAT-TGGCACAAACTGGAATGAAAG- 

AAT AG T T T GAT T C AG AC C T GC 48 71 

III I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I I 

Db 907 - T T T C T AGG AC G AC AT C T GGC C T T AAAAG AT C G T AAC AGG T — 

AAACAGACAAGC 856 

Qy 4 8 72 TCCA 

CTATGTGTTGCTAAAACACATGCTATGAGCACTCGAGGAAACACTATAT 492 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 855 TCCAAAGCACTCTCT-TTTTTCT 

TCCTGCAAAAACCACTGCAGGAG 811 

Qy 4925 

T T T T T C C AAAAAAT AT G T G AT T AT AT ATGT T AAAGT AT AGAT AAC AT TTCACACTTGGAT 4984 

I I I I I III II I I I I I I I I I I I I I I I I I I I I I 

I 

Db 810 TTCTTCAGC C AAT T C AG T AT C T T TAG AT T C AG AAGC AT — 

ACTGCATTGCATCCTTGTAA 753 

Qy 4985 

ACATATGTGCATTTACTGTATTTCTTGGTAAGCATATTTTTGGGGGAAAGTGCTGCTGAT 5044

I I I I I I I I I I I I III III 

I 

Db 752 A GGCTGTCTTTCTTGCACAGCT CTAC- 

ACT 724 



Qy 5045 

ATGATACAAGTAGACAAAATTTAAATGAAATTTTGTCACATTCTATGGAAAATGGTTTCT 5104
I I I I I I II I I I I I 

Db 723 CTGTTTCCAG CGA 

TTGTTGC 704 



Qy 5105 

GGTAAACTGAGAAGGATATTAAAATAAGTGGCTTTTTTCTGGGCTACCATTATTGTTTGA 5164
II I I I I I I I I I I II 

I 

Db 703 CTTTGAAG 

AGATAAGCAGC A 684 



Qy 5165 TTTCTCTTTGTCAAGTGTATAGAACCTGTC- 

ATACATTCATGATAAGTAGCACTGAAAAA 5223 

I I I I I I I I I I II 

Db 6 83 ATTCTCCT GAAC 

TCAATGAGTTCATGTT 656 



Qy 5224 

TTACTCATTCAAATTTCCCCTGGGCACGTAAGGCAAAATATTGCCGGTTGGGATTTCAAG 5283
III I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 655 TTTC CAAA CGCTGAGC AAGCGAGATATTG 

TCAAA 622 



Qy 5284 

GTCAGTGACGACGCATTTCCTCCCAGTACAGACCCCCCAGCCCCCCTTGCTGGACATGGG 5343
II I I I I I I I I I I I I I 

Db 621 GT TGTCAT 

AAGCATCTATTGATGTTC 596 

Qy 5344 

GAGGCAGAGAGTCACTTGACCATC C AG AAAT AC AT G AC T AC AAG TCCTTTAT G AC T G T T T 54 03 
I I I I I I I I I I I I I III 

Db 595 GCAGAG CCTGATAATCTT CTT 

CT 573 

Qy 5404 

GCCATTTTTTTTAATGGTACTTAGTATTTTGATCAAACTTTAGTCTCCAGAACTAAACAA 5463
I I I I I I I I I I I I I II I 

Db 572 

GTAATAAAAAGATTGTTCAATGATTCAT 5 45 

Qy 5464 GTCCCTAAGTTTCCTTATTTTAATTTACTGTGACTAGATTTGAAGCAAATA — 

AATACTC 5521 



Db 544 -TCAC — AGATTTGTTGTTATGGTT 

C T GAAC T G AAC GC AAAT AC GG T T T C AC 496 

Qy 552 2 CAGATCCATGCAGCTAGAACACACTTGCTTCCACTACTAAATATACAGGGTATG- 

TCCTA 5580 



Db 495 CAG TGGTAGCTGTTTAAC-CTTGCTGAAATAATTGACTGCAC- 

GAGTGTGATCCAA 4 42 



Qy 55 81 ACATGGAGTTAACTGGAATAGCAGTACA-CTAGCA — 



AGTATCTGTGAATCCTTAGCACT 563 7 

I I I I I I II I I I I I II I I I I II I I I I I I I 
Db 441 C C G T G G AG AC AGC AC CAT C AGC AAAT C AT T T AAC AAC AG AG G C T - 

TGAATTCTAAGTA — 3 85 

Qy 5638 

GACGGGTTAACAGAAATGCTTTGGTAATACCTACTTAGTTAATTGGAGGAAGTAGTAAAT 5697



Db 384 GAACTGTATTGCTCTG TAGT 

ATAGT T 359 

Qy 5698 

AAACATTAGGTAATCTGCAGATTACTTCAAATGGGAAAAATCTTTTTGTAGACTCTATAG 5757



Db 358 CCACATT-GGCAACCT TGGTAATGATATCTTTG 

AAT 324 

Qy 5 75 8 TACCCTCTCTATTCACTAGCTTCTGAAAAGGGAGGAGTATTTTTAG 

TTTGACAATT 5813 



Db 323 TGCCCT-TCT-TTC-CAGGCATCAGTT 

GGATGATTCATCATGGTAATTATGGCATTA 270 

Qy 5814 

TAATAATTTAAAAACAAGACATCTCCAGGTAGGAAAAAATGAAAGCTATTTCATGCAAAC 5873



Db 269 TCATATTCTTCATACTTG 

TCATACAAAA 2 42 

Qy 5874 

ATTATCTAATTTAGCTTAAAAGTGAAAGTGGTAATACTGTTGGTTTCTGTAAATGTTGCA 5933 



Db 2 41 A— CACCAGTTCTGCCCAAAGATGAGCTTG TTC 

TGCA 207 

Qy 5934 

GGGTTTTAAACTTTATAATTACTTTAATATTTTTGATAACTAGAAATCTAGTATTGCCAT 5993 
I II II III I I I I I I I I I I III 

Db 2 06 GCTCTTAGCAC CTTGGGAATATTCACT — CTAGA 

CCAGAACAGC— 165 

Qy 599 4 AAAGGAAACTAAGTGC- 

CCATCAAAGATTTGTTTGGTATAAATAAAGAATTATTTGTTTT 6 052 



Db 16 4 TCCAGGTGCTCCCTCA TTTTCTGAGGCTTAAATTTAGAGT — 

ATAGAATA 117 

Qy 6053 

GTTTTCAATGACAGTAAGCTACAAATCATGATGCTTAAAAACTTTCTAAAGATGAATTGT 6112 
I II I I I I I I I III II I I I I I I I 

Db 116 G CTAATTCAGTAAAC ATTCC 

CATGTGAGCTCGCTCAA 8 0 



Qy 6113 



GTGGCAGTGATTGGTCTGTTTGTGGAGAATGTATGAAAGCTATTAATATTCTAGAATAGA 6172 

I I I I I I I III I I I I I I 

I 

Db 79 GTCCCAGTG CTG 

CTTCCAACATGG- 56 

Qy 6173 

TTAATAAATTGGCTATGTTGTTCCAATGAATGTACAGCACTTCCATTAACTTTTGAAAGC 6232 

II I I I I I I I 



TGATCAGCTCTT C 43 

Qy 6233 

AACACAGCCTTAAACTCAATGCTTTTGCTTTATGACATGGGAATGTTCTGTCATCAATG 6291

I I I I I I I I I I I III III I I I I I I 

I I 

Db 42 AAAATAGC CACGATCC TGATAGTAGCCAGTAG 

GCCAACAGTG 1 
US-10-478-914-57/C
; Patent No. 7335755

; GENERAL INFORMATION: 
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-08-24

; NUMBER OF SEQ ID NOS : 417 
; SEQ ID NO 57

LENGTH: 2336 

TYPE: DNA 
Query Match 14.1%; Score 5407; DB 1; Length 2336;

Best Local Similarity 35.5%; Pred. No. 0; 

Matches 1495; Conservative 0; Mismatches 802; Indels 1914; 
Gaps 215; 

Qy 351 

CTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAAACCTGATGGAC 410 



Db 2336 CAGCTCGAGCAC-ATGTGGCC 

TTTTTTTTTTTTTTTTTTTTTGTA- 2293 

Qy 411 

GTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGACAGTCACT 470

III I I I I I I I I I I I I I I I I I I I I I 



I 

Db 2292 

GAGCCTGTAATT 2256 



-TTCTCAGAGCTGCC AGGAG TGCATC 



Qy 471 

GCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGATGGAATTT 530

I III I I I I I I I I I I II 

I 

Db 2255 TC CTG TTCTCTGAATCCC CC 

ATCT 2232 

Qy 531 

TACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACGCAGAG 590
I I I I I III II I I I I I 

Db 2231 TTCTGCAGCTCC 

AAGCTTTGTGT 2209 

Qy 591 CACCCCACAGTTCC 

ACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCA — G 6 4 4 

I I I I I I I II III! I I I I I I I I I I II I I I I 

I 

Db 22 0 8 CCCACAG— CCTGTGACTC TGTGCT— AACAAATC 

GCTATTGTCCAGTG 2164

Qy 645 

GTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTCATC 704

I I III I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 2163 GGGCGAATGGTGGCTGG-AACTAAAGAATTGCTG TCTGGT- 

TTCTATTCAAA 2114 

Qy 705 

TCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTCC 764

II I II I I I I I I I I I I I 

Db 2113 TC CAGGTA 

GCGAGATATATGAATG 2090

Qy 765 

ACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATGGAATTG 824
I I I I I I 

Db 2089 

GACTTT 2084 

Qy 825 GAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATAC — 

TCTGCTCC 882 

III I I I I I I I I I I 

I I I I I I I 

Db 2 0 83 TCGAA TCGTCATG 

TGAATAACGTCTGCTC- 2055 

Qy 883 

AGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAACCATGGCA 942

I I I I I I I I I I I I I I I I I 

Db 2 054 GGCATGA 

AGGCTCAGAGCCATG 2033

Qy 943 

CGGAGACCGAGGTCCTACTGGT G AAG AAGC AG AT G AGC G AG AAGC T G AAC GAGCTGGCCG 10 02 

III I I I I I I I I I I I I I 



GATTAACTCGTAGGCTG 2008 



Qy 1003 

ACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTGGAAACCG 1062 



Db 2007 ACCA CTAACATCC TTT GT 

CG 1984 

Qy 1063 

AGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCCGTTGCCT 1122
I I I I I I I I I III I II I I I I I III 

Db 19 83 AGGG AG AA- AC AT T C C C AAG T AT C 

ATTTTATTCAC 1950 

Qy 1123 

CAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCCATGTCCG 1182 

I I I I 

Db 19 49 ACTTAAT 

TTTCTA 1937 



Qy 1183 T C AC CAT C AC C AC C AAGG AC AAAG AC GG T G AGC T G T GC AA — 

AACCGGCAACGCCTACCT 12 40 

I I I I I I I I I I I I I I I I I I I I I I I II I III I I I I 

I I 

Db 1936 TC-CCAT-ACCCCCA AAAT AAGGC T AGC TAT T T AAT TAG T T GGC — 

CGCTTTTCT 1886 

Qy 12 41 CACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAG- 

ACGGGGAGATCCTGGACAACA 1299 

I III I I I I I I I I I I I I I I I I I II 

Db 18 85 CT TAATTTTTAGTGTTTCTGTTGATAATGTGTGAGTTTGGGAAAATGCT 

AAGT 1833 

Qy 1300 

AGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTACCCTGTCTC 1359 

II III I I I I I I I I I I I I I I 

I I I 

Db 1832 AGCTTTTCAC T T AG AAC AC T G T TATTTTC 

TCTT 1800 

Qy 1360 

TGAGACTCTATGACCAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATCCGATCCG 1419 

II I I I I III I I I I I I I I I I I I 

I 

Db 1799 TAAAGTTTTCTACCTTACATTT AT TAT AGC — 

ATAGTTATCTTTATAG 175 4 

Qy 1420 

CTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCC 1479

I I I I I I I I I I II I I I I I I I I I I 

I I 

Db 1753 CATAGATG C AG AAAG T AAG AG AG AGC TTGTTT — 

TTT C AAG AAAAC AAC C 1706 

Qy 1480 

ACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAG 1539 

II I I I I I I I I I I I I I I I I I 

I 

Db 1705 C T - T T AAAAT AC TTTCCAACCC 



ATGAAGGG 16 77 



Qy 1540 

AGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAATAAAGGAG 1599 
I I I I I I III I I I I I I I I I 

Db 1676 AAAAATCCTCC TTT TTTC 

CCCCAAG 1652 



Qy 16 0 0 AGTTTACAAATCTTCAGGGGGTAGCTGCA- 

TCTACAAATGGAAAGATATTAATTGCAGAC 1658 

I I I I I I I I I II 

Db 1651 TGCATTC T AC T TAT - 

TACTTTGCA 1629 



Qy 1659 AG T AAC AAC C AAT G T G T GC AG AT AT TTTC C AAT G ATGG 

CCAGTTCAAAAGTCGT 1712 

I I I I I I I I I I I I I I I I I III I I I I I I I I I I 

Db 162 8 TTTTTCTCCCAAAGT CC AAAT T T ATGC AAAG AAAAT AG AAAC AAG T T C AAAT G - 

CAA 1573 



Qy 1713 

TTTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTACAT 17 72 
I I I I I II I I I I I II 

I 

Db 1572 T GCAT TAAC CAAATAAAACAAGTCTC 

CTT 1544 



Qy 1773 

CCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCC 1832 
I I I I I I I III II I I II I I I I I I I I 

Db 1543 CAAATTAGGAACC-AACCTAAGCATTTGTAAAGTGTAGCAGAATCAGAATT- 

CT— 1492 



Qy 1833 

GATGGGAAAT T T AAGAC AAAAAT TGGATCAGGAAAGCTGATGGGACCCAAAGGAGT T TC T 1892 
III I I I I I I I I I I I I I I I I II 

I I 

Db 1491 TTT AAAAAT TAG AT TTGGAACCTGAA- 

CT 1464 



Qy 1893 

GTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCT — 195 0 
I I II I I I I I II I I I I I I I I I 

Db 1463 ATATAATTCATAATTCTCATTTTT-CTGTGGAAAATTA 

TTTTATCTTT 1417 



Qy 1951 - 

TCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAG 2 0 09 

I I I I I I I I I I I III I I I I I I I I I I I I I 

Db 1416 CTCCTGTATACCTGAAAAAATGTCCATAGGCTT 

AAAGGGTCATGCT- 1371 



Qy 2010 TTTGCAGGTCC — CCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTA — 

TTACAGA 2065 

I I I I I III I I I I I III III I I I I I I I I I I I I I 

Db 13 7 0 TTTACA-TTCCTTCCATATCACAG — 

G T AC TAT G AAG T AAGGAGAC TTT T AGGT T TC T T T 1314 



Qy 2 066 TTTCCATAATCATTCTGTCAAGGTGTTTAATCAGGA 

AGG AG AAT T C AT G T T G AAG T T 2122 



II I 1 1 1 I 1 1 I I 1 1 I I 1 1 1 1 1 1 I I 1 1 1 I 1 1 I 

1313 TTGTCTTAAAC — TCAGAC-AGCTTTGTAAGCAGTAGTGTGTAGATTACAAG- 



Qy 2123 

TGGCTCAAATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTC 2182 

I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 126 0 AG— ACAAAAGCAGGCGCGACTGAGAAG 

AGTTGGTGGGGGAGAAGCTT- 1215 



Qy 2183 

AAATGGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAG 2242
I I I I I I III I I I I I I I I I I I 

Db 1214 GGGGCACTTC CTGTC 

ACTCAACACATTCCAGAT 1182 

Qy 2243 

TGGATCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCT 2302

I I I I I I I I I I I I I I I I II III 

I I I 

Db 1181 CACTAAAAAATTTCCACACCCTCTGCAT TCCC — 

CCT 1147 

Qy 2303 

GGCCCTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGT 2362 

I I I I I I I I I I I I I I I I I I I I I I 



Db 1146 TGCCC — ACTCCAGTT 

CCCGGTATTTTCTGATTCCATAT 1110 

Qy 2363 

CTATCGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTAT 2422
I I I I I I I I I I I I I I I I 

I I I I I 

Db 1109 GTTGTGGTATTTACCATACT TCTCTCCCT 

CACTAG 10 75 

Qy 2423 

AAACTGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAG 2482

III I I I I I III 

Db 10 74 GCTCTG GCAAGAC 

TGC 1059 

Qy 2483 

GAATCATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACT 2542

III III I I I I II I I I I I I I I 

Db 1058 TTCAGAGG GGATG CA— 

TTCCTTTAGATTG 1031 

Qy 2543 

TATCCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAA 2602



Qy 2603 

AACTGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTA 2662

I III I I I I I I I I I I I I I I I I I III I I I I I 



Db 102 7 -AAAGCGG AGCTGGGAA-AATGGCT GGCAG — 

TTTCAGAATCTAGTC — 9 85 



Qy 2663 

AT T T TGCAAAT C AAAC AG AC AC T T AAAAAAAC T AGC AT ATGT AAAGGT AT TCGTTAATCC 2722 
III I I I I I I I I 

I I 

Db 984 ACGATCGCACGCATG AGCA 

CC 964 

Qy 2723 

TGTGAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAAATCTATTGTA 2782
I III I I I I II 

I I I 

Db 963 T CAC ACATCCA 

TGT- 950 



Qy 2783 

GTTATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTA 2842

III I I I I I I I I I I 

Db 9 49 CCCT ACCCGCCCC 

CCCGCTC 930 



Qy 2843 

C AG T AGG T AT T AC T C T T G T G AC AT T T T T T TGGT T ATC AAC AAC T AAAT AT AAAT T AC T T T 2 9 02 
II I I I I I I I I 

Db 929 CCG CCCCTG 

CAGCT 916 



Qy 2903 

GGAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTT 2962 

I I I I I I I I I I I I I I I I I I 

Db 915 GGCTGACCTG TCTCA 

CCCACTG 89 4 



Qy 2963 

CAGAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTT 3022
I I I I I I I I I I I I I I I I I I I I 

Db 893 CTGGCCTATCG-AACGG CCAGGACTGTCTGGTTTTGGC 

TC 855 



Qy 3023 

TGTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTC 3082
III I I I I I I I I I I I I III 

Db 854 GTG CCTTTG TCCATGT 

CTGGCTTAGTTCCTCT— 823 



Qy 3083 

AGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTT 3142 
III III I I I I I 

Db 822 CTGTCTATGCT — TGCCT 

CTACC 8 02 



Qy 3143 GGGAAACAAACTGCCCC-GTTAACTCCAGATCATTG — 

CAC T GG AAT G T AAT C AAG AAAG 3199 

I I I I I I I I I I I I I I III III II 

I I 

Db 801 CCCACCGCCCCAGGCGGCACAAG-TGTTTGGCCACACAAA- 

AC TAG AG AT AG 752 



Qy 3200 

TTAGTCATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAG 3259

I III II II I I I I I I I I I I I I I I I I 

Db 751 AAA AGGTGGTAAAAAC 

TTCAAACTTTTCTAAATTCTCCAAC 711 

Qy 3260 

CGAAAGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGACA 3319 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 710 AG-TTTATTTCTTGTGAATTTCTTC- 

CTTCTTTAAATACTCCATTTTAAGAAAACA 657 

Qy 3320 

GAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGTGATTGATT 3379

I I I I I I I I I I I I I I I I I I I I 

Db 656 AA AAAATTA 

ATTATCTAAAGGCAAAGAATGGA — 625 

Qy 3380 

AATGATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAGCATTACAG 3439

I I I I I I I I I I I I III I I I I I I I I I I I 

Db 62 4 AAGCAACCTT TGTGTTCC TTATAATAAC- 

TGACTTCATAAC— 585 

Qy 3440 

GTAAGGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACT 3499

III I I I I 

I I 

Db 58 4 TCTC TCCAG 

CT 574 

Qy 3500 

ATGTATACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACAC 3559 

II I I I I I I I 

Db 573 GCGTT 

ATGGGATGT 56 0 

Qy 356 0 ACATATATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCAC— TCTGAA 

AGACA 3614 

I I I I I I I I I I I I I II III I I I I I I I I I I I 

I I I I 

Db 559 — GTATAAAAAGCT TCTGTTCTG 

AG AAC AAAGG AGC AC G T GC AG AAAT GAG AC G 50 8 

Qy 3615 

GAAGCTTCGTCCAGCCACTCTTCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTA 3674

II II I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 50 7 AAAAAATC-CACTGACAGTATTC CAT T AC AC AAAT TACT TAAAAG— 

ATTTTA 458 

Qy 3675 

GTGGAGCAAGCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGC 3734
II I I I I I I I I I I I I I I I I I I I I II 

I I I I 

Db 45 7 GT CAAGCCCCTCAACAGATT CAATTTTAAAA 

TGGC 423 



Qy 3735 



TTTTAGTCAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACAC 3794

I I I I I I I I I I I I I I I I I I III I 

I I 

Db 422 TTTTAGT 

TAAAAAAAAAAAATTGAAAGTGCTTAC 3 89 

Qy 3795 

ACACATAAAAAACCCAACAGGTCAAAATAAAAGTTGAACTTGAGTTACATTTAATTTAAA 3854
II I I I I I I I I I III I I I I I I I I I I I I 

Db 388 C C AG T AAAAG AAC C G AAG TAG T C C TGAACT 

GTTACGT 352 

Qy 3855 

TATAAATGCATTTTGAGAAATGTTAAGAACAATTTAGTCAATCGTTCATCTGTCATTGGT 3914 
II I I I I I I III III I I I I I I I I II I 

I I I 

Db 351 AAGACTTTTT ACAGTT-GGATC TTTGTCAAAAGGGGATGGG 

GGT 309 

Qy 3915 

ACTGTAAAATAAGCTGTGGTCTATTTCCACTGTTTAATTTTCTACTCAGTTCTACCAAAT 3974

I I I I I I I I I III I III I I I I I I I 

Db 308 GATGGGAGA-AAGCAG CAAC 

GACAATCAAAAAAGTTCGAGC 269 

Qy 3975 

AGGATGTCATGTTTGACATTTTTGATAGTGACTTTGGGGTCTTCTTCACTGAAAGCACCT 4034

I I I I I I I I I I I I I I I I II 

Db 26 8 TGCT-GTGGC- 

T AG AGG AC AAC T T C T G T G 242 

Qy 4035 

TAGAACTGTACTATAAGAAAACATTTCCCCTATGTATAATTATATGAATGTGATGTTTAT 4094 

I I I 

Db 241 

TTTCC 237 

Qy 4095 

TGCTTATTAATTTATAATTCAGTCATTCTCTATATAGGACTTCTTAAAATTTAGAAGGGA 4154

I I I I I I I I I I I I I I I I I 

I 

Db 236 AGATAGGA- 

TTCTTGCTGTAGAAATGGAA 2 09 

Qy 4155 

AATCTAGCTACTTCAAATTGTCTGTTAAATTTATTATGCCCAAATCAACCTCTGAAAAAA 4214 
I I I I I II I I I I I I I I 

Db 208 CTTCCAGC CA GCACAG — 

CATCC 188 

Qy 4215 

GGTTTTTCCAGGAAGATTTACATTTAGGTTTAATATTTTTTTAGTTAGGTAGAGTTTTAA 4274

I I I I II I I I I I I 

Db 187 TGTCC CA 

G TAG AG 175 

Qy 4275 

AAAATACTTGAGCCTGTCCGTGATAAAGCTATAAAATTCAATAACTTTTTAGAATGTTAA 4334

I I I I I I I I I I I II I I I I I I I I I I I I 

Db 174 AAATGAGTTTGTCAGT — TAAAACAAAAAAA 



AAATT- 



141 



Qy 4335 

AT G AAG AC AC T G T T T C C T AAC ATC AG TGAGAT AC ATC T TTGAAT T T AAAC AT TCATATTT 4394 
I I I I I I I II III I I I I I I I I I I I I 

Db 140 AGATACTGGAACC CAG GCTAGA- 

CGAGGTATTGAACCGCGC 101 

Qy 4395 

ACTGAGTACCTACTAGGTACCAAGTACTCTTTTAGGCACTGGAAATACAGTGATGGACAA 4454



Db 100 -CAGATTTCCTTGCAG CCCTG TCTGCTCAGCTC — GCATTCCAGT — 

AGGCCAA 52 

Qy 4455 AA 

CAGGTAAAAAATCGCTGCCCCCTCAGAGCTGACATTCTGGGGTGGGAATTTCATT 4511 



Db 51 CAGTGCTCGAGGAATTCCGCAGC- 

T T T T AG AGC AG AAG T AAC AC T 8 

Qy 4512 TTGCCACGTAC 4522

I I I I I I 

Db 7 T CCGTAC 1 
Qy 18 TGCGG-GGAGCTAAGTC --

CCCAGATTGGAGGAGGCTGGCTCTGGTCTTCGATGCACAGG 74 



Score 5370.5; DB 1; Length 2336; 
Pred. No. 0; 
0; Mismatches 805; Indels 1295; 



Db 2 TACGGAAGTGTTACTTCTGCTC TAAAAGCTGCGGAAT TCCTCGA- 

GCACTG- 51 



Qy 75 AGTGGCCGTTATGGAACGC-AGCAGCAGCGTGCAGGGT — 

CAAAGACAGCCGGCCCCCCA 131 

I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 52 -TTGGCC-TACTGGAATGCGAGCTG- 
AGCAGACAGGGCTGCAAGGAAATCTGGCGC 104 



Qy 132 

TGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCCAAGTCCTGTGGTGCGCCAG 191 
III III III II I I I I I I I I 

Db 105 GGTT CAAT ACC-TCGT — CTAG-CCTG- 

GGTTC 132 



Qy 192 

ATTGACAAGCAGTTTCTGATTTGCAGTATATGCCTGGAACGGTACAAGAATCCCAAGGTT 251

I I I I I I I I I I I III I I I I I I I I I I I I 

I I 

Db 133 CAGTATCTAATTTTTTTTTTGT TTTAACTG-AC AAACTCAT 

TT 174 



Qy 252 CTCCCCTGTCTGCACACTTTCTGCGAGAGGTGCCTGCAGAAC — 

TACATTCCTGCCCACA 3 09 

III III I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 175 CTCTACTG GGACA 

GGATGCTGTGCTGGCTGGAAGTTCCATTTCTACAGCAA 225 



Qy 310 GTTTAACCCTCTCCTG CCCAG- 

TGTGCCGCCAGACCTCCATCCTGCCCGAGAAAGGG 365 

I I I I I I I I I I I I I II 

Db 226 G AATCCTATCTGGAAACACAGAAGT- 

TGTCCTCTAGCC 262 



Qy 366 GTGGCCGC- 

GCTCCAGAACAATTTCTTCATCACAAACCTGATGGACGTGCTGCAGCGAAC 424

I I I I I I I I I I I III I I I I I I I I I I I I I 

I 

Db 263 ACAGCAGCTC — GAACTTTTT TGATTGTCGT — 

TGCTGCTTTC 301 



Qy 42 5 TCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGAC — 

AGTCACTGCTGTGGCTGCG 482 

III I I I I I I I I I I III III II I II 

Db 3 02 TCC CATCACCCC CATCC— 

CCTTTTGACAAAGATCCAACTGT 341 

Qy 483 

GGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGATGGAATTTTACTGCCAGTCC 542
I I I I I I I I III 

I 

Db 3 42 AAAAAG TCTT ACG 

TAACAGTTC 363 

Qy 543 

TGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACGCAGAGCACCCCACAGTT 602
I I I I I I I II I II III I I I I I 

I I 

Db 36 4 AG GACTACTTCGGTTC TTTTAC — TGGGTAAGCAC 



TT 398 



Qy 6 03 CCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCAGGTCCAGCTG 

GATGC 658 

III III I I I I I I I I I I I I I I I 

I I 

Db 399 TCAAT 

TTTTTTTTTTTAACTAAAAGCCATTTTAAAATTGAATCTGTTGAGGGGC 452 

Qy 659 -TGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTC- 

ATCTCTGAAATCATT 716 

I I I I I I II I I I I I I I I I III II I I I I 

I I 

Db 453 TTGACTAAAA 

TCTTTTAAGTAATTTGTGTAATGGAATACTGTCAGTGGATTTTTT 5 0 7 

Qy 717 

CATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTCCACCTTTGATGAG 776

III II I I I I I I I I I I I I I I I I 

Db 508 CGTCT CATTT CTGCA-CGTG 

CTCCTTTGTT 536 

Qy 777 CTC CAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATG-GAATTG- 

GAGGTC 83 0 

III I I I I I I I I I I II I I I I I I I I I I I I II 

I I I 

Db 53 7 CTCAGAACAGAAG- 

CTTTTTATACACATCCCATAACGCAGCTGGAGAGAGTTATGAAGTC 595 

Qy 831 AACTATGGCCTCAAACACAAAG — TCC — TCCAGTC 

GCAGCTGGATAC 8 74 

I III I I I I I I I I I II I I I I I I II I I I I I 

Db 596 

AGTTATTATAAGGAACACAAAGGTTGCTTTCCATTCTTTGCCTTTAGATAATTAATTTTT 655 

Qy 875 

TCTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCT 931 

I I I I I I I I I I I I I I I II I I I I I I I I 

Db 656 TTGTTTTCTTAAAATG 

GAGTATTTAAAGAAGGAAGAAATTCACA 699 

Qy 932 

CAACCATGGCACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAA 991 

III I I I I I I I I I I I I I I I I I 

I 

Db 70 0 AGAAATA AACTGTTGGAGAATTT 

AG AAAAG T T T G A 73 4 

Qy 992 

CGAGCTGGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCAT 1051 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 735 AGTTTTTACC-ACCTTTTCTATCTCTAGTTT TGTGTGGCCAAACA-CTTG — 

TGCCG 787 

Qy 1052 

CGTGGAAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAA 1111 

I I I I I I I I II I I I I I II III I I I I I I I I I I I 
Db 78 8 CCTGG-GGCGGTGGGG — GTAGAGG — CAAGCATAGAC- 
AGAGAGGAACTAAGCC 836 



Qy 1112 

CGCCGTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCA 1171 

I I I I I I I I I I III II III III 

I I 

Db 83 7 AGACA-TGGACAAAGGCACGAG 



C C AAAAC C AG AC A 8 70 
Qy 1172 

GCCCATGTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAA 1231 

I I I I I I I I I III I I I I I I I I I I I I I I I I I 

Db 8 71 GTCC-TGGCCGT-TCGAT AGGCCA GCAGTGGG-TGAG 

ACAGG— 9 09 

Qy 1232 

CGCCTACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCT 12 91 

I I I I III I I I I I I I I I I I I I I I I I I I 

Db 910 TCAGCC AGCTGCAGGGGCGGGAGCGGGG- 

GGGCGGGTAG 9 47 

Qy 1292 

GGACAACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTAC 1351 

II I I I I I I I I I I I I I I I I I II I I I I I 

Db 9 48 GG ACATGGATG TGTGAG GTGCTCATGC — 

GTGCGATCGTGACT 988 

Qy 1352 CCTGTCTCTGAGACTCTATGACCAGCACA TCCGAGGCAGC- 

CCGTTTAAGCTGAAAG 140 7 

I I I I I I I I I I I I I I I I III I I I I I I I I I 

I 

Db 9 89 — AGATTCTGAAACT GCCAGC-CATTTTCC CAGCTCCGCTT 

TG 1028 



Qy 1408 

TGATCCGATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCC- 1466 

II I I I I I I I I I I I I I I I I I I I I I 

Db 1029 TG CAATCTAAAGGAATGCATCC — CCTCTGAAG 

CAGTCTT 1066 



Qy 1467 - 

CCGGGGAGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCAC 152 5 

III III I I I I I I I I III II I I I I I I I I I 

Db 106 7 GCCAG AGC C T AGTGAGGGAGAGAAG T AT GG T AAAT AC C AC AA- 

CATAT 1113 



Qy 1526 

TGGAAAACGAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGG 1585

I I I I I I I I I I I II II I I I I I I I I III 

I I 

Db 1114 — GGAATC AGAAAATAC CGGGAAC TGGAGTGGG 

CAAGGG 1150 



Qy 1586 

AAGAAATAAAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGAT 16 45 

I I I I I I I I I I I I I I I I I II III I I I I I I 

Db 1151 — GGAATGCA-GAGGGTGTGGAAATTTTTTAGTG ATC 

TGGAATGTG 1193 

Qy 1646 

ATTAATTGCAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAA 17 05 

I I I I I I I I I I I I I I II I I I I I III I 



I 

Db 119 4 TTGAGTGACAGGAAGTGCC — CCAA GC TTCTCCC CCA 

CCA 1231 

Qy 1706 

AAGTCGTTTTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGC 1765 
I I I I I I I I I I I I I I I 

Db 1232 A CTCTTC TCAGTCGC GCCTGCT — 

TT 1255 

Qy 1766 

TGTACATCCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTT 182 5 

I I I I I I I I I I I I I I I I I I I II 

I 

Db 12 56 TGTCTAACTC — T T G T AAT C T AC AC AC T AC T GC T T A 

CAAAGCTGT 129 8 

Qy 1826 

CTCCTCCGATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGG 18 85 

II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1299 CT G AG T T T AAG AC AAAAA 

GAAACCTAAAAGTCTCCTTA — 1336 

Qy 18 86 AGTTTCTGTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTG 

CTGCGT 1942 

III I I I I II I I I I I I I I I I I I 

I 

Db 133 7 — CTTCATAGTACC TG TGATATGG 

AAGGAATGTAAAAGCAT 13 75 

Qy 1943 

GTTTATCTTCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGA 2 0 02 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1376 G — ACCCTTTAAGCCTAT — GGACATT — TTTTCAGGTAT 

ACAGGAGA 1417 

Qy 2003 CAGGCAGTTTGCAGGTCCCCATTTTGCAGCTGTAAATAGC-

AATAATGAGATTATTATTA 2061

III I I I I I I I I I I I I I I I I I I I I 

I 

Db 1418 AAGATAAAAT 

AATTTTCCACAGAAAAATGAGAATTATGAATTATATAGTT 146 7 

Qy 2062 

CAGATTTCCATAATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGT 2121
I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 146 8 CAGGTTCCAAATCTAATT TTTAA AAGAATTC 

TGA— T 1502 

Qy 2122 TTGGCTCAAATGGAGAAGGAAATGGGCAGTTT 

AATGCTCCAACAGGTGTAGCAGTGG 2178 

I I I I I I I I I I I I I I I I I I I 

I I I 

Db 15 03 TCTGCT ACACTTTACAAATGCT TAGGT 

TGG 1532 



Qy 2179 

ATTCAAATGGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATG 2238



Db 1533 TTCCTAAT TTGAAGGAGACTTG 

TTTTATTT 1562 

Qy 2239 

GGAGTGGATCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAG 2298
II I I I I I I I I I II III I I I I I 

Db 1563 GG TTAATGCATTGCATTTGAACT — 

TGTTTCTATTTTCTTTG 1602 

Qy 2299 

GCCTGGCCCTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCA 2358

I I I I I I I I I I III I I I I I I I 

I I 

Db 16 03 CATAAATT TGGACT TTG GGAGAAAAATG 

CA 1632 

Qy 2359 

AAGTCTATCGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCA 2418
I I I I I I I I I I I I I III 

Db 1633 AAGTAATAAGTAGAATGCACT TGGG 

GGA 1660 

Qy 2419 

CTATAAACTGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCA 2478
III III I I I I I I I I II I I I I I 

Db 1661 AAAAAGGA — GGATTTTTC CCT 

TCATGGGTT 1689 

Qy 2479 GAAGGAATCATTGGTGAACTTTCCAAGGTT-

ATTTCTGAATGTAACAATTTCCTTAAAAA 2537 

I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 169 0 GGAA 

AGTATTTTAAAGGGTTGTTTTCTTGAAAAAACAAGCTC 1731 

Qy 2538 

TGACTTATCCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCT 2597

II II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1732 — TCTCTTAC — TTTCTGCATCTATGCTATAAAGATAA CTA-TG 

CT 1772 

Qy 2598 

ATAAAAACTGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTA 2657

I I I I I I I I I I I I I I I I 



Db 17 73 ATAATAAATG TAAG GTAG- 

AA 1792 

Qy 2658 

AACTAATTTTGCAAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTT 2717

I I I I I I I I I I I I I I I I I I I I 

I 

Db 1793 AACT TTAAAGAGA 

AAAT AAC AGTGTTC — T 182 0 

Qy 2 718 AATCCTGTGAATGG 

T AGC T T T TGC AC AGAAC T TCCAAAAGCAAAACAAAAACAAA 2 7 72 

II I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I 



Db 1821 AA G T GAAAAGC T AC T T AGC AT TTTCCCA- 

AACTCACACATTATCAACAGAAACACT 18 75 



Qy 2773 

ATCTATTGTAGTTATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAAC 2832

I I I I I I I I I I I I I I I I I I III 

I 

Db 18 76 AAAAATT-AAGAGAAAAGCGGC CAACTAATTAAATAG 

CTAGC 1916 

Qy 2833 

CTCACTTTTACAGTAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATAT 2892

I I I I I I I I I I I I I I I I I I I I I I 



Db 1917 CTTATTTT GGGGGTAT- 

GGGATAGAAAATTAAGTGT 1951 



Qy 2893 

AAATTACTTTGGAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCC 2952
III I I I I I I I I I I I I I I I III 



Db 1952 GAAT AAAATGATA CTTG-GGAATGTT-



Qy 2953 CTCTGGAGTTCAGAACTTAAG-

TATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAA 3011
III III II I I I I I I I I 

Db 19 82 CTC GTAC — CACAAAGGATGTTAGTG- 

TACGAGTTA 2 021 



Qy 3012 

AGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAA 3071
I I I I I I II II II I I I I I II 

Db 2 022 — ATCCTTCCTAG CATGGCTCTGAGCCTT CATGCCG 

AGC AG AC— 2 062 



Qy 3072 

AGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAA 3131 
II III I I I I I III I I I I I I I I I I 

Db 2063 -GTTATTC ACATGACGATTC GAAAAGTCCATT- 

CATATATCT 2102 



Qy 3132 

TGCTACCTTTTGGGAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAAT 3191 

I I I I I I I I I I I I I I 

I I 

Db 2103 CGCTACC 

TGGATTTGAATAGAAA 2125 



Qy 3192 

CAAGAAAGTTAGTCATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTT 32 51 
I I I I I I I I I I I I I I I I II 

Db 2126 CC AG AC AG CAATTCTTTAGTTCCA 

GCCACCATTCGCCCC 216 4 



Qy 3252 

CCTGAAAGCGAAAGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATC 3311 

I I I I I I I I I I I I I I I I I III 

I I 

Db 2165 ACTGGA — CAATAGCGAT TTGTTAGCA 



CAGAGTC 2196 



Qy 3312 

CTAGGACAGAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGCATGT 33 71 

I I I I I I I I I I I III I I I I I I II 
Db 219 7 ACAGG-CTGTGGGACACAA — AGC TTGGAG — CT 

GCA 2228 

Qy 3372 

GATTGATTAATGATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCTTGAAAG 3431



Db 2229 GAAAGATGGGGGATTC AGAGAACAGGAAATTACAGG CTCG- 

ATG 2271 

Qy 3432 

CATTACAGGTAAGGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTC 3491

I I I I I I I II II III I I I I I 
Db 2272 CACTCCTGGCA GC 

T C T G AG AAT AC AAAAAAAAAAAAAAAAA 2312 

Qy 3492 AAGGGACTATGTATACATGATTAG 3515

II III III II I 

Db 2313 AAAAGGCCACATGTGCTCGAGCTG 2336 
Qy 59 GTCTTCGATGCACAGGAGTGGCCGTTATGGAACG — 

CAGCAGCAGCGTGCAGGGTCAAAG 116 

I I I I I I I I I I I II I I I I I I I I I I 

Db 212 0 GTCT AGCGGCCG CGACCTGCAGCT- 

CGAGC 2 092 



Qy 117 

ACAGCCGGCCCCCCATGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCCAAGT 176 
III I I I I II II I I I I I 

Db 2091 ACATGTGGCCTTTTTTTTTTTTTTT 

TAAAAGAA 2 059 



Qy 177 

CCTGTGGTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAGTATATGCCTGGAACGGTAC 236 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2 058 — T G AG T T AC AT T T - AT T GAT AT G — GTTTGTCATATGCTTTATA- 

AATGGT-C 2012 



Qy 237 

AAGAATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGCCTGCAGAACTAC 2 96 
I III II I I I I I I I I I I I I I I I I 

Db 2011 A CCCT T T G AAAC AT GTATTATTACTATTTGGGGGAGAGG 

GGGACT — 196 7 



Qy 29 7 ATTCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGC- 

CGCCAGACCTCCATCCTGCC 355 

I I I I I I I I I I I I I I I I I I 

I I 

Db 1966 GTTCAT TTTA CAGGGGACAAGCAAGA- 

CAGGCT 1935 



Qy 356 

CGAGAAAGGGGTGGCCGCGCTCCAGAACAATTTCTTCATCACAAACCTGATGGACGTGCT 415 
I I I I I I III I I I I I I I I I I I I I 

Db 193 4 CAAGGAGGGAAAGGACAGGCTCAAAG TCATCACA 

GTGTG 1896 



Qy 416 

GCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGACAGTCACTGCTGT 4 75 
I I I I I I I I I III I I I I I I 

Db 1895 G GGCTGGAATGC 

AGTTGCCCTTCCT 1871 



Qy 476 

GGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGATGGAATTTTACTG 535 

I I I I I I I I I I 

Db 1870 TCTTTCT 

TTTTGC— 1858 



Qy 536 

CCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACGCAGAGCACCC 595 

II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1857 ACAT CTTCC — GTCTCTAGGGTG — AGGAGGGG 

TGCAGG 1823 



Qy 596 

CACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCAGGTCCAGCTGGA 655 
I I I I I I I I I I I I I I I I I II II 

Db 1822 CACAGG — CACCCAAG ACAGCT GC 

GGTCCAGC 1793 



Qy 656 

TGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGCTCTTCAGTTCATCTCTGAAATCAT 715 

III I I I I III II 



Db 1792 — 

CTGTGGTC — 1774 



-CCCG- 



-GCCCCAC- 



Qy 716 

TCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTCCACCTTTGATGA 7 75 



Db 17 73 TCAGTGA— CGCCCCAGAGGCCC-CATC 

TTCCCCACATAATGA 173 4 

Qy 776 

GCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATGGAATTGGAGGTCA 831 

I I I I I I I I I I I I I 

Db 1733 GGCTGCTCCA 

TCCTCCTCAAAG 1712 

Qy 832 

ACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGATACTCTGCTCCAGGGG C 8 8 8 



Db 1711 -CCCAGACCTATTTCATAAG CCCCAGACCCCAC- 

CTTCACCCAGGGCCCC 166 4 



Qy 8 89 AGGAGAGC AT T AAGAGC TGC AGCAAC T TC - AC AGCGCAG 

GCCCTCAACCATGG 9 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1663 AAGAGAAC 

AGAGCTGGAGACACTTCTACTCCTAGCACTGGATGCCCTCTCC 1613 



Qy 941 

CACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAGCTGGC 10 0 0 



Db 1612 CCTCCT-GTGAAGCTGTAGGTGG- 

GGGGTGG- 158 4 



Qy 1001 CGACCAGGACTTCCCCTT GCA — 

CCCGCGGGAGAACGACCAGCTGGATTTCATCGTG 1055 



Db 1583 AGGACACCCCTTTAAGGCAGGCTCGGGGGGCTATGAACTCCAAGACTCC- 

TG 1533 



Qy 1056 

GAAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCC 1115 

I I I I I I I II I I I I I I I II I I I I I I 

Db 1532 GAAGCCAGTAAGAAGTGGGAAGGTAGGCACACCATTTGG — 
GATTTTTAT 1485 

Qy 1116 

GTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCC 1175 



Db 148 4 -TTTTCACTGAGCTA AGAGAGG AG AT T C — CAG-CTAT 

GGAATCCA 14 43 

Qy 1176 

ATGTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCC 1235 



I I I 



Db 14 42 GAGAGCGT TCTACAGGAACA GAACAG 

AAATTGGCTGGGCC 1403 

Qy 1236 

TACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGAC 12 95 
I I I I I I I I I I I I I 

I I I I I I 

Db 1402 CTGGG CCCAGA GGAGG- 

CCTGGA- 13 81 

Qy 1296 AACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTA- 

CCCT 1354 

II I II I I I I I I I I I I I I I I I I I 

I 

Db 13 8 0 GCCGC TTACTGTTCAGTG-CCAG 

CTTCATCCTT 1349 



Qy 1355 GTCTCTGAGACTCTATGAC — 

CAGCACATCCGAGGCAGCCCGTTTAAGCTGAAAGTGATC 1412 

I I I I I I I I I II I I I I I I I III III 
Db 1348 GGCACTCTGCCTCTCACACAGCATCGC-TCC— GGCTCCCAG- 
AGC 1307 



Qy 1413 

CGATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGG 14 72 
I I I I I I I I I I I I I I I I I II II 

Db 13 06 CCACAGTGG CAGCCGCTGGAGGC-TGTGGCAGC- 

AGCTGCTGCCCT 1263 



Qy 1473 

AGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAA 1532 

III I I I I II I II I I II I II I I I I I 

Db 1262 GCC CAAG — GAGGGATGTG GGCTTG GCTCTGACAGTTC 

AAGC 1223 



Qy 1533 

CGAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAAAT 1592 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1222 CAGAAAGAG CCTTAGAAGTCATTCAGGACATTCCCAT — 

T T T AC AG AT GAT G AAAT 1169 

Qy 1593 AAAG 

GAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATT 16 4 8 

III I I I I I I I I I I I I I I I I I I I I II 

Db 116 8 AAGGCTCAGAGAG 

AGCAGGGGCTTTCAAGAACAATAAAGGTGGAG 112 4 

Qy 16 49 AATTGCAGACAGTAACAACCAATGTGTGCAGA TATTTTCCAATGATGGC 

AG 1700 

I I I I III I II I I I I I I I I I I I I I I I I I I 

I I 

Db 1123 ACTGGAACTCAG — GCCTCC-- 

TGTCTGCTGAGGGTGTCTGCCCATGCTCTCTCTCAGAG 106 8 



Qy 1701 TTCAAAAGTCGTTTTGGCATACGGGGACG — 

CTCTCCGGGGCAGCTGCAGCGGCCCACAG 1758 

II I I I I I III I I I I I III III 

Db 106 7 TTTCTGAGTGGTGGTGTCTGGAGGGGCTGGACACGCAAGGG- 

AGCTGGCCTCGT 1015 



Qy 1759 

GAGTGGCTGTACATCCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCA 1818 

I I I I I I I I III I I I I I II II I I I I I III 

I I 

Db 1014 GTGTGTCTGTCTGTGCC GGGATGT-GTCTGTG — 

GCTTCTGTCTCTGAACGTGCCA 962 



Qy 1819 

GCATTTTCTCCTCCGATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGAC 18 7 8 
I I I I I I I I I I I I I I II 

Db 961 GGGGTTTTGTGTCCTGTGTG TGTCTCT 

CTGTGGGTGC 925 



Qy 18 79 CCAAAGGAGTTTCTGTGGACCGCAA — 

TGGGCACATTATTGTTGTGGACAACAAGGCGTG 1936 

I I I I I I III I I I I II 

I I 

Db 92 4 TC CTGTGTGCAGGAACTTGGGGCCA- 

TGGCCAGAGAGGGATG 884 



Qy 1937 

CTGCGTGTTTATCTTCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAA 1996 
II I I I I I I I I I I I I I I I I I 

Db 8 83 -GGC CTATTCCT-ACAATCTTGGGAGGCCTCCACAG 

GCTGAGCTGG 8 40 



Qy 1997 

TGGGGACAGGCAGTTTGCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTAT 2 056 
I I I I I I I I I I I I I I I I I I I I I I I 

Db 839 TGGGGA GGGTCC TGAAG-GGTAGAGTG- 

AAGATGGGGACT — 8 02 



Qy 2 057 TATTACAGATTTCCATAATCATTCTGTCAAGGTGTTTAATCAG- 

GAAGGAGAATTCATGT 2115 

I I I I III II II 

Db 801 — TTAC TCC CAG 

CGGAGAGGAAGAAACCA 7 75 



Qy 2116 

TGAAGTTTGGCTCAAATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAG 2175 
I I I I I I I I I I I I I I I I I I I I III 

Db 774 -GAAGATAGGCT TGGGGCATG-GATGGG 

AAAGGTCTTG 739 



Qy 2176 

TGGATTCAAATGGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTG 2 235 
III I I I I I I I I I I I I I I I I I I I 

Db 73 8 -GAACTAGGATGG GGCAAGGGAAACA — 

AAGCCCCA A 705 

Qy 2236 ATGG- 

GAGTGGATCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCC 2 2 9 4 

I I I I I I I I I I III I I I I I I I I I I I I I 

I I I I 

Db 70 4 ATGGAGTATGGGTC CCTGGA AG T T G AC AC AC T C T - 

TAGCCCT 66 4 



Qy 2295 CAAGGCCTGGCCCTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAAT — 



CACT 2352 

I I I I I I I I I I III I I I I III I III II I 

I I I I 

Db 663 CATGG — GGGCCCAGCC — 

CAGTGGGTAACCCCGGGGAAGTGGTCACATGACGTGACACT 6 08 

Qy 2353 

GTTTCAAAGTCTATCGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGT 2 412 

I I I I I I I I I II III III 

I I 

Db 607 -TCTCA GCTGATTGCT TGAGATCCC CCA- 

GT 579 



Qy 2413 CTTGCACTATAAACTGGAATGGATTTCTCAATGCGGGA — 

CCAGATTATGACTAGAGTTT 2470

III III II I I I I I I I I I 

Db 578 GTGGC CTG 

GGCCTTGTCCTTGAGTGAGCCCAGGTTC 5 43 



Qy 2471 

TTATGCCAGAAGGAATCATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCC 2 53 0 
I I I I I I I I III III I I I I I I I I I I 

Db 5 42 — AAGCCAGGAAGGC TGGCCAGCTGCCCAAGG 

ACTGAG 50 7 



Qy 2531 

TTAAAAATGACTTATCCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTAC 259 0 

I I I I I I I II I I I I I 

I 

Db 506 GCCTTCTCC TC 

CTCCTCCTC 487 



Qy 2591 

TGAATCTATAAAAACTGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTAT 2 650 
I I I I I I I I I I I I I I I I III I I I I 

Db 486 TGTGTC CTCC — TCGATCATCT-TGGCC 

AGCTCCCGGGGC 450 



Qy 2651 GT AGC T AAAC T AAT T T TGC AAATCAAACAGAC A- 

CTTAAAAAAACTAGCATATGTAAAGG 2 709 

I I I I I I I I I I I I I 

Db 449 — AGCTCCA 

CGTCTCCACCTGCCATCTGGATCTGGCTCTCATCTGTC 4 05 



Qy 2710 

TATTCGTTAATCCTGTGAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAAC 2 769 
I I I I I I I I I I I I I I I I I I I 

Db 40 4 TCGT TCTTG GGTAGC 

CGGTATTT 382 



Qy 2770 

AAAATCTATTGTAGTTATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCT 2 82 9 

II III II II II I I I I III 

I 

Db 381 -GTCTCGGAAGTGG CT CCGCAGT GTGGCCC 

GCTCTGCC 345 

Qy 2830 AACCTCACT-

TTTACAGTAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAA 2888

I I I I II I I I I III III III I I I I III 



Db 3 44 TTCCTCTGTGTGAACTGT — GCATCCCGCT CCATC- 

TTCTCTTCCAC 301 



Qy 2889 

ATATAAATTACTTTGGAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGC 2 9 4 8 
I I I I I I III I I I I II I I I I I I I 

Db 300 GAGTTGCTTCTGATAC TCCT- 

CGTACTCCTCCCGGCTC A 263 



Qy 2949 

AGCCCTCTGGAGTTCAGAACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGA 3 0 0 8 

I I I I I I I I I I I I III I I I I I II III 
Db 262 TGCCCT GAGCTTCGG C — TGCCGACTTGTC — 



Qy 3009 

CAAAGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAA 3 06 8 
I I I I I I I I I I I I I I I I I I I I 

Db 227 CCCTTATCC 

TCGCCGCCTCCCAGGCTCCCAGTG 195 



Qy 3069 

CAAAGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAG 312 8 
II I II I I I I I III III II 

Db 194 — AGG TTCTTCAG CTG GCC 

GC 176 



Qy 3129 TAATGCTACCTTTTGGGAAACAAACTGCCCCGTTAACTCCA- 

GATCATTGCACTGGAATG 3187 

I I I I I II I I I I I I I I I I I I I I I I I I I I 

I 

Db 175 CCACCAT GG TCTTCACCATGAACGCCATGGTCTT — 

CGCCGGCCGG 132 

Qy 3188 

TAATCAAGAAAGTTAGTCATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCG 32 4 7 
II I I I I I I I I I I I 

Db 131 GGCACCGG GCGCC- 

TGCTACCACGCG 107 

Qy 3248 

ACTTCCTGAAAGCGAAAGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAG 33 0 7 

I I I I I I II I I I I I I I I I I II 

I I I I 

Db 106 CCAGGCACCGGGCACGTCCGCGCCTACT TCAGC 

ACTCCAG 6 7 

Qy 3308 

TATCCTAGGACAGAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCAACCAAAGGC 336 7 
II I I I I I I I I I I I I I I I 

Db 66 TA GGCCAACAGT-GCTC 

GAGGA 46 

Qy 336 8 ATGTGATTGATTAATGATTCCCC — 

CTTAGAAAGCAAGTGTTACCAAAGTTGTGTTATCT 3425 

I I I I I I III I I I I I I I I I 

I I 

Db 45 ATTCCGCAGCTTTTAGAGCAGAAGTAAC 



Qy 3 42 6 TGAAAGCATTACAGGTAAGGGCAT 3449 

I I I I I I I I I I 

Db 14 T CCGTACA GGCCT 2 
Qy 14 GGGCTGCGGGGAGCTAAGTCCCCAGAT- 

TGGAGGAGGCTGGCTCTGGTCTTCGATGCACA 72 



Db 3 GGCCTGTACGGAAGTGTTACTTCTGCTCTAAAAGCTGCGGAAT 

GCACT 5 7 

Qy 73 GGAGTGGCCGTTATGGAACGCAGCAGCAG 

CGTGCAGGGTCAAAGAC 118 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 58 G— TTGGCC- 

TACTGGAGTGCTGAAGTAGGCGCGGACGTGCCCGGTGCCTGGCGCGTGGT 114 
Qy 119 

AGCCGGCCCCCCATGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCC A 173 



Db 115 AGCAGGCGCCCGGTGCC 

CCGGCCGGCGAAGACCATGGCGTTCATGGTGA 163 

Qy 174 

AGTCCTGTGGTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAGTATATGCCTGGAA — C 231 



Db 16 4 AGACC-ATGGTGGGC GGCCAGCTG 

AAGAACCTCACTGGGAGCCTGGGAGGC 213 



II I I I I 
TCCTCGA- 



Qy 232 GGTACAAGAATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGC- 

CTG 287 

I I I I I I I I I I I I I I I I I 

Db 214 GG — CGAGGAT AAGG 

GAGATGGGGACAAGTCGG 2 44 

Qy 288 

CAGAACTACATTCCTGCCCACAGTTTAACCCTCTCCTGCCCAGTGTGCCGCCAGACCTCC 3 4 7 

III I I I I I I I I I I I I I I I I I 

I 

Db 2 45 CAG CCGAAG CTCAGGGC — 

ATGAGCCGGGAGGAGTAC 2 79 

Qy 3 48 ATC CTGCCCGAGAAAGGGGTGGCCGCGCTCCA- 

GAACAATTTCTTCAT 39 4 

III I I I I I I I I I I III I I I I I I 

I 

Db 280 

GAGGAGTATCAGAAGCAACTCGTGGAAGAGAAGATGGAGCGGGATGCACAGT T 332 

Qy 395 

CACAAACCTGATGGACGTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCAT 45 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 333 CACA CAGA-GGAAG-GCAG-AGCG GGCCACACTGCGGAGCC 

ACTT 374 

Qy 455 CCTGGAGACAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGC 

CCAAACCACGA 511 

II I I I I I I II I I I I I III I II I I I I 

I I 

Db 3 75 CC — GAGACAAATAC 

CGGCTACC C AAG AAC GAG AC AG AT G AG AGC C AG AT C CAG A 42 7 

Qy 512 

TGGGAATGTGATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCAC 5 71 
I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 42 8 T-GGCAGGTGGAGACGTGGAGCTGC CCCGGGAGCTGGCCAAG 

ATGATC 474 

Qy 572 

GGAGGGGGAGCACGCAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAGCACAA 631 
I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 4 75 - GAG GAG G A CACAG AG GAG GAG GAG GAG 

AA 503 

Qy 632 

GGCCTCGCTCCAGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGATTCTGC 691 
I I I I I I III I I I I I I I I III I I I I I 

I I 

Db 50 4 GGCCTCAGTCCTTGGGCAGCTGGC CAGC CTTCCTG 

GC 540 

Qy 692 

TCTTCAGTTCATCTCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGA 751 
I I I I I I I I I I I I I II II I I I I I I 



Db 541 TTGAAC-CTGGGCTCACTCA — AG 

AAGGCC 570 



GAC— 



Qy 752 

TGACATTCATTCCACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCT 811 

II I I I I II I I I I I I I I I II I II 

Db 571 CAGGCCACACTGGGGGATCTCAAGCAATC 

AGCTGAGAAGTGT 612 

Qy 812 

GCTTATGGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAGCTGGA 8 71 

I I I I I I I I I I I I II II I I I I I 

I I I I I 

Db 613 -C ACGTCA — TGTGACCACTTCCCCGGGGTTACCCACTGG — 

GCTGG- 654 

Qy 872 

TACTCTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCC- 93 0 
I I I I III I I I I I I I I I I I I I I I I I I 

I I I 

Db 655 GCCCC CAT G AGGGC — T AAG AG - T G T G T C AAC T T C - C AG 

GGACCCA 696 

Qy 931 TCAACCAT GGCACGGAGACCGAGGTCC — 

TACTGGTGAAGAAGCAGATGAGCGAGA 984 

I I I I I III I II III I I I I I 

Db 697 

TACTCCATTTGGGGCTTTGTTTCCCTTGCCCCATCCTAGT 736 

Qy 9 85 AGCTGAACGAGCTGGCCGACCAGGAC— TTCC 

CCTTGCACCCGCGGGAGAACGACCA 1039 

I I I I I I I I I I I I I I I I I I I 

I 

Db 73 7 TCCAAGACCTTTCCCATCCATGC-CCCAAG 

CCTATCT 772 

Qy 10 40 GCTGGATTTCATCGTGGAAACCGAGGGGCTGAAGAAGTCCATC CACAACC 

TCGG 1093 

I I I I I I I I I I I III III II I I I I I I I I I I 

I 

Db 773 TCTGG-TTTCTTCCT 

CTCCGCTGGGAGTAAAGTCCCCATCTTCACTCTACCCTTCAG 82 8 

Qy 109 4 GAC GAT C T T AAC C AC C AAC GCC 

GTTGCCTCAGAGACAGTGGCCACGGGCGAGG 1146 

III II I I I I I I I II I I I I I I I I I I I 

I 

Db 829 GAC CCTCCCCACCAGCTCAGCCTGTGGAGGCCTC 

C C AAG AT T G TAG 8 74 

Qy 1147 

GGCTGCGGCAGACCATCATCGGGCAGCCCATGTCCGTCACCATCACCACCAAGGACAAAG 12 06 

I I II I I I I I I I I I I I I I I I I I I I I 

Db 8 75 GAATAGGCCCATCCCTC-TCTGG CCATGGCC 

CCAAGTTC 912 

Qy 1207 

ACGGTGAGCTGTGCAAAACCGGCAACGCCTACCTCACCGCCGAACTGAGCACCCCCGACG 12 66 

III I I I I I I I I I I I I I 

Db 913 CTG CACACAG 



GAGCACCCAC 932 



Qy 1267 

GGAGCGTGGCAGACGGGGAGATCCTGGACAACAAGAACGGCACCTATGAGTTTTTGTACA 132 6 



Db 933 — AGAGAGACACAC AC AGGAC - AC AAAA CCCCTG 

GCA 966 

Qy 1327 

CTGTCCAGAAGGAAGGGGACTTTACCCTGTCTCTGAGACTCTATGACCAGCACATCCGAG 13 86 



Db 96 7 C-GTTCAG -AG AC AG AAGC C AC AG AC AC AT C C C GGCAC — 

AG AC AG AC AC AC AC GAG 1019 

Qy 13 8 7 G-CAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCTGATGTGTCTCC- 

CACCACAGAAGG 1444 

I I I I I I II II III II III I I I I I I I I I I I 

Db 1020 GCCAGCTCCCTT— GC GTGTCCAGCCCCTCC 

AG AC AC C AC C AC T C AG AAAC 106 8 

Qy 14 45 CGTGAAGAGGCGCGTTAAGTCCCCGGGGAG-CGGCCACGTCAAGCAGAAAG-CTGTG- 

AA 1501 

III I I I I I I I I I I III II I I I I I I I I I 

Db 1069 TCTGAGAGAGAGCAT 

GGGCAGACACCCTCAGCAGACAGGAGGCCTGAGTTC 1119 

Qy 1502 

AAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAGAGAATCCCATCGAAGACGATTT 1561 



Db 112 0 CAGTCTC-CA — CCTTTATTGTTCTTGAAAGC- 
CCC T 1152 



Qy 1562 

GATCTTTCGAGTGGGTACCAAAGGAAGAAATAAAGGAGAGTTTACAAATCTTCAGGGGGT 1621 

I I I I I I I I I I I I I I I I I I I I I 

Db 1153 GCTCTCTC TGAG — CCTTA TTTC — 

ATCATCTG 1181 

Qy 1622 

AGCTGCATCTACAAATGGAAAGATATTAATTGCAGACAGTAACAACCAATGTGTGCAGAT 16 81 

I I I I I I I I I I I I 

I I 

Db 1182 TAAAATGGGAA 

TGTCCTGA- 12 0 0 



Qy 16 82 ATTTTCCAATGATGGCCAGTTCAAAAG- 

TCGTTTTGGCATACGGGGACGCTCTCCGGGGC 1740 

I I I I I I I I I I I I I I I I I I I 

I 

Db 12 01 ATGAC TTCTAAGGCTCTTTCTGGCTT- 

GAACTGTCAGAGCC 12 40 



Qy 1741 AGCTGCAGCGGCCCACAGGAGTGGCTGTACATCCCAGTG- 

GGGACATAATCATTGCCGAT 1799 

I I I I I I I I I I I I I II I I I I 

I 

Db 12 41 A AGCCCACA TCCCTCCTTGGGCAGGGCA- 



GCAGCT 12 74 



Qy 1800 

TATGATAATAAATGGGTCAGCATTTTCTCCTCCGATGGGAAATTTAAGACAAAAATTGGA 1859 
III I I I I I I I I I I 

Db 12 75 GCTGCCA CAGC 

CTCCAGCG 1293 

Qy 1860 

TCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTGTGGACCGCAATGGGCACATTATTGTT 1919 

I I I I III I I I I I I I II II I I I 

Db 129 4 GCTG CCACTGTGGGCTCTGGGAGCC 

GGAGCGATGCTG-T 1331 

Qy 1920 

GTGGACAACAAGGCGTGCTGCGTGTTTATCTTCCAGCCAAACGGGAAAATAGTCACCAGG 19 79 

III II I I I I I I I III III I I I I II II 

I I I I I 

Db 1332 GTGAGAGGCA — GAGTGCCAAGGATGAAGCTGGCA-CTGAACAG — 

TAAGCGGCTCCAGG 13 86 

Qy 1980 

TTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTGCAGGTCCCCATTTTGCAGCTGTAAAT 2 039 

I II I I I I I I I I I I I I I I 

Db 1387 CCT CC TCTGGGCC CAGGGCC 

CAGC 1410 

Qy 2040 

AGCAATAATGAGATTATTATTACAGATTTCCATAATCATTCTGTCAAGGTGTTTAATCAG 2 099 

I I I I I I I I I I I 

I 

Db 1411 — CAAT TTCTGT 

TCTG 1424 

Qy 2100 

GAAGGAGAATTCATGTTGAAGTTTGGCTCAAATGGAGAAGGAAATGGGCAGTTTAATGCT 2159 
I I I I I I I I I I I I I I I I I 

I 

Db 1425 TTCCTGTAGAA CGCTC-TCTGGA 

TT 1448 

Qy 2160 

CCAACAGGTGTAGCAGTGGATTCAAATGGAAACATCATTGTGGCCGACTGGGGAAACAGC 2 219 

III I I I I I I I I I I III I I I I I I I I I I 

Db 1449 CCA TAGC — TGGAATC TCCTCTCT-TAGC — 

TCAGTGAAAAATAA 148 8 

Qy 2220 

AGGATCCAGGTTTTTGATGGGAGTGGATCATTTTTGTCCTACATTAACACATCTGCTGAC 2 2 79 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 1489 AAATCCCAAAT GGTGTG 

CCTACCTTCCCACTTCT 1522 

Qy 2280 

CCACTCTATGGCCCCCAAGGCCTGGCCCTAACTTCAGATGGTCATGTTGTGGTTGCAGAC 2339 

I I I I I I I I I I I 

I 

Db 1523 -TAC TGGCTTCCAGG 

AGTC 1540 



Qy 2340 

TCTGGAAATCACTGTTTCAAAGTCTATCGATACTTACAGTAATGGTGGGCAGGTGGATAC 2399 

I I I I I I I I I I I I III III I I I I I I I I I I 

Db 1541 T-TGGAG TTCATAGCCCCCCGA-GCCTGCCTTAAAGG GGTG — 

TCC 1582 

Qy 2400 - 

CCGCTTCCATGGTCTTGCACTATAAACTGGAATGGATTTCTCAATGCGGGACCAGATTA 2 45 8 
I I I I I III II I I I I I I I I I 

Db 15 83 TCCACCCCC CACCTACAGC TTCACAG- 

GAGGGG 1614 

Qy 2459 

TGACTAGAGTTTTTATGCCAGAAGGAATCATTGGTGAACTTTCCAAGGTTATTTCTGAAT 2 518 
I I I I I I I I I I I I I I 

Db 1615 AGAGGGCATC 

CAGTGCTAGG A 1635 

Qy 2519 

GTAACAATTTCCTTAAAAATGACTTATCCAATTTCTGTATTTCACCTTTAGGGTTAAAAA 2 5 7 8 

III I I I I I I I I I I I I I I I I I I I I I 

Db 1636 GTAGAAGTGTC TCCAG-CTCTG TTCTC — 

TTGGGG 1667 

Qy 2579 

AAACTCTTCTACTGAATCTATAAAAACTGCAGTTTTACATCTGTGAACTATGGCTTAAGG 263 8 

III I I I I I I I I I I I I I I I 

I 

Db 166 8 CCCT GGGTGAAGG 

TGGGGTCTGGGGCTTATGA 1699 

Qy 2639 

GACAGGATTTATGTAGCTAAACTAATTTTGCAAATCAAACAGACACTTAAAAAAACTAGC 2 69 8 
I I I I II III I I I I I II I I I 

Db 170 0 AATAGGT CTGGGCT- 

TTGAGGAGGATGGAGCAGCCTCAT 173 7 

Qy 2699 

ATATGTAAAGGTATTCGTTAATCCTGTGAATGGTAGCTTTTGCACAGAACTTCCAAAAGC 2 75 8 

I I I I I I II I I I I I II III I I I I I I I I I 

I 

Db 173 8 -TATGTGGGGAAGATGG GGCCTCTG GG — GCGT CACTGAG— 

ACCACAGGT 17 83 

Qy 2759 

AAAACAAAAACAAAATCTATTGTAGTTATATACTTCATTTAACCTAGGTCACAAGACCCA 2818

I III I I I I I I I I I I I 

I I 

Db 178 4 GGGGCCGGGGCTGGACC GCAGCTGT 

CTTGGGTGCCTGTGCCT 1825 

Qy 2819 

GGGAATCTTCTAACCTCACTTTTACAGTAGGTATTACTCTTGTGACATTTTTTTGGTTAT 2 8 7 8 

I I I I I I I I I I I I I I I I 

I I 

Db 1826 G CACCCCTCCTC AC C C TAG AG AC G 

GAAGAT 1855 



Qy 2879 

CAACAACTAAATATAAATTACTTTGGAAAAAGTAAGGCTGTCTTGCAAAATGATCCCAGC 2 93 8 

I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 1856 GTGCAAAAA GAAAGAAGGAAGG 

GCAACTGCATTCCAGC 1893 

Qy 2939 

TCTGATTAGCAGCCCTCTGGAGTTCAGAACTTAAGTATCAGTGCAAATTTCTC A 2 992 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 189 4 CC CACACTG TGATGACTT-TGAGCCTGTCC 

TTTCCCTCCTTGA 1935 

Qy 2993 

ACCTTTCTGGGTTAGACAAAGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTTGGCTC 3 052 

I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1936 GCCTGTCTTGCTT GTCCCCTGTAAAATGAACAGTCCCCCTCTCC 

CCC 1982 

Qy 3053 

ACCTTGTATCAGCAAACAAAGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCCTTGAA 3112 
I I I I I I I I I I I III I I I I III I I I I I 

I 

Db 19 83 AAATAGTAATA — ATACA TGTTTC-AAAGGGTGACC 

ATTTAT A 2 022 

Qy 3113 

AAGCATATTACAAAAGTAATGCTACCTTTTGGGAAACAAACTGCCCCGTTAACTCCAGAT 3172 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 023 AAGCATATGACAA ACCATAT CAATAAATG TAACTC — 

ATT 2060 

Qy 3173 

CATTGCACTGGAATGTAATCAAGAAAGTTAGTCATGTTTTATGTACCATGTTTTCACACG 3232 

III I I I I I I I I III I I I I 

I 

Db 2061 CTTT TAAAAAAAAAAAAAAAAGGC 

CACATG 2 09 0 

Qy 3233 TGTCTCTTCTCTTCGACTTCCTGAAAGCGAAAGCTTTAC 3271 

II I I I I I I I III I I I I I 

Db 2 091 TG CTCGAGCTGCAGGTCGCGGCCGCTAGAC 212 0 
Qy 54 CT- 

CTGGTCTTCGATGCACAGGAGTGGCCGTTATGGAACGCAGCAGCAGCGTGCAGGGTC 112 



Db 2 CTACTGGTCTT GTGTGGGTGTTTT C— CAGCA-CG- 

GAAAAGTC 41 

Qy 113 

AAAGACAGCCGGCCCCCCATGTCAGTGGTCTAGGATGGCCAGTGAAGGCACCAACATCCC 172 



Db 42 GAGATAA TTGCCAATGATC-AGGGAAACC- 

TCC 79 

Qy 173 AAGTCCTGTGGTGCGCCAGATTGACAAGCAGT- 

TTCTGATTTGCAGTATATGCCTGGAAC 231 



Db 8 0 AAGCTATGT CGCC TTTACGGACACTGAACGGTTGATCGGTG-ATGCC- 

GCAAA 13 0 

Qy 232 

GGTACAAGAATCCCAAGGTTCTCCCCTGTCTGCACACTTTCTGCGAGAGGTGCCTGCAGA 2 91 



Db 131 GAATCAAG — TTGCAATGAACCCCAC CAACACAGTTTTTG- 

ATGCCAA 175 

Qy 292 ACTACATTCCTGCCCACAG-TTTAACCCT- 

CTCCTGCCCAGTGTGCCGCCAGACCTCCAT 349 



Db 176 ACGTC-TGATTGGACGCAGATTTGATGATGCTGTTGTCCAGT 

CTGATATGAAA 22 7 

Qy 350 CCT — GCCCGAGAAAGGGGTGGCCGC-GCT- 

CCAGAACAATTTCTTCATCACAAACCTGA 405 

I I I I I I I I I I I I I I I I I I I I II I 

Db 22 8 CATTGGCCCTTTATGGTGGTGAATGATGCTGGCAGGCC 

CAAGGTCC 2 73 

Qy 406 

TGGACGTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAGTCTTCCATCCTGGAGACAG 465 

III III I I I I I I III II I I I I I I I I III 



Score 5280.5; DB 1; Length 2198;
Pred. No. 0;
Conservative 0; Mismatches 748; Indels 1633;


I 

Db 2 74 AAGT AGAATACAAGGGAGAGAC-CAAAAG CTTCTATCC- 

AGAGGAGG 318 



Qy 466 

TCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCACGATGGGAATGTGATGG 52 5 

I I I I I I I I I I I I I I I III 

I I 

Db 319 TGTCTTCTATGGTT CTGACAAAGA 

TGAAGG 3 48 

Qy 526 

AATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTGCACGGAGGGGGAGCACG 5 85 

I I I I III I I I I I I I I I I I I 

Db 3 49 AAATTG CAG AAGCCTACCTT 

GGGAAG 3 74 

Qy 586 

CAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAGCACAAGGCCTCGCTCCAGG 6 45 

I I I I I I I I I I I I I I I I I I I I III III 

Db 3 75 ACTGTTAC CAATGCTGTGGT CACAGTGCC- 

AGCT 40 7 

Qy 6 46 TCCAGCTGGATG-CTGTCAACAAAAGGCTCCCAGAAATAGATTCT- 

GCTCTTCAGTTCAT 703 

II I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I 

Db 40 8 TACT — TTAATGACTCTCAGCGTCAGGCTACC 

AAAGATGCTGGAACTATTGCTGGT 461 

Qy 704 

CTCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATCGTGGATGACATTCATTC 763 

III III I I I I I I I I I I I I I I I I I I 

I I 

Db 462 C T C AAT G T AC T T AG AAT TAT T AAT G AGC C AA CTGCTGC 

TGCTATTGCTTA 511 

Qy 764 

CACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGTGTGCTGCTTATGGAATT 823 

I I I I I I I I I I I I I I III III I I I I I I 

I 

Db 512 CGGCTTAGA CAAAAAGGTTGGAGCAGAAAG-AAACGTGCTCATCTT- 

TGACCTG 563 

Qy 82 4 GGAGGT— CAACTATG— GCCTCAAACACAAAGTCCTC- 

CAGTCGCAGCTGGATACTCTG 8 78 

I I I I I I II III I I I I I I I I I I I II I I I I I II 

I I 

Db 56 4 GGAGGTGGCACTTTTGATGTGTCAA 

TCCTCACTATTGAGGATGGAATCTTTG 615 

Qy 879 

CTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGCAGGCCCTCAACCAT 93 8 
I I I I I I I I I I I 

Db 616 AGGTCAAGT 

CTACAGCT 632 

Qy 939 GGCACGGAGACCGAGGTCCTACT — 

GGTGAAGAAGCAGATGAGCGAGAAGCTGAACGAGC 996

I I I I I I I I I I I I I I I I I I I I III I I I I I 



Db 633 G GAG AC ACCCACTTGGGTGGAGAAGATTTTGA- 

CAACCGAA 6 72 



Qy 997 

TGGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCTGGATTTCATCGTGG 1056 
I I I I I I I I I II I I I I I I 

Db 6 73 TGGTCAACCA TTTTATT 

GCTGAGTTTAAGCG 703 



Qy 1057 

AAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTAACCACCAACGCCG 1116 

II II I I I I I I III 
Db 70 4 CAA 

GCATAAGAAGGACAT 721 



Qy 1117 

TTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATCATCGGGCAGCCCA 1176
I I I I I II III I I I I I I I I 

Db 72 2 CAGTGAGAACAAGAGA GCTGT — AAGAC 

GCC— 750 



Qy 1177 

TGTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAAACCGGCAACGCCT 1236 
I I I I I II I I I I I I I I I I I I I 

Db 751 — TCCGT ACTGCTT GTGAAC- 

GTGCTAAGCGTAC 781 



Qy 1237 

ACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGGGAGATCCTGGACA 1296 

I I I I I II I I I I I I I I I I I I I I I I I 

Db 782 -CCTCTCTTCC AGCACCCAGGCCAGTATT 

GAGATC 815 

Qy 1297 

ACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGGGACTTTACCCTGT 1356 
I I I I I I I I I I I I I I I I I I I 

Db 816 GATTCT CTCTAT GAAGGAA- 

TCGACT 8 40 

Qy 135 7 CTCTGAGACTCTATGACCAGCACATCCGAGGCAGCCCGTTT AAGCTGAAAG- 

TGAT 1411 

III I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 8 41 -TCTATACCTCCATTACCCG 

TGCCCGATTTGAAGAACTGAATGCTGAC 887 



Qy 1412 

CCGATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGG 14 71 
I I I I I I I I I I I I II 

Db 888 CTGTTCC GTGGCACCC TG 

GACCCAGTA 914 

Qy 14 72 GAGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATG- 

TACAGCACTGGAA 153 0 

III I I I I I I I I I I I I I I I I I I I I I I 

Db 915 

GAGAAAGCCCTTCGAGATGCCAAACTAGACAAGTCACAGATTCATGATATTGTCCTGG — 972



Qy 1531 

AACGAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTACCAAAGGAAGAA 159 0 



I I I I I I I I I I I I

Db 9 73 TTGGTGGTTCTACTCGTA 

TCCCCAAG 99 8 

Qy 1591 

ATAAAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGGAAAGATATTAA 165 0 

II I I I I I I I I I II I I I I I I I I I I 

I I 

Db 999 ATTCAGAAGCTTCTCCAAGACT 

TCTTCAATGGAAAAGAACTGAA 10 42 

Qy 1651 

TTGCAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTC 1710 
I I I I I I I I I I I I I I I I I I I I 

Db 10 43 T AAGAGCATCAACC CTGATGAAG — 

CTGTT 1070 

Qy 1711 

GTTTTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGGAGTGGCTGTAC 17 7 0 
I I I I I I I I I I I I I I I I I I I 

I 

Db 10 71 GCTTATGGTGCAGCTGTCCAG 

GCAGC C 109 7 

Qy 1771 

ATCCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCT 183 0 

III I I I I 
Db 1098 ATC 

TTGTC 1105 

Qy 1831 

CCGATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTT 189 0 
III I I I I I I 

I 

Db 1106 TGG 

AGACAAG T 1116 

Qy 1891 

CTGTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCT 195 0 
I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1117 CTGAG AATGTT CAAGATTTGCTGC — 

TCT 1143 

Qy 1951 

TCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGT 2 010 
I III I I I I 

Db 1144 T GGA 

TGTCAC 1153 

Qy 2011 

TTGCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTATTACAGATTTCC 2 0 7 0 
III I 



Db 1154 TCCTC- 

TTTCC 1163 



Qy 2071 

ATAATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAA 213 0 
I I I I I I I I I I I I I I I I I I 



Db 1164 - 

GCTGGTGGAGTCAT- 



-CTTGGTATTGAAACT- 
1192 



Qy 2131 

ATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAA 219 0 
I I I I I I I I I I I I I III 

Db 1193 GACTGT CCTCATCAAGCGTA 

ATAC 1216 



Qy 2191 

ACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCAT 2 2 5 0 
I I I I I I I I I I I I I I I I 

Db 1217 -CACCATT CCTAC 

CAAGCAGACAC 1239 



Qy 2251 

TTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCCCTAA 2310 
I I I I I I I I I I I I I I III III III 

Db 12 40 AGACCTTCA CT-ACCTATTC — TGACAACCA — 

GCCTG 1272 



Qy 2311 

CTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGAT 23 7 0 
III III I I I I I II III 

Db 1273 GTG — TGCTTATTCAGGT 

TTATGAAGGCGAGCGTG 13 05 



Qy 2371 

ACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGGA 2 43 0 

II III I I I I I I I I I I I I I I I I I I I I 

Db 13 06 CCAT G AC A- AAGG AT AAC AAC CTGCTTGG - C AAG 

TTTGAACTCAC 13 48 



Qy 2431 

ATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCATT 2 49 0 
I I I I I I I I I I II I I I I 

Db 1349 AGGCATACCTC-CTGC AC 

CCCGA 13 70 



Qy 2491 

GGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTATCCAAT 2 55 0 

I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1371 GGTG TTCC TCAGATTG 

AAG T C AC T T T T G AC AT 1402 



Qy 2551 

TTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACTGCAG 2 610 
III I I I I I I I I I I I I I I I I I 

Db 1403 TGATG CCAAT — GGT ATACTC AATG— TCT 

CTG 1431 



Qy 2611 

TTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTTTGCA 2 6 7 0 
I I I I I II I I I I I I I I I I I III 

Db 1432 CTGTGGACAA — GAGTACGGGAAAAGA 

G AAC AAG ATT 1466 



Qy 2671 

AATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTATTCGTTAATCCTGTGAATG 2 73 0 



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 

Db 146 7 ACTATCACT AATGACAAGGGC — CGTT 

TGAGCAAG 1499 

Qy 2 731 GTAGCTTTTGCACAGAACTTCCA- 

AAAGCAAAACAAAAACAAAATCTATTGTAGTTATAT 2 789 

III I I I I I I I I I I I I I I I II I I I I I II I I I 
Db 150 0 GAAGACATTGAACGTATGGTCCAGGAAGCTGAG-AAGTACAAAG- 
C T G AAG AT GAGA 155 4 

Qy 2790 

ACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAGTAGG 2 8 49 
II III II I I I I I I I I I I 

I 

Db 1555 AGCAGAGG GA — 

CAAGGTGTCATCCAA G 1580 

Qy 2850 

TATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAAAAA 2 9 09 

I I I I I I I I I I I I I I I I I I II II I I I I I I I 

I 

Db 1581 AATT-CACTTGAGTCCTAT GCCTTCAAC 

AT G AAAGC AAC T G T T G AAG AT 162 8 

Qy 2910 GTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAG- 

TTCAGAAC 296 8 

I I II I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1629 GAGAAACTTCAAGGCAAGATTAAC GATGAGGA- 

C AAAC AG AAG AT T C T GG AC 16 79 

Qy 2969 

TTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTTGTGTG 3 02 8 
I I I I I I I I I I I I I III I I I I I I I III 

Db 16 8 0 — AAGTGT-AATGAAATTATC 

AACTGGCTTGATAAGAATCAGA 1719 

Qy 3029 

TTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAACAAAGTACTTCTTCAGGGAA 3 0 8 8 

II II II III I I I I I I I I 

I 

Db 1720 CT GC 

T G AG AAGG AAG AAT T T G AAC AT C AAC AGAAAG 1755 

Qy 3089 

ACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGGGAAA 314 8 

I I I I I I I I I I I I I 

Db 1756 AGCTGGA GAAAG 

TTTG 1771 

Qy 3149 

CAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGTCATG 32 0 8 

I I I I I I I I I I I I I I I I I I I I I III II 

I I I I 

Db 17 72 CAACCCCATCATCACCA-AGC — T G T AC C AG AG T G CAGG 

AGGCATG 1814 



Qy 32 09 TTTTATGTACCA — 

TGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAAAGC 32 66 



Ill I I I I I 1 1 1 1 1 1 1 1 II 

1 1 1 

Db 1815 CCAGGAGGAATGCCTGGGG GATTTCCTGGTGGTG- 

GAGC 1852 

Qy 3267 

TTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACCAGTATCCTAGGACAGAGAGCC 332 6 

I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1853 T CCTCC CTCTGGTGGT GCTTCCT CAGGGC- 

CC 1883 



Qy 332 7 ATAAGTAGCCCTTTGGAG- 

GACTGATGGTGTCAACCAAAGGCATGTGATTGATTAATGAT 33 85 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1884 A C C AT T G AAG AGG T T GAT T AAGC C AAC C AA- - 

GTGTAGATGTAGCATTGT 1931 



Qy 3386 TCC-CCCTTAGAAAGCAAGTGTT — 

ACCAAAGTTGTGTTATCTTGAAAGCATTACAGGTA 3442 

I I I I I I I I I I I II I I I I I I I 

Db 1932 T C C AC AC AT T T AAAAC AT T T G AAGG AC C T AAAT T - 
CGT 1968 



Qy 3443 

AGGGCATGTTATGGTTATTTATCATTGTTTAATGAATAGTAGAGGTGTCAAGGGACTATG 35 02 
III I I I I I II I I I I I I I I I I I I I I 

I I I I 

Db 1969 — AGCAAATTCTG TGGCA — GTTT — TAAAAAGTTAAGCTG 

CTAT- 2007 



Qy 3503 

TATACATGATTAGGGTAAGATAGAATGTATTATATATATATATATATATACACACACACA 3562 
I I I I I II I I I I I III I I I I I I 

I I I 

Db 2008 AGTAAGTTACTGGGCATTCTCAATACTTGAATATGGA 

ACA 2047 

Qy 3563 

TATATATAGCTGAATCTTTGGTGTATTGAAATAGGCAGCACTCTGAAAGACAGAAGCTTC 362 2 
III I I I I I I I I I 

I 

Db 2048 TAT 

GC AC AGGGGAAGGAAAT AAC AT T 2 0 73 



Qy 3623 GTCCAGCCACTCT- 

TCAGCACATTCCTTTACTAAGCAGTTTAAAGCCGTCCTAGTGGAGC 36 81 
I I I I I I I I I I I I III 

Db 2 0 74 G C AC T T T AT AAAC AC TGTATTGTA 

AGTGGAAA 2105 



Qy 3682 

AAGCCCTAAAGCAGATTTAATTTTTGCCATTTTCCAAGAATGACGGTGGTGGCTTTTAGT 3 7 41 

III I I I I I I I I I I I I 

Db 2106 ATGCAAT 

GTC T T AAAT AAAAC TAT TT 2131 



Qy 3742 

CAGAAAATGGCCTTCTGTGCTTTCAAAAAAAAAACAAAAAAAAAACCACACACACACATA 3 8 01 



1 1 1 1 1 



I I 



Db 2132 — AAAATTGGC— 

ACCATAAAAAAAAAAAAAAAAAAAAAAAAAGGCCACATGTGCTCGAG 218 7 



Qy 3802 AAAAACCCAACAGGTC 3817
I I I I I I I
Db 2188 CTGCAGGTC 2196
Qy 12

GCGGGCTGCGGGGAGCTAAGTCCCCAGATTGGAGGAGGCTGGCTCTGGTCTTCGATGCAC 7 1 



Db 219 8 GCGACCTGC AGCTCGAGCAC- 

CTTTTTTTTTTTTTTTTT 2154 




Qy 72 i 

GCCGGCCCCC 129 



72 AGGAGTGGCCGTTATGGAACGCAGCAGCAGCGTGCAGGGTCAAAGACA — 



I I I I I I 



I I 



Db 2153 TTTTTT TTATGGTGC- 

CAATTTTAAATAGTTTTATTTAAGACATTGCATTTTCCA 210 0 



Qy 13 0 CATG TCAGTGGTCTAGGATGGCCAGTGAAGGCACCAAC- 

ATCC- 171 



Db 2 099 CTTACAATACAGT- 

GTTTATAAAGTGCAATGTTATTTCCTTCCCCTGTGCATATGTTCCA 2 0 41 



Qy 172 CAAGTCCTGTG- 

GTGCGCCAGATTGACAAGCAGTTTCTGATTTGCAGTATATGCCT 2 2 6 



1 1 1 1 1 III 1 1 1 1 1 1 1 III I I I I 1 1 I III 

1 1 

Db 2 0 40 TATTCAAGTATTGAGAATGC-CCAG — 

T AAC T T AC T AT AGC AGC T T AAC T T T T TAAAACT 1984 

Qy 22 7 GGAACGGTACAAGAATCCCA AGGTTC- 

TCCCCTGTCTGCACACTTTCTGCGAGAGGT 2 82 

I I I I I I II I I I I I I I III III II 

I I 

Db 19 83 GCCACAGAATTTGCTACGAATTTAGGTCCTTCAAATGT 

TTTAAATGTGTGG- 1933 

Qy 2 83 GCCTGCAGAACTACATTCCTGCCCACAGTT TAACCCTCTCCT 

GCCC 328 

II I I I I I I III I III I I I I I I I I 

I I I I 

Db 1932 AACAATGCTACAT — 

CTACACTTGGTTGGCTTAATCAACCTCTTCAATGGTGGGCCC 18 7 8 

Qy 329 

AGTGTGCCGCCAGACCTCCATCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAATTT 3 8 8 

III II I I I I I I III I I I I I I I I I I I I I I 

I 

Db 18 77 TGAG-GAAGC ACCACCA GAGGGAGGAGC-TCCAC- 

CACCAGGAAA T 1835 

Qy 389 

CTTCATCACAAACCTGATGGACGTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAGTC 4 4 8 

II I I I I I I I I I I I I I I I I I I I I I I 

Db 183 4 CCCCCAGGCATTCCTCCTGG-CATGC CTCCTG 

CACTCTGG 1796 

Qy 449 

TTCCATCCTGGAGACAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAACCA 50 8 

I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 1795 -TACAGCTTGGTGA TGATGGGGTTGC 

AAACTTTCTCCAGCTCTTTC— 1751 

Qy 509 

CGATGGGAATGTGATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAGTG 56 8 
I I I I II III I I I I I 

Db 1750 TGTTGATGTT CAAAT 

TCTTCCTT 1728 

Qy 569 CACGGAGGGGGAGCACGCAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGA — 

ACAG 626 

II I I I I I I I III I I I I I III I 

I I 

Db 172 7 CTC AGCAGTCTGA 

TTCTTATCAAGCCAGTTGATAATTTCAT 16 8 7 

Qy 627 

CACAAGGCCTCGCTCCAGGTCCAGCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGAT 6 86 

III I I I I I I I I I III II I I I I I II 

Db 16 86 TACA CTTG- 

TCCAGAATCTTCTGTTTGTCCTCATCGTTAATCT 16 45 

Qy 687 

TCTGCTCTTCAGTTCATCTCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCATC 7 46 

III I I I I I I I I I I I I I I I I I I I I I II 



Db 1644 — TGCCTTGAAGTT— TCTC ATC-TTCAACAGTT 

GCTTTCAT- 1608 

Qy 747 

GTGGATGACATTCATTCCACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAGT 8 06 

I I I I I I I I I I I I I I I I I 

I 

Db 16 0 7 GTTGAAGGCAT AGGACT 

CAAGTGA 158 4 

Qy 807 

GTGCTGCTTATGGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCAG 866 
II I I I I I I I I I I I I I I 

Db 15 83 ATTC TTGGATGACACCT 

TGTCC 1562 

Qy 86 7 CTGGATACTCTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGC 

AACTTCACAGC 922 

I I I I I I I I III I I I I I I I I I I I 

I I I I 

Db 1561 CTCTGCTTCT CAT 

CTTCAGCTTTGTACTTCTCAGC 1527 

Qy 923 

GCAGGCCCTCAACCATGGCACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGA 9 82 
III I I I I I II I I I I I I I I I 

Db 1526 TTCCTGGACCAT AC 

GTTCAATGTCTTCCT 1498 

Qy 9 83 GAAGCTGAACGAGCTGGCCGACCAGGACTTCCCCTTG- 

CACCCGCGG G AG AAC G AC C AG C 10 41 

III I I I I I I I I I I I I II III I 

I 

Db 149 7 TGCT CAAACGG CCCTTGTCATTAGTGATAGTA 

ATC 1463 

Qy 1042 

TGGATTTCATCGTGGAAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCT 1101 

I I I I I I III II I I I I I I I I I I I I I I 

Db 1462 TTGTTCTCTT TTCCCGT ACTCTTG TCCACAGC— AGAGAC- 

ATTG 1421 

Qy 1102 TAACCACCAACGCCGTTGCCTCAGAGAC 

AGTGGCCACGGGCGAGGGGCTGCGGCAGA 1158 

I I I I I I I I I I II I I I I I I I I I I I I 

I 

Db 142 0 AGTATACCATTG GC AT C AAT G T C AAAAG T G AC T T C 

AATCTGAGGAACA 13 73 

Qy 1159 CCATCATCGGGCAGCC CATGTCCGTCA — 

CCATCACCACCAAGGACAAAGACGGT 1211 

II I I I I I I I I I I I I I I II I I I I I III 

I 

Db 13 72 CC TCGGGGTGCAGGAGGTATGCCTGTGAGTTCAAACTTGCCAAG 

CAGGTTGTT 1320 

Qy 1212 GAGCTGTG- 

CAAAACCGGCAACGCCTACCTCACCGCCGAACTGAGCACCCCCGACGGGAG 12 7 0 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1319 ATCCTTTGTCATGGCACGC-TCGCCTTCATAAAC-CTGAA- 

TAAGCACACCAGGCTGGT- 126 4 



12 71 CGTGGCAGA- 



CGGGGAGATCCTGGACAACAAGAACGGCACCTATGAGTTTTTGTA 132 4 

I I I I I I I I I I I I I I I I II I I I I I I 

I I 

Db 12 63 — TGTCAGAATAGGTAGTGAAGGTCTGTGTCTGCTTGGTAGG AATG- 

GTGGTATTA 1211 

Qy 132 5 CACTGTCCAGAAGGAAGGGGACTTTACCCTGTCTCTG 

AG AC T C T AT G AC C AG 13 76 

III II I I I I I I I I I I I I II I I I I III 

Db 1210 CGCT TGATGAGGACAGT — 

CATGACTCCACCAGCAGTTTCAATACCAAGGGA 1161 

Qy 13 7 7 CACATCCGAG- 

GCAGCCCGTTTAAGCTGAAAGTGATCCGATCCGCTGATGT 142 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 116 0 AAG AGG AG T G AC AT C C AAGAGC AGC AAATCT TGAAC — 

ATTCTCAGACTT 1113 

Qy 1427 

GTCTCCCACCACAGAAGGCGTGAAGAGGCGCGTTAAGTCCCCGGGGAGCGGCCACGTCAA 14 86 
I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1112 GTCTCCAGACA-AGATGGC 

TGCCTGGACAGCTGC 10 8 0 

Qy 1487 

GCAGAAAGCTGTGAAAAGACCCGCAAGCATGTACAGCACTGGAAAACGAAAAGAGAATCC 1546 
II I I I I I I I I I I 

Db 10 79 ACCATAAGCA 

AC AGC T T 1063 

Qy 1547 CATCGAAGACGATTTGATCT — TTCGAGTGGGTACCAAAGGAAGAAATAAAGGAG- 

AGTT 1603 

I I I I I II II I I I I I I I I I I II I I I I I I I I I I I I I 

I 

Db 1062 CATCAGGGTTGA — TGCTCTTATTC- 

AGTTCTTTTCCATTGAAGAAGTCTTGGAGAAGCT 1006 

Qy 16 0 4 TACAAATCTTCAGG 

GGGTAGCTGCATCTACAAATGGAAAGATATTAATTGCAGACA 1659 

I I I I I I I II I I I I I I I I I I I 

I I I I 

Db 10 05 TCTGAATCTTGGGGATACGAGTAGAACCACCAACCA 

GGACA 965 

Qy 1660 

GTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCAGTTCAAAAGTCGTTTTGGCA 1719 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 96 4 ATATCAT-GAATCTGTG ACTTGTCTA — 

GTTTGGCATCTCGAAGGGCTTTCT C 915 

Qy 172 0 TACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCA 

CAGGAGTGGCTGTACATCCCA 17 76 

I I I I I I I I I I I I III III II I I I I I I I I I I I 

I 

Db 914 TACTGGG TCCAGGG TGCCACGGAACAGGTCAGCATT — 



CAGTTCTTCAAA 867 



Qy 1777 

GTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAGCATTTTCTCCTCCGATG 1836 

I I I I I I I I I I III III I I I I I 

I 

Db 866 TCGGGCACG GGTAATGGAGGTATAGAAGTCG 

ATTCCTTCATAG 82 4 

Qy 1837 

GGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAAGGAGTTTCTGTGG 1896 



Db 823 AGAGAATCGATCTCAATACTGGC-CTGGG- 

GG 781 



Qy 1897 

ACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTATCTTCCAGC 1956 

III I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 78 0 TACGC — TTAGCACGTT CACAAGCAGTACGGAG 

GCGTCTTACAGC 73 8 

Qy 1957 

CAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGGGGACAGGCAGTTTGCAG 2 016 
III I I I I I III 

I 

Db 73 7 TCTC TTGTTCTC ACTG 

AT 720 

Qy 2017 GTCCCCATTTTGC AGCTGTAAAT- 

AGC AAT AAT GAG AT TAT TAT T AC AGAT T TCC AT 2072 

I I I I I I I I I III I I I I I I I I I I I III I I I I I 

I I 

Db 719 GTCCTTCTTATGCTTGCGCTTAAACTCAGCAATAAAATGGTTGACCATTCGGTTGT- 



Qy 2 0 73 AATCAT-TCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGG- 

CTCA 2129 

I I I I I I I I I I I I III I I I I I I I I II 

I I I I 

Db 66 0 AATCTTCTCCACCCAAGTG GGTGTCTCCA- 

GCTGTAGACTTGACCTCA 614 

Qy 2130 

AATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGA 2189 
I III I I I I I I I I I I I I I I 

I I 

Db 613 A AGAT TCCATCCTCAATAG-TGAGGATT 

GA 585 



Qy 2190 

AACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCA 2 2 49 
I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 58 4 CACATCAAAAGTGCCACCT 

CCCAGGTCAAAGATGAGCACGTTTCT 540 

Qy 2250 

TTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCAAGGCCTGGCCCTA 23 09 



Ill 1 1 1 1 1 1 1 III 1 1 1 1 I II III I 

Db 539 TTCTGCTCCAACCTT TTTG TCTAAG — 

CCGTAAGCAATAGC A 50 0 

Qy 2310 

ACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTTCAAAGTCTATCGA 2369 

I I I I I I I III I I I I I I I I I I I 

I I 

Db 499 GCAGCAGTTGG CTC 

ATTAATAATTCTAAGTACATTGA 463 

Qy 2370 

TACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTGCACTATAAACTGG 2 42 9 

II I I I I I I I I I I I I I I III I I I I I I 

I I I 

Db 462 GAC CAGCAATAG TTCCAGCATCTTTGG — TAGC-CTG- 

ACGCTGA 42 2 

Qy 2430 

AATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGCCAGAAGGAATCAT 2 4 89 
I III I I I I I I I I I I I I III 

I I I 

Db 421 GAGTCATTAAAGTAAGCTGGCACTG TGACCA CAG 



Qy 2 49 0 TGGTGA— 

ACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAAATGACTTATC 2546 
I I I I I III I I I I I I I 

I 

Db 384 TGGTAACAGTCTTCCCAAGGT 

AGGCTTCTG 355 



Qy 2547 

CAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTTCTACTGAATCTATAAAAACT 2 6 06 
I I I I I I I I I I I I I II 

Db 354 CAATTTC 

CTTCATCTTTGTC 335 

Qy 2607 

GCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTTATGTAGCTAAACTAATTT 2 666 
I I I I II I 

Db 334 

AGAACCATAG 325 

Qy 2667 

TGCAAAT C AAAC AG AC AC T T AAAAAAAC T AGC AT ATGT AAAGGT AT TCGTTAATCCTGTG 2726 
I I I I I I 

I I 

Db 32 4 AAGACAC 

CTCCTCTG 310 

Qy 2727 

AAT GG T AGC T T T T GC AC AG AAC T T C C AAAAGC AAAAC AAAAAC AAAATC T AT T G TAG T T A 2 7 86 
I I I I I I I I I I I I I I I I 

I I 

Db 3 09 GATAGAAGCTTTTGGTC TCTCCC 



Qy 2787 



TATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAACCTCACTTTTACAGT 2 8 46 

I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I 

I 

Db 2 83 TATTCTAC-TTGGACCTTGGGC-CTG CCA — GCATCAT-TCACCAC- 

CATAAAGGGC 233 

Qy 2847 

AGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATATAAATTACTTTGGAA 2 9 06 
I I I I I III III I I I I I I III 

Db 232 CAATGTTTCATATCAGAC 

TGGACAACAGCATCATCAAAT 194 

Qy 2907 

AAAGTAAGGCTGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGCCCTCTGGAGTTCAGA 2 966 

I I I I I III I I I I I I I I I I I 

I 

Db 193 CTG-CGT CCA 

ATCAGACGTTTGGCATCAAAA 16 4 

Qy 2967 

ACTTAAGTATCAGTGCAAATTTCTCAACCTTTCTGGGTTAGACAAAGATCCTTTTTTGTG 3 02 6 

III I I I I I I I I I I I III III 

Db 163 ACT GTGTTGGTG GGGTT 

CATTGCAACTT 136 

Qy 3 02 7 TGTTCTTTTCACCACCCCTTTGGC- 

TCACCTTGTATCAGCAAACAAAGTACTTCTTCAGG 30 85 

I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 135 GATTCTTTGC GGCATCACC GATCAACCG 

TTCAGT 102 

Qy 3086 

GAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGTAATGCTACCTTTTGGG 3145 
I I I I I I I I I I I I I II 

I I I I 

Db 101 GT CCGT-AAAGGCG ACATA GC 

TTGG- 78 

Qy 3146 

AAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATGTAATCAAGAAAGTTAGTC 32 05 
III III I I I I I I I I I I I 

Db 77 AGTGGTTCGGTTTC- 

CCTGATCATTG 53 

Qy 3206 

ATGTTTTATGTACCATGTTTTCACACGTGTCTCTTCTCTTCGACTTCCTGAAAGCGAAAG 32 65 

II I I I I I I I I I I I I I I I I I 

Db 52 GCAATTATCTCGACTTTTC — 

CGTGCTGGAA 2 4 

Qy 3266 CTTTACCTCCTGCAAATGTCAGCACATGTAGTAGG 33 0 0 

I I I I I I I I I I I I I I I 

Db 23 AACACCCACACAAGACC AGTAGG 1 



RESULT 15 
US-10-478-914-3/C 

; Sequence 3, Application US/10478914
; Patent No. 7335755
; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 3
; LENGTH: 2083
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-3

Query Match 12.8%; Score 4899.5; DB 1; Length 2083;

Best Local Similarity 38.0%; Pred. No. 0;

Matches 1319; Conservative 0; Mismatches 729; Indels 1421;
Gaps 187;

Qy 327

CCAGTGTGCCGCCAGACCTCCATCCTGCCCGAGAAAGGGGTGGCCGCGCTCCAGAACAAT 386

Db 2083 CTAGCG-GCCGC — GACCTGCA GCTCGAGCAC-ATGTGGCCTTTTT-

TTTT 2038



Qy 387 

TTCTTCATCACAAACCTGATGGACGTGCTGCAGCGAACTCCAGGCAGCAACGCTGAGGAG 4 46 
I I I I I III I I I I I III 

I I 

Db 2037 TTTTTTTTATTTTGCCT TCTTGCT CTGG 

TGGTTAG 2 0 03 



Qy 447 

TCTTCCATCCTGGAGACAGTCACTGCTGTGGCTGCGGGAAAGCCTCTCTCTTGCCCAAAC 5 06 

III I I I I I I I I I I I I II I I I I I I I 

Db 2002 TCT — CCTACTATAAACA TGATTT TCCTTCTT- 

CCC 1970 



Qy 507 

CACGATGGGAATGTGATGGAATTTTACTGCCAGTCCTGTGAGACTGCCATGTGTCGGGAG 566 

III I I I I I I I I III 

Db 1969 TTCAG — AGTCCTGT 

CAAT 1953 



Qy 567 

TGCACGGAGGGGGAGCACGCAGAGCACCCCACAGTTCCACTCAAGGATGTGGTGGAACAG 62 6 

I II II I I I I III 

I I I I 

Db 1952 CTTTACTTTTTTTCTCATTAATAT- 

ACAG 1925 



Qy 62 7 CACAAGGCCTCGCTCCAGGTCCA- 

GCTGGATGCTGTCAACAAAAGGCTCCCAGAAATAGA 685 



II 1 1 1 1 1 I 1 1 I II III I 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1924 

AAGTTATACTACAGATACATTCAAGCTCATGACTGCTAAAAACTTCCTGAAATG — 18 71 

Qy 686 

TTCTGCTCTTCAGTTCATCTCTGAAATCATTCATCAGTTAACCAACCAAAAGGCCAGCAT 7 45 

II I I I I I I II I I 

I I 

Db 18 70 AGC C T T AAT T CAT AC 

GAAT 1852 

Qy 746 

CGTGGATGACATTCATTCCACCTTTGATGAGCTCCAGAAGACTTTAAATGTGCGCAAGAG 8 05 

II I I I I I I I I I II III I I I I 

I I I 

Db 1851 CAT TTCTA — TTTGTT CT 

TTTTTTTTTGAGACAGA- 1819 

Qy 806 

TGTGCTGCTTATGGAATTGGAGGTCAACTATGGCCTCAAACACAAAGTCCTCCAGTCGCA 865 

III III III I I I I 

Db 1818 CTCTTACTC — 

TGTCGCCCAG 1800 

Qy 866 GCTGGA- 

TACTCTGCTCCAGGGGCAGGAGAGCATTAAGAGCTGCAGCAACTTCACAGCGC 92 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1799 GCTGGAGTGCACTG GCAT GATCT-CGGC 

TCACTGC — 1766 

Qy 925 

AGGCCCTCAACCATGGCACGGAGACCGAGGTCCTACTGGTGAAGAAGCAGATGAGCGAGA 9 8 4 

I I I I II I I I I I I I I I I I I I I I I I 

Db 1765 — AACCTCTGCCT CAC-GAGTT — AGGTC 

AAGCAATT 173 4 

Qy 985 

AGCTGAACGAGCTGGCCGACCAGGACTTCCCCTTGCACCCGCGGGAGAACGACCAGCT-G 10 43 

II I I I I I I I I I I I I I I I I 

I 

Db 1733 CT TCTGCCTCAGCCTCCCGAGG 

AGC TAG 1706 

Qy 1044 

GATTTCATCGTGGAAACCGAGGGGCTGAAGAAGTCCATCCACAACCTCGGGACGATCTTA 1103 
I I I I I I I I I I I 

Db 1705 GAT T AC 

AGGTCCCT 1692 

Qy 1104 

ACCACCAACGCCGTTGCCTCAGAGACAGTGGCCACGGGCGAGGGGCTGCGGCAGACCATC 1163 
I I I I I I I I I I I II I II I I I I I I I 

I I 

Db 1691 GCCACCA TGCCTGAGTAATTTTTGTATTTTTAGTAGAGATGGGG 

TTTC 1644 

Qy 1164 

ATCGGGCAGCCCATGTCCGTCACCATCACCACCAAGGACAAAGACGGTGAGCTGTGCAAA 12 23 

III I I I I I I I I I II I I I I I I 

I 



Db 16 43 ACCATGTTAGCCAAGCTGGTCTCGAATTCC- 

TGACCTCAGGTGA 16 01 



Qy 1224 

ACCGGCAACGCCTACCTCACCGCCGAACTGAGCACCCCCGACGGGAGCGTGGCAGACGGG 12 83 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 16 0 0 TCCGCCCACCTCAGCCTC-CCAAAGTGCTGGGATT ACGGGCGTGAGCC — 

ACGGT 15 49 

Qy 1284 

GAGATCCTGGACAACAAGAACGGCACCTATGAGTTTTTGTACACTGTCCAGAAGGAAGGG 13 43 
I I I I I I I I I I I I I I I I II 

Db 1548 G CCTGG 

CCTCTATTTTTTATTTTAAATACCT 1518 

Qy 1344 GACTTTACCCTG TCTCTGAGACTCT- 

AT G AC C AGC AC AT C C G AGGC AGC C C G T T T AA 1399 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1517 TTTGCCCTGAAATCTATGT-CCTCTGATTATCAACACTGCTTA 

CCAGTTTTA 146 7 

Qy 1400 

GCTGAAAGTGATCCGATCCGCTGATGTGTCTCCCACCACAGAAGGCGTGAAGAGGCGCGT 1459 
I I I I I I I I II 

I 

Db 1466 

TTTTTGTTGGTATTTATTTCA T 1445 

Qy 1460 

TAAGTCCCCGGGGAGCGGCCACGTCAAGCAGAAAGCTGTGAAAAGACCCGCAAGCATGTA 1519 

I I I I I I I I I I I I II I I I I I 

I I 

Db 1444 TCAATCCCAG CTCTACAA CT TACCAG 

TATA 1415 

Qy 1520 

CAGCACTGGAAAACGAAAAGAGAATCCCATCGAAGACGATTTGATCTTTCGAGTGGGTAC 15 79 
I III III II I I I I I I I I 

Db 1414 AAAATTTGG ATCAGTTAC 

TTAATCTTGC 1387 

Qy 1580 

CAAAGGAAGAAATAAAGGAGAGTTTACAAATCTTCAGGGGGTAGCTGCATCTACAAATGG 1639 

I I I I I I I I I I I I I 

I I I I I I 

Db 1386 AGTT TCTTC 

CTTTTAAAAATGG 1365 

Qy 1640 

AAAGATATTAATTGCAGACAGTAACAACCAATGTGTGCAGATATTTTCCAATGATGGCCA 1699 

I I I I I I I I I I I I I I I I I I I 

I I I 

Db 136 4 AATAATAT TAT C TAT C ATAT 

AAAATGCCCA 1335 

Qy 1700 

GTTCAAAAGTCGTTTTGGCATACGGGGACGCTCTCCGGGGCAGCTGCAGCGGCCCACAGG 1759 

I I I I III III II I I I I I I I I 



Db 1334 GTTAA 

CTACAATGTGCCA 13 0 0 



-CAAATATGAACTAACAC- 



Qy 1760 

AGTGGCTGTACATCCCAGTGGGGACATAATCATTGCCGATTATGATAATAAATGGGTCAG 1819 

III III 
Db 1299 — GGC 

ACCA 12 93 

Qy 1820 

CATTTTCTCCTCCGATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACC 18 79 
I I I I I I I I I II I I I I I I I I I I II 

I I I I 

Db 1292 TTCT AGGGAA TAGAATGTAAATAAGTTCAAGTT— CT- 

TGACC 12 53 

Qy 1880 

CAAAGGAGTTTCTGTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTG 1939 
II I I I I I I I I I I I I I 

I 

Db 1252 TGAATAAGTTT ACATTTTAGT 

GAGAAG 1226 

Qy 1940 

CGTGTTTATCTTCCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCGAGGAAATGG 1999 
I I I I I I I I I I I I I I I 

Db 1225 ACAGACAA 

TAAATCAATAACATGT 1202 

Qy 2000 

GGACAGGCAGTTTGCAGGTCCCCATTTTGCAGCTGTAAATAGCAATAATGAGATTATTAT 2 059 



Db 12 01 CCATTATGC — CAG-- 

TTT 1183 



Qy 2060 

TACAGATTTCCATAATCATTCTGTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAA 2119 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1182 TAAATGTCACAAAGAAAA AGGGGGAGCAATCAGATAG TGATG- 

GAA 113 8 

Qy 2120 

GTTTGGCTCAAATGGAGAAGGAAATGGGCAGTTTAATGCTCCAACAGGTGTAGCAGTGGA 2179 

III III I I I I I I I I I I I I I II II I I I I I 

Db 113 7 GTTACACTC TAGAGAGGG 

T GG AC AG AAAAAG T T T C T C T T AGG AG AT G 1091 

Qy 2180 TTCAAATGGAAACATCATTGTGGCCGACTGGGGAAACAGCAGGATCCA 

GGTTTTTGA 2236 

II II I I I I II I I I I I I I I I I III I 

I I I 

Db 109 0 -ACATCTG — AAC AG AG AT C T AAAT G AC T GAG T AC AAAC C 

CCAATGGAAAAGTGA 1039 

Qy 2237 

TGGGAGTGGATCATTTTTGTCCTACATTAACACATCTGCTGACCCACTCTATGGCCCCCA 2 2 96 

I I I I I I I I I I III I I I I I I I II III III 

Db 103 8 TTGGTGTG TTGTGAAACAGCAACACAGC— 



C AG T T T G AC TAG AT G - 



9 9 6 



Qy 2297 

AGGCCTGGCCCTAACTTCAGATGGTCATGTTGTGGTTGCAGACTCTGGAAATCACTGTTT 2356 



Db 995 AG TAGATCAGAATGAC TGAGGTGG — GACTGTGG 

TCA T 96 0 

Qy 2357 

CAAAGTCTATCGATACTTACAGTAATGGTGGGCAGGTGGATACCCGCTTCCATGGTCTTG 2 416 

I I I I I III I I I I I I I I I I III I I I I I I 

I I 

Db 959 CAAAG AGAT CAG-AGAGGTAGCCTG — GGAAGGCCACATCACAG — 

AATG 915 

Qy 2417 

CACTATAAACTGGAATGGATTTCTCAATGCGGGACCAGATTATGACTAGAGTTTTTATGC 2 4 76 

III I I I I I I I I I I I I I 

Db 914 TCCTTTGGGC CTCTGTAAG 

AGCGTTTT 888 

Qy 2477 

CAGAAGGAATCATTGGTGAACTTTCCAAGGTTATTTCTGAATGTAACAATTTCCTTAAAA 2 536 

I I I I I I I I I I I I I I I I I I I I I 



Db 887 G AAAAAAC AG T T C T GAG CAAAGGTTA 

GTCATTTAAC 852 

Qy 253 7 ATGACTTATCCAATTTCTGTATTTCACCTTTAGGGTTAAAAAAAACTCTT 

CTACTGA 2593 

I I I I I I I II I I I I I I II I I I I I III 

I I I 

Db 851 TTGACTTA CT CTTTAG 

AAGGATCACTTTGGCTATTGA 815 

Qy 2 59 4 ATCTATAAAAACTGCAGTTTTACATCTGTGAACTATGGCTTAAGGGACAGGATTT- 

ATGT 2652 

I I I I I I I I I I I I I I I I II I I I I I I I I 

I 

Db 814 GTGGCAAACAACTGGAGAAGTGGTTAGGGGAGCACTG 

TGGGCAGGAATAGAAGG 761 

Qy 2653 

AGCTAAACTAATTTTGCAAATCAAACAGACACTTAAAAAAACTAGCATATGTAAAGGTAT 2 712 
II I I I I I I I I I III 

Db 76 0 AG GAAAAGCAGACTGG 

AGCC 741 

Qy 2713 

TCGTTAATCCTGTGAATGGTAGCTTTTGCACAGAACTTCCAAAAGCAAAACAAAAACAAA 2 7 72 

I I I I I I I I I I I I I II I I I I 

I 

Db 740 TAATCCAGT ACTTC 

AGATGAGAAATGACAGA 710 

Qy 2773 

ATCTATTGTAGTTATATACTTCATTTAACCTAGGTCACAAGACCCAGGGAATCTTCTAAC 2 832 

I II I I I I I I I I I I I I I III 
Db 709 TTTGCCTG GTACATCATTT — CCTA 



TCCC- 



683 



Qy 2833 

CTCACTTTTACAGTAGGTATTACTCTTGTGACATTTTTTTGGTTATCAACAACTAAATAT 2 892 

I I I I I I I I I I I III I I I I I I II I I I I 

I 

Db 6 82 TTTACT TATAACTTAT-TGGTGTATTTTTATTT 

TAAGTGT 6 44 



Qy 2 893 AAATTACTTTGGAAAAAGTAAGGC- 

TGTCTTGCAAAATGATCCCAGCTCTGATTAGCAGC 2951 

I I I I I III I I I I I I I I I I I I I I I I I 

Db 6 43 AAATT — TTT ATAAATGAAGCATAATTTGAAGATTGAT- 

ATGTTCGGA 599 



Qy 2 952 CCTCTGGAGTTCAGAACTTAAGTATCA — 

GTGCAAATTTCTCAACCTTTCTGGGTTAGAC 3009 

I I I I I I II I I I I I I III I I I I I I I I II 

Db 59 8 CTTGTGTT TTCT — ATTATCATGGTG TCTCTCCACATACCT 

TT 558 



Qy 3010 

AAAGATCCTTTTTTGTGTGTTCTTTTCACCACCCCTTTGGCTCACCTTGTATCAGCAAAC 3 069 
I I I I I I I I I I I I I II I I I I I II 

Db 557 TCCCTTT AGTTCT GCTTCTTATTGTGCTCCTCAAGT- 

CTATT 517 



Qy 3070 

AAAGTACTTCTTCAGGGAAACCTGAAATTTCTAATGCCTTGAAAAGCATATTACAAAAGT 3129 
I I I I I I I I III I I I I I I I I I I I I I III 

Db 516 TTTGTTCTT-TTC CCT- 

CTACTTCTACTGTTTTGAACTGTAT 477 



Qy 313 0 AATGCTA — 

CCTTTTGGGAAACAAACTGCCCCGTTAACTCCAGATCATTGCACTGGAATG 318 7 
I I I I I I I I I I I I I I I I I I I I I I 

Db 4 76 -ATGCTATTTCTATTTTGCCAGGAGTTACCCCCTTCAATT 

TTCGTTGTACT 427 



Qy 3188 T AAT C AAG AAAG T TAG T C ATGT T T T ATGT AC CATGTTTT C AC AC G T — 

GTCTCTTCTCTT 3245 

II I I I I I I I I I I I I I II I I I I I I I 

I 

Db 426 — CTC 

TCAGTCTCAAACACATACCCATGCTCTCTTCTGTCCTTCTCTTCATAT 3 76 



Qy 3246 

CGACTTCCTGAAAGCGAAAGCTTTACCTCCTGCAAATGTCAGCACATGTAGTAGGACACC 33 05 
I I I I I III I I I I I I II I I I I I I I I I I I 

I I 

Db 3 75 CATCTTCTAGAA TTAGTTCTAC — CC — 

CGATTTTCTACCAATTTATTTTATTTCC 324 

Qy 33 06 AGTATCCTAGGACAGAGAGCCATAAGTAGCCCTTTGGAGGACTGATGGTGTCA— 

AC 3360 

I III I I I I I II III III III II II I I I I 

Db 323 A — ATC — 

AGTACATT GAT T C AG AT T T AT C AAT AT G T T T T AC C AAT TTTATCATTTTTTT 2 6 8 



Qy 3361 CAAAGGCATGTGATT— 



GATTAATGATTCCCCCTTAGAAAGCAAGTGTTACCAAAGTTG 3417 

I I I I I I I I II I I I I I II I I I I I II II II 

I 

Db 26 7 TGGAGGCTTGTTATTTAAGGTTCATTTTT CTTGGAGTACA 

TCCTTCAGAAG 217 

Qy 3418 

TGTTATCTTGAAAGCATTACAGGTAAGGGCATGTTATGGTTATTTATCATTGTTTAATGA 3 4 7 7 

I III III III I I I I I I I I I I I I I I I 

Db 216 TTCTTTC AGC AAGAAC 

TCATGGTTGTTAATCTTT 183 

Qy 3478 

ATAGTAGAGGTGTCAAGGGACTATGTATACATGATTAGGGTAAGATAGAATGTATTATAT 353 7 

I I I I I I I I I I I I I III 

Db 182 GAGTC CTTTGT TTCTGG 

TGT 163 

Qy 3538 

ATATATATATATATACACACACACATATATATAGCTGAATCTTTGGTGTATTGAAATAGG 359 7 

III I I I I I I I I I I I I I II III I 

I I 

Db 162 TTTAACTTTTTCCCCTCACTCT TGAGTAATAG 

TTTAGCTGGG 121 

Qy 359 8 CAGCAC— 

TCTGAAAGACAGAAGCTTCGTCCAGCCACTCTTCAGCACATTCCTTTACTA 365 4 

III I I I I I I I I III I I I I I I I III 

I 

Db 12 0 CATAAAATTTCAGATTGGCAGTTACTT TTCCTTGGCAC 

TACGA 78 

Qy 3655 AGCAGTTTAAAGCCG-TCCTAGTGGAGCAAGC — 

CCTAAAGCAGATTTAATTTTTGCCAT 3 711 

II I I I I I I I I I II I I I I I I I I I I I I I 
Db 7 7 AG ATGCCGCTCC 

GTTATCTTAT T AC AGC C AAT AT T AC T AT T G 36 

Qy 3712 TTTCCAAGAATGACGGTGGTGGCTTTTAGTCAGAAAATGGCCTTCTGTG 3760 

I I I I II III I I I I I I I I I I I I I I 
Db 35 -TTCC TTTGTAGGTAATCTC CAGTA GGCCAACAGTG 1 
RESULT 1
US-10-478-914-4 

Sequence 4, Application US/10478914 

Patent No. 7335755 

GENERAL INFORMATION: 
APPLICANT: NAKAGAWARA, AKIRA 

TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
FILE REFERENCE: 7388-80899 

CURRENT APPLICATION NUMBER: US/10/478,914
CURRENT FILING DATE: 2003-11-26 
PRIOR APPLICATION NUMBER: PCT/JP02/05294 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-162775 
PRIOR FILING DATE: 2001-05-30 
PRIOR APPLICATION NUMBER: JP 2001-255226 
PRIOR FILING DATE: 2001-08-24 
NUMBER OF SEQ ID NOS : 417 
SOFTWARE: PatentIn version 3.2
SEQ ID NO 4 

LENGTH: 3210 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-478-914-4 

Query Match 35.5%; Score 6914.5; DB 1; Length 3210; 

Best Local Similarity 45.9%; Pred. No. 0;

Matches 1677; Conservative 0; Mismatches 1161; Indels 815; 
Gaps 225; 

Qy 4 GCGCGCCCGAGCCTAGT-CCCCACGCC— GCGGCGCGCCCGGGCTCCCTGC- 

TGATCCCA 59 

III I I I I I I I I I I I I I I II I I I I I I I I I I I II 

I I I I 

Db 1 GCACTGTTG-GCCTACTGGCCCAGGCCTTGC- 

ACACGCAGGTGCTCTCTTAGTGCTCCCT 5 8 

Qy 6 0 GAACAATCAACCATGACGACCGAATCTGGATCAG- 

ACTCGGAATCCAAGCCGGACCAGGA 118 

I III III II III I I I I I I I I I I I II II III 

Db 59 GG CAAGGATGG-GAAGGCAT GTTCAGTCCTAGGAGT — 

AGGAAGGGGCAGAG 107 

Qy 119 G-GCCGA — GCCCCAGGAG-GCGGCGGGGGCGCAGGGGCGGCGGG 

GGCCGTGC 167 

II II I I I I I I I I I I I I II I I I I I I I I I I I I 

I 

Db 108 GTGTTGATGGCCCCTACAGAGCGGCCAAGGACAAGGAGCTGCTGTTCGAAACAGCC- 

TTC 166 

Qy 16 8 CGG AGCCGCCCA AGGAGGAGCAGCAGCAGGCC — CTGG — 

AGCAGTT 210 

II I I I I I I I I I I I III I I I I I I I I 

Db 167 

CTGCTCCCCAACCTGCCTCCCACCCAACAGGTTTTGCATATACTCTACTGGGAAGAGGGA 2 2 6 



Qy 211 CGCCGCCG-CTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAGGA- 

ACAGG 26 8 



Db 22 7 CACACCCGACTGCA — TCACTGCCCTCCAAGTCTCTCCCTG- 

CCCTGTCCAGCATCCAGG 2 83 

Qy 269 AGTTTGCTGCC AGGGCTGC 

AAAAC AGC T C G AAT AT C AGC AAT T AGAAGAC 318 

I I I I I I I I I I I I I I I I I I I I 

Db 284 A GCACCCCTAGTTGGGGAAGCTTCTGTGACTCCCCCTACAACAGC- 

CTAGGATGGA 33 8 

Qy 319 G ATAAACTTTCTCAGAAATCA-TCT 

AGCAGTAAACTCTCTCGGTCTCCA 366 



Db 339 GTGGGGTTTGTGAACAAATGCAGAAGGCAGTCTTAGGGAGGTCAGCTGACATG — 

CCCC- 395 

Qy 36 7 TTAAAGATTGTCAAAAAGCCTAAAAGCATGCAGT- 

GC AAAG T GAT AC T T C T C G AT GG AT C 42 5 



Db 396 TGGCCTGTGGCTGGGAAGTA-GCAGAGGCTAA 

CGCTC 441 

Qy 42 6 AG AAT AT AC C T G T GAT G T AG AG AAAC GC T • 
TGCTGT 472 



Db 4 42 TGGGGTTGCCAG 

GAGTAGCACTGGATCAGTCAGGTGACAGGGCTCTCCTCTCT 49 4 

Qy 4 73 TTGATAAAGT 

G T G T G AAC AC T T G AAC T T GC T AG AGAAAGAC T AC TTTGGGCTTACGT 529 



Db 495 CTGAGCAGGTCCGGTGGCAGCCTTCAAGGTCGCC — 

ACGCCGTATTCCCTGTATGTCTGT 552 

Qy 53 0 ATCGAGATGCTGAA AACCAGAAGAATTGGTTGGACCCTG 

CTAAGGAA 5 76 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 553 CCCGAGGGGCAGAACGTCACCCTCACCTGCAGGCTC — 

TTGGGCCCTGTGGACAAAGGGC 610 

Qy 577 ATAAAAAAACAGGTTC GAAGTGGTGCTTGGCACTTTTCATTTAATGTG 

AAA 627 



Db 611 ACGATGTGACC— TTCTACAAGACGTGGTAC 

CGCAGCTCGAGGGGCGAGGTGCAGA 66 4 

Qy 628 TTTTATC 

CACCAGACCCTGCCCAACTATCTGAAGATATCACCAGGTACTACCTCTG 6 83 



Db 665 CCTGCTCAGAGCGCCGGCCCATCCGCAAC CTCACGTT 

CCAGGACCTTCACCTG 717 



I I I I I I I I 
— GGTTCTTCCC 



-CCAGAGGACAAG- 
III I I I I I 



Qy 684 

CTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGCCCTGCTCCTTTGTTACCCTGGC 7 43 

I I I I I I I I I I I I I I I I I I I III 

Db 718 C ACCATGGAGGCCACCAGGCTG-CCAACACC 

AGCCACGAC 756 

Qy 744 

CTTGCTGGGCTCCTACACTGTCCAGTCAGAGCTCGGAGACTATGACCCAGATGAATGTGG 8 03 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 757 CTGGCTCAGCGCC ACGGGCTGGAGTCG— GCCTCCGACC 

ACCATGG 800 

Qy 804 GAGCGATTACATTAGTGAGTTCCGCTTTGC 

AC C AAAC C AC AC T AAAG AAC T G 855 

II I I I I I I I I I I I I I I I I I I I I I 

Db 801 CAACTTCTCCAT 

CACCATGCGCAACCTGACCCTGCTGGATAGCGGCCTCT 85 0 

Qy 856 GAAGAC AAAGTGATCGAGCTGCACAA GAGCCACAGAGG- 

AATGACGCC 9 02 

I II I I I I I II I I I I I I I I I I I I I III 

I I I 

Db 851 ACTGCTGCCTGGTGGTGGAGATCAGGCACCACCACTCGGAG- 

CACAGGGTCCATGGTGCC 9 09 

Qy 903 A 

GCAGAAGCAGAGATGCATTTCTTGGAAAATGCCAAAAAATTATCAATGTATG 955 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 910 ATGGAGCTGCAGGTGCAGACAGGCA AAGATGC 

ACCATCCAACTGTGTG 957 

Qy 956 

GGGTAGATTTACATCATGCTAAGGACTCAGAAGGGGTAGAAATTATGTTAGGAGTTTGTG 1015 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 958 GTGTA CCCATCCTCCT CCCAGGAGAG — TGAAA 

GTAAGG 99 4 

Qy 1016 CAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTGCCTGGCC — 

CAA 10 73 

I I I I I I I I I I I II 

Db 995 GACCAACCTC TTGCC 

CCTTTTG 1016 

Qy 1074 

GGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCCGGCCGGGAGAGTT 1133 

I I I I I I I I I I I I II I I III I I I I I I I I I I 

I 

Db 1017 GGTTCT-CTGTTTTCTT CTGTCCTCATCCTGCACCCAGACCCTGTTTGGA- 

ACTC 1069 

Qy 113 4 TGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCATC- 

GAGCTGCCAAGCGTT 1192 

III I I I I I I I I I I I I I I I I I I II 

I 

Db 10 70 TGGCCTCAT CACC CCAAGCCCTCAGAACCCCC 

CGGT 1105 



Qy 1193 T AT G G AAAG TATGTGTTG - AGC AT CAT AC AT T T T T C AG AC T AC T - 

GTTACCAGA 12 44 



Db 1106 CCTCCTCCTTTTCTGCTGCTGCA- 

CATCCCTTCTGCTTCCTCCTTGGTGCAATCCCCAGA 1164 

Qy 1245 AGCAC-CTCCCAAGAAATTCCTAACCTTGGGT TC— 

CAAGTTTCGTTATAGTGG 1295 



Db 1165 AGCCCACTCTCCTTCCAT — 

CTGCTCTGGAGTCTCTGCTCCTCTTGACTCTCTGGAGTGG 1222 

Qy 1296 CAGGACACAAGCGCAAAC GAGAAGAGC C AG TGCGT — T GAT AG AT - 

CGC 1341 



Db 1223 

CTGTGCCTTGGCAGTGACCTTTGGCCAGGGCAAGTGCCTCATGACAGGTACTGGGTGCCC 12 82 

Qy 1342 CCAGCACCTTACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCG 

CAGCTTG 139 8 



Db 12 83 CAGGCAGCTAAGT GCCGCCCTGCCCACCAGC 

CCCCTATGGCTTGGGAAGGCTGG 1336 

Qy 1399 G 

ATGGAGCATCAGTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCT 1452 

I II I I I I I I I I I I I I I I I I I I I I I 

Db 133 7 GGGTCCTCTTGGCCAACAGGG — TGAAACCCC — ATCTCTAC- 

T AAAAAC AC AAAAAT T A 1391 

Qy 1453 GCTGCAG— AGGTTGGTACTGGCCAGTACGCCACAACAAAAGGCATCTCTC- 

AGACCAA- 1508



Db 1392 GC — CAGGCATGGTGGTGCGCGCCTGTA-GTCCCA 

GCTACTCAAGAGGCTGAG 1441 

Qy 1509 -CTTGATCACCACTGTGACTCCGGAGAAGAAGGCTGAGGAGGAGCGGGA- 

CGAGGAAGAG 1566 



Db 1442 

GCAAGAGAATCGCTTGAACCCGGGAGTCAGAGGTTGCAGTGAGCCGAGATCGCGCCACTG 1501

Qy 1567 GACAAACGGAGGAAGGGGGAAGAAGTCACGCCCATCTC-

GGCCATCCAGCACGAGGGAAA 1625 

II I II I I I I I I I I I I I I I I I I I I 

Db 1502 CACTCCAGCCTGCTGATAGAGCAAGACT CCATCTCAAAAAAAACAACCC- 

AAAATTT 1557 

Qy 1626 GACTGACA — GTGAGC — GCA- 

CGGACACCGCAGCCGACGGGGAGACCACTGCCACTGAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 155 8 GCCTGGCATGGTG-GCAGGCATCTGTAATCCCAGCTACTCGGGAG 

GCTGAGACATGA 1613 



Qy 1681 

GAGCTAGAAAAAACTCAAGATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAA 1740



Db 1614 GAGCT-GCTTGAACCTGGGAGG — 

C AG AAG T T GC AG T G AGC C GAG AT C AC AC C AC TGC A- 1669 

Qy 1741 AGAACCTTCTTAGAAACCTCAAC-AGACACTG— CCGTAA CGAATGAA- 

TGG 1788 



Db 1670 — 

CTCCAGCCTGGGTGACAGAGCGAGACTCTGTCCCAAAAAATCAAAAAAATCACTTTTG 172 7 

Qy 1789 GAGAAGAGGCTTTCCACCTCC CCCGTGC — GAC- 

TGGCCGCCAGGCAGGAGGATGC 18 41 



Db 172 8 GTAGAGATGCACTCTCGCTATGTTGCCCAGGCTGGTCTTGAACTCCTGG- 

GC 1778 

Qy 18 42 CCCCATGATCGAACCA- 

CTTGTCCCT G AAG AG AAAAT G G AAAC C AAG AC GG AG T C C AG T G 19 0 0 



Db 17 79 TCAATTGATCTTCCCACCTTGACC TCCAA- 

AGTGCTGGG 1816 

Qy 19 01 GATAGAGACGGAACCCACCGTGCACCACCTGCCG CTT 

AGCACTGAGAAG 19 49 



Db 1817 ATTACAGGTGTGAGCCACCATGCCTAGCCT- 

C AGGG AAT T C T TAT AAG AAC T C T AT G AAG 1875 

Qy 1950 GTGGTGCAGGAGACC GTGTTGGTGG-AGGAGCGGCGT- 

GTGGTGCACGCGA 199 8 



Db 18 76 TAG— GCA— 

TCACCATCTTCTCTGTATCCATGGAAAGAGAGGCCTAGAGATGTATGCTA 1931 

Qy 1999 GTGGGGATGC TTCTTACTCGGC GGGAGAC — 

AGCGGGGATGCTGCAGCACAGCC 2 050 



Db 1932 AC TTGCCCAAGCTCACATAGCCCAGGGTAGCATAGCTGGGATGTTG- 

AGCTGAGGC 19 86 

Qy 2 051 C GCATTCACAGGCATTAAAGGGAAAGAGG — GCTCTGCTTGACG- 

GAG 2095 

I I I I I I I I I I I I I I I I I I I I I I I III 

Db 19 8 7 CGGAGGAGAAGTAGCAGTCGCTGGC 

AGAGCACACAGGCTGCTCTGGGGGATGAGCT 2 042 

Qy 2 096 GGGGC — TAAAGAGGAAGG — 

AGGGGAGGAGGTCGCTAAAGCTGTCCTGGAACAGGAAGA 2151 

I I I I I I I I III II II I I I I I I I I I I III II 

I I I 



Db 2 0 43 GGTGCGTTTAAGGAACAGGCCAGCACTGGCATTCGC — AAGCAGT — 

GGGGAAGGGGAGA 2 09 8 



Qy 2152 GACAGCCGCTGCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAGCC- 

ATC 2200 



Db 2 099 GA-TGCCGAGGTG GTCAGTATCCTGACTTTCAGAG 

GCCTTTTTTTGTTTGTT 2149 

Qy 22 01 C AC AT T T C AG AAAC T T T GG AAC AAAAAC CTCATTTT G A- 

GTCCTCAACGGTGAAGACGGA 2 2 59 



Db 2150 

TTAATTTTTGCTAGATTGATATTAAAAACTCATGTGGAGGAACTCAAGGAATGTTTAGAA 2209

Qy 226 0 AACCATCAGTTTTGGCAGTGTTTCACCGGGAGGAGTAAAGCTAGAAAT 

TTCCA 2312 



Db 2210 GACCAAAAGTCCC — CAATG ACAG 

G AAC AAAAGC AAC C AAT T T T T AAC T T T C T 2260 

Qy 2313 CAAGAAGTGCCAGTAGTTCACACCGAAACCAAAACCATCACATATGAATCAT 

CACAG 2369 



Db 2261 CTTCTCATTCC TGTTTTCATTG- 

CACATGTAGTCCTTTTGCTCAG 2310 

Qy 23 70 GTCG ATCCAGGCAC- 

AGATCTGGAGCCAGGCGTGCTGATGAGTGC 2413 



Db 2311 GAAGTCTTTGGGGAAATTAAGGATCTTTGAAGCTCT- 

G AAAT AGG T GAT C AGG T TAG T G - 236 8 

Qy 2 414 AC AG AC GAT C AC ATC TGAAACC ACC AG — T AC C AC C AC C AC T AC CCA 

CATCACCAA 2 46 7 



Db 2369 -GTGTCTGT — CAGCTG — 

T C T AAG AG G T T G G AAAAT G AAC TACT C AAG AT AG T C AC G AA 2 423 

Qy 2468 

AACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAATAGTCATCACGG- 
II I I I I I I I I I I I I I I I I I I 

Db 2 42 4 AATACTGAAAGTTTG-ATTTTTCTTTCCATATTTGA 

ATTAATTTTTTCTGTTTG 2476 

Qy 2 52 4 — GGGATGCAGACATTG ACCATGACC- 

AGGCGCTGGCTCAGGCAATTAAAGAGGCC 2 5 76 

I I I I I III III III I I I I 

Db 2 477 ACTGGAAGGGGTTTTTGTATAACTAAAACCTCAGCGC 

ATAAAGGAGATT 2 52 5 

Qy 2577 AAAGAGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGAC- 

AGAGAT 2 635 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



2523 

III I II 



1 1 1 1 1 1 

Db 2 52 6 TAAAAGGAGCA-CATGATTT AGTG GGTGGGCCAT 

G AAAC TAG AG AT 2570 

Qy 2 636 CACACCAGAAGATGGAGAGGA — 

TTGACCAGAGGAATAACTTAGCTTGCACATGAATGCA 2693 

II I I I I II I I I I I I I I I III II 

I I 

Db 2571 GGGATTTGGGGGTGAATTTGTC AATATC 

TGGATTTTAATCCA 2612 

Qy 269 4 GTCA TGCAAAC 

CGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAAC 2 7 46 

III I I I I I I I I I I I I I II I III I I I I III 

I I 

Db 2 613 GACATCTCTGCTAACGAGCCTTTGGTAAGTCACTTCAGATAC — 

TTTTCCTCCTTTTTAC 2670 

Qy 2747 CCA 

ACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTT 2 8 0 0 

I III III III I I I I I I I I I I I I I 

Db 26 71 AAAGAGAGGGCTGGCTTAGTTATTTGCCAAAGCCCCTTCCAGGCC 

TGAATTCCACAA 2 72 7 

Qy 2801 G AC C — AT T AAT AAAGAC AC TGGC AGAG AG AT C T T C C CATAATA 

AAGCAAT 2 8 49 

II I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 272 8 G T AC GAT T T AC T G TAG 

TGTCTTATCACTCTTTCATGTCACAATAGCGTGGAGCATT 2783 

Qy 2 850 C T G AT T C AGC AT C AC T AAAC C GAT AAT GC AT G A- 

AGCAACGATAAAATTACAAAAGAGCA 29 0 8 

III I I I I I III I I I I I I I I I I I I I II 

I 

Db 2 78 4 AGAGAAAAGC C TAG AC T T T TAG T TGATAGCCA- 

GTTGAAATATCATTGAT — A 2 833 

Qy 29 09 GCATTTTTAATTTTCACAAAATGTCTCAGTTTTC-AGC — 

TATACCTGCTCGTTCATAAC 2965 

I I I I I I III I I I I I I I I I I I I I I III II III 

Db 2834 

GAATTTTAGTTTTAGGAAAAATTGGTTTGATTTCTAGCTTTATTACTATTAGGT 2 8 8 7 

Qy 2966 CAACAATATAAACCGTGGTCTCATGTAACACATAAACAAT TCATGCCTT- 

TCATAGT 3 021 

I I I I I I I I I II I I I I I I I II I I I I I 

Db 2888 ATGTGAGCT-TGGGCAAAT 

CGCTTAATCTTTGAGTCTAGTTTTCTCTCAAA 2937 

Qy 3 022 T TAT TAT TAT TAAAGTCTAAA CAAAATTGCAATT — 

TCTTAGGTAACCTTATA 3 0 72 

I I III I I I I I I I II I I I I I I I I I I I 

I I I 

Db 293 8 ATGAGAAC AT T — 

AGGCTAAATGATTTCCGAGTTTCCAGCTAGTCCTAGAGTTCTATATT 2 995 

Qy 3 0 73 TTTACAATAAATGA AGATTACCCT-CAAATGCT — 

AGAAGCTGTCTAGGTCCGTCCG 3126 

I I I I I I I III I I I I I I I I I I I III I I I I I II 



I 

Db 2996 TCTAC-ATAGTTGAATTATTTTATCATGCTGTTGCTGGGGAATATGACTA- 

ACCCTTTTG 3053 

Qy 312 7 GTGTGTCAGATTTCC — TC-AGATTAGATGTGCCAATAACCAAGTT- 

TATTCAGTAAACA 3182 



Db 3 054 AAGCTACTAATTTTATGTCGAGCTTTAAAGT — CCATAA — 

TTGTTATCTTCAGAAAATA 3109 

Qy 3183 — ACTTGTACT TGTTTC 

ATCTGGTTTATTACTCTCACC C AT AAAC AGT AATGA 3233 



Db 3110 TTATTTGACCTACAGTATGTCCAAATC- 

AAT T T AAT AAAAT C GC T T T AT AAC AGG AAAAA 3168 

Qy 3234 CTCTCTGACCCTCTGGAAATATGTAATGCTTCCAATCTTGCTTTGTGTATCTC 3286 

I I I I I I I I I I I I I I III 
Db 3169 AAAAAAAAAAAAAAAAAAAAAAAAAAGGCCAC ATGTG CTC 32 0 8 
Qy 5 CGCGCCCG AGCC

TAGTCCCCACGCCGCGGCG 35 



Db 3209 

CGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCTGTTATAAAGCG 3150 



Qy 



36 C- 



-GCCCGGGCTCCCTGCTGATCCCAGAACAATCAACCATGACG— 



ACC 8 0 



Db 3149 ATTTTATTAAATTGATTTGGACATACTGTAGGTC 

AAATAATATTTTCTGAAGATAAC 3 093 

Qy 81 GAATCTGGA TCAGACTCGGAATCCA AGCCGGACCAGG 

AGGCCGAGCCC 12 8 



Db 3092 

AATTATGGACTTTAAAGCTCGACATAAAATTAGTAGCTTCAAAAGGGTTAGTCATATTCC 3033

Qy 129 CAGGAGGCGGCGGGGGCGCA-GGGGCGGC — 

GGGGGCCGTGCCGGAGCCGCCCAAGGAGG 185 

I I I I I I I I I I I I I I I 

Db 3 032 C C AGC AAC AGC AT G AT AAAAT AAT TC AAC T ATG T AG AAAT AT AG AA 

CTCTAGGA— 2979 

Qy 186 AGC-AGCAGCAGGCCCTGGAG- 

CAGTTCGCCGCCGCTGCAGCGCACAGCACCCCGGTGCG 2 43 



Db 29 78 — CTAGCTGGAAACTCGGAAATCATTTAGC CTAATGTTCTCA- 

TTTTG 2934 

Qy 2 44 AGGGAGGTC ACTGACAAGGAACAGGAGTTTGCTGCCAGGGCT- 

GCAAAACAGCTCGA 299 



Db 2933 AGAGAAAACTAGACT — CAAAGATTAAGCGATT — 

T G C C C AAG C T C AC AT AC C T AAT AG T 2 8 78 

Qy 300 ATATCAGCAATTAGAAGACGATAAACTTTCTCAGAAATC- 

ATCTAGCAGTAAACTC 354 



Db 2 8 77 AATAAAGC 

TAGAAATCAAACCAATTTTTCCTAAAACTAAAATTCTATCAATGATATT 2 821 

Qy 355 T CTCGGTCTCCATTAAA GATTGTCAAAAAGCCTAAAAGC 

ATGCAGT 40 0 



Db 2820 

TCAACTGGCTATCAACTAAAAGTCTAGGCTTTTCTCTAATGCTCCACGCTATTGTGACAT 2761

Qy 401 GCAA-AGTGAT ACTTC TCG ATGGA — 

TCAGAATATACCT 436 

I I I I I I I I I I I I I III I I I I I I I I 

Db 2 76 0 G AAAG AG T GAT AAG AC AC T AC AG T AAAT C G T AC T T G T G G AAT T C AG 

GCCTGGAAG 2 706 

Qy 43 7 G T GAT G T AGAGAAACGC T 

CCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACT 493 

I I I I I II II I I I I I I I I I I I I I I 

Db 2 705 GGGCTTTGGCAAATAACTAAGCCAG 

CCCTCTCTTTGTAAAAAGGAGGAAAAGTATC 2650 



Qy 



494 



TGAACTTGCTAGAGAAAGACTACTTTGGGCTTACGTATCGAGATGCTGAAAACCAGAAGA 553 

I I I I I II I I I I I I I I I II I III III I I I I 

Db 2 6 49 TGAAGTGACTTACCAAAGGCT-CGTT-AGCAGAGATGTCTGGAT — 

TAAAATCCAGA — T 2 596 

Qy 55 4 ATTG GTTGGACCCTGCTAAGGAAATAAAAAAACAGGTT 

CGAAGTGG 599 

I I I I II I I I I I I I I I I I I I 

Db 2595 ATTGACAAATTCACCCC — C 

AAATCCCATCTCTAGTTTCATGGCCCACCCACTAA 2543 

Qy 600 TGCTTG-GCACTTT TCATTTA ATGTGAAATT — 

T T AT C C AC C AG AC C C T 6 45 

I I I I I I I I I I I I I I III II I I I I I I 

I I I I 

Db 2542 

ATCATGTGCTCCTTTTAAATCTCCTTTATGCGCTGAGGTTTTAGTTATACAAAAACCCCT 2483



Qy 646 

GCCCAACTATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGCGAGATGACATC 705
II I I I I I I I I I I III I II I 

Db 2 482 T C C AG T C AAAC AG AAAAAAT T A 

ATTCAAATATGGAAAGAAAAATCAAACTTTCAGT 2 42 7 



Qy 706 GTGTCCGGAAGGCTGCCCTGC TCCTTTGTTACCCTGGCCTT- 

GCTGGGCTC 755 

I I I I III III I I I I I I I I I III 

Db 2 426 ATTTTCG — TGACTATCTTGAGTAGTTCATTTTCCAACCT 

C T T AG AC AGC T G AC AG A 23 72 



Qy 756 C T AC AC T G T C C AG TCAGAGC-TCGGAGA — CT — 

AT G AC C C AG AT G AAT G 800 

I I I I I III I I I I I I I I I III II II III III 

I 

Db 2371 

CACCACTAACCTGATCACCTATTTCAGAGCTTCAAAGATCCTTAATTTCCCCAAAGACTT 2312 
Qy 801 TGGGAGC 

G AT T AC AT TAG T GAG TTCCGCTT T GC ACC AAACC AC AC T AAAGAAC T G 855 

I I I I I I I I I I I III I III III I I I I 

Db 2311 CCTGAGCAAAAGGACTACATGTGGGAAAT CAATGAAAAC — 

AGGAATGA 2265 

Qy 856 GAAGACAAAGTGATCGA GCTGCACAAGAGCC — ACAGAGGAATGAC 

GCCAGC 9 05 

I I I I I I I I I I I I I I I I I II II II III II 

I 

Db 226 4 GAAGAGAAAGTTAAAAATTGGTTGCTTTTGTTCCTGTCATTGGG — 

GACTTTTGGTCTTC 2207 



Qy 9 06 AGAAGCAGAGATGCATTTCTTGGAAAATGCCAAA 

AAATTATCAATGTATGGGG 958 

II I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 22 06 TAAA CATTCCTTGAGTTCCTCCACATGAGTTTTTAATATCAATCTA- 

GC 2159 

Qy 959 TAGATTTACATCATGC 

T AAG G AC T C AG AAG G G G T AG AAAT T AT G T T AGG AG T T T G T 1014 

I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2158 AAAAATTAAAACAAACAAAAAAAGGCCTCTGA- 

AAGTCAGGATACTGA 2112 



Qy 1015 GCAAGT- 

GGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTGCCTGGCCCAA 10 73 

II III II I I I I I I I I I I I I I I I I I I I II 

Db 2111 

CCACCTCGGCATCTCTCCCCTTCCCCACTGCTTGCGAATGCCAGTGCTGGCCTGTTCCTT 2052
Qy 1074 

GGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCCGGCCGGGAGAGTT 1133 

I I I I I I I I I I I I I I I I III III III 

Db 2 051 -AAACGCACCAGCTCAT-CCCCCAGAGCAGC CTGTGTGCTCT-GCC 

AGCGAC 2 0 03 

Qy 113 4 T G AAC AAT T T G AAAGC AC C AT TGGG T T T AAGC T - GC C AAAC CATC G A- 

GCTGCCAAGCGT 1191 

II I I I I I I I I II I I I I I II I I I I I I I I I I 

Db 2002 TG — CTACTT CTCC- 

TCCGGCCTCAGCTCAACATCCCAGCTATGCTACC 195 7 

Qy 1192 T T AT GG AAAG TATGTG-TT G AGC AT CAT AC AT T T T T C AG AC T AC T GT T AC C A- 

GA 1244 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1956 — CTGGGCTATGTGAGCTTGGGCAAGTTAGCATACATCTCTAGGCCT- 

CTCTTTCCATGG 1900 

Qy 12 45 AGCACCTCCCAAGAA ATTCCTA — ACCTTGGGTTC 

CAAGTTTCGTTATA 1291 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1899 A 

TACAGAGAAGATGGTGATGCCTACTTCATAGAGTTCTTATAAGAATTCCCT 18 4 8 

Qy 1292 GTGGCAGGACAC — 

AAG C G C AAAC G AG AAG AG CCAGTGCGTT GAT AG AT C GC C C AGC AC C 13 49 

I I I I III I I I I I I II I I I I I I I I I I I I 

Db 18 47 GAGGCTAGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGG- 
AGGTC 1799 

Qy 1350 TTACTTTGAACGCTCAT CCAGCAAACGTTATACCATGTCTCGCAGCTTGG 

AT 1401 

I I I I I I I I II II III I I I I I I I I I I I 

I 

Db 179 8 -AAG G T G G G AAG AT C AAT T G AGC C — CAGGAGTT CAAGAC 

CAGCCTGGGCAAC 17 49 

Qy 1402 GGAGCATCAGTGAATGAAAACC — 

ATGAAATATACATGAAGGATT C 1445 

III I I I I I I III II III III II 

I 

Db 1748 

ATAGCGAGAGTGCATCTCTACCAAAAGTGATTTTTTTGATTTTTTGGGACAGAGTCTCGC 1689



Qy 1446 TATGTCTGC 

TGCAGAGGTTGGTACTGGCCAGTACGCCACAACAAAAGGCA 1495 

I I I I I I I I I I I I I I I II 

Db 16 8 8 TCTGTCACCCAGGCTGGAGTGCAGTGGTGTGATCT 

CGGCTCACTGCAACTTCTG 1635 



Qy 1496 TCTCTCA-GACCAACTTGATCACC — 

ACTGTGACTCCG G AG AAG AAG GCTGAGGAG G AGC 1552 



Db 1634 

CCTCCCAGGTTCAAGCAGCTCTCATGTCTCAGCCTCCCGAGTAGCTGGGATTACAGATGC 1575

Qy 1553 GGGACGAG G AAG AG G AC AAAC GG AGG AAG G 

GGGAAGAAGTCACGCCCATCTC 16 0 4 

II I I I I I II I I I I I I I I I I I I I 

I I 

Db 1574 CTGCCACCATGCCAGGCAAATTTTGGGTTGTTTTTTTTGAGATGGAGTCTTGCTC- 

TATC 1516 



Qy 1605 

GGCCATCCAGCACGAGGGAAAGACTGACAGTGAGCGCACGGACACCGCAGCCGACGGGGA 
I I I I I I I I I I I I I I I I I I I I 

Db 1515 AG CAGGCTGGAGTG CAGTG 

GCGCGATCTCGGCTCACTGC 1477 



Qy 1665 

GACCACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCTGATGAAACATCAAACCAA 1724
I I I I I I I I II III I III I III II 

Db 1476 AACCTCTGACTCCCGGGTTCAAG-CGATTCTCTTG CCT 

CAGC 1436 



Qy 1725 CATTAGCG-AGCTGAAAAGAACCTTCTTAGAAAC- 

CTCAACAGACACTGCCGTAACGAAT 1782 
I I I I I I I I I 



Db 1435 C T C T T GAG T AGC TGGGA — 

TGGCTAAT 13 8 8 



I 

-CTAC- 



I I I I I I II I I I I I 
•AGGCGCGCACC — ACCATGCC- 



Qy 1783 GAATGGG AGAAGA- 

CCCCCGTGCGAC 1819 

III I I I I I 



GGCTTTCCACCT

I I I I I I III I I I I I 

13 8 7 TTTTGTGTTTTTAGTAGAGATGGGGTTTCACCCTGTTGGCCAAGAGGACCCCC- 



Qy 182 0 TGGCC — GCCA GGC AGGAGGATGCC C C C AT GAT C G AAC C AC T T G T C — 

CCTGAAG 18 70 

I II I I I I I I I I I I I II I II I I I I I I 

Db 133 0 TTCCCAAGCCATAGGGGGCTGGTGG — GC AGGGCGG — 

CACTTAGCTGCCTG 1283 



Qy 1871 

AGAAAATGGAAACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCACCAC — 192 8 

II I I I I I I II III II III III 

Db 12 82 GGGCACCCAGTACCTGTCATGAGGCACTTGCCCTGGCCAAAGGT- 

CACTGCCA 1231 



Qy 1929 CTGCCGCTTAGCACTGAGA AGGTGGTGCAG-GAGACC- 

GTGTTGGTGGAGGA 19 78 

I I I I I I I I I I I I I I I I I I I I I II 

Db 123 0 AGGCACAGCCACT 

CCAGAGAGTCAAGAGGAGCAGAGACTCCAGAGCAGATGGA 117 8 



Qy 19 79 GCGGCGTGTGGTGCACGCGAGTGGGGA- 



TGCTTCTTACTCGGCGGGAGACAGCGGGGATG 2 037 

I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 1177 -AGGAGAGTGG GCTTCTGGGGATTGC ACCAAGGAGGAAG- 

CAGAAGGGATG 1129 

Qy 2038 

CTGCAGCACAGCCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGG 2097

I I I I I I I III I I I I I I I I I I I III 

I I I 

Db 112 8 -TGCAGCA GCAGAAAAGG — AGGAGGAC 

CGGGGGG 109 7 

Qy 2 09 8 GGCTAAAGAGGAAGGAGGGGAGGAGGTCG 

C T AAAG CTGTCCTG G AAC AG G AAG A 2151 

III II I I I I I I I I I I III III II I I I I I 

I I 

Db 1096 TTCTGA — 

GGGCTTGGGGTGATGAGGCCAGAGTTCCAAACAGGGTCTGGGTGCAGGATGA 1039 

Qy 2152 GACAGCCGCTGCTTCCCGTGAGCGACAAGAGGAGC-AGAG — 

TGCAGCCATCCACATTTC 22 0 8 

I II I I I I I I I I I I I I I I I I I I II I I I I 

I I I I 

Db 103 8 G — G AC AG AAG AAAAC AG AG AAC - C C AAAAG G G G C AAG AG GTTGGTCCCTTAC 

TTTC 985 

Qy 2209 

AGAAACTTTGGAACAAAAACCTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCAG 2268

I I I I I I I I I I I I I I I I II I I I I I 

Db 984 ACTCTCCTGGGAGGAGGA TGGGTACACCACACAGTTGGATGGTG- 

CATC— 937 

Qy 2269 

TTTTGGCAGTGTTTCACCGGGAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGCCAGTAG 2328

I I I I I I I I I I I I I I I I I II I III I I I I I 

Db 936 -TTTGCCTGT-CTGCACC TGCAGCT — CCATGGCACCATG — 

GACCCTGT — 893 

Qy 232 9 TTCACACCGA AACC — AAAACCATCACATATGAATCATCACAGGTCG — 

ATCC 2377 

I I I I I II I I I I I I I I I I I I I I I I I I 

I I I I 

Db 892 GCTCCGAGTGGTGGTGCCTGATCTCCACCACC- 

AGGCAGCAGTAGAGGCCGCTATCC 83 7 

Qy 23 7 8 AGGCACAGATCTGGAGCCAGGCGTGCTGATGAGTGCACAGACGATCACA 

TCTGA 2 431 

I I I I I I I I I I I I I I I I I I I I I I I I II II 

I I 

Db 836 A-GCA-GGGTCAGGTTGC — GCATGGTGATGG 

AGAAGTTGCCATGGTGGTCGGA 78 7 

Qy 2 432 AAC CACCAGTACCAC C AC C AC T AC C C AC AT C AC C AAAAC T G T G AAAG G 

GGGCATTTC 2 48 8 

II I I I I I I II III I I I I II I I I I 

Db 786 GGCC GACTCCAGCCCGTGGCGCTGAGCCAGGTC- 

GTGGCTGGTGTTGGCAGCCT 73 4 



Qy 2 489 AG AG AC AAG AAT T G AG AAGC G AAT AG TCAT — CACG- 



GGGGATGCAGACATTGACCATGA 2 5 45 

III I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 733 GGTGGCCTCCATGGTGCAGGTGAAGGTCCTGGAACGTGAGGTTGCGG 

ATGG 6 83 

Qy 2 5 46 CCAGGCGCT-GGCTCAGGCAATTAAAGAGGCCAAAGAGCAGC 

AC C C T G AC AT G T C AG 26 01 

I I I I I I I I I I I I I III I I I I I I III I I I I I 

Db 682 

GCCGGCGCTCTGAGCAGGTCTGCACCTCGCCCCTCGAGCTGCGGTACCACGTCTTGTAGA 623 

Qy 26 02 T G AC C AAAG TAG T G 

G T C C AT AAAG AG AC AG AG AT C AC AC C AGAAGATGGAGAGG 2 655 

I I I I I I I I I I I I I I I I I I I I I I III I I I I I I 

I 

Db 622 AGGTCACA-TCGTGCCCTTTGTCC — ACAGGGCCCAAGAGC- 

CTGCAGGTGAGGGTGACG 56 7 

Qy 2656 AT- 

TGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAAACCGTTAGGAAA 2 714 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 566 TTCTGCCCCTCGGGACA GACATACA-GGGAATACGG- 

CGTGGCGACCTTGAAGGCT 513 

Qy 2 715 ACCA — GAGCCT 

ATATGGAGTTCCCTCTTCTAACCCAACTGTACTTGTATCTG 2 765 

III I III I I I I III I I I I I I I I I I II 

I I 

Db 512 GCCACCGGACCTGCTCAGAGAGAGGAGAGCCC TGTCACCTGACTGATCCAGTG- 

CTA 457 

Qy 2 766 TCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATTAATA 

AAGACACTG 2 82 0 

I I I I I I III II II I I I I I II I II I I I I I 

I 

Db 456 CTCCTGGCAAC-CCCAGAGCGG — 

GGAAGAACCTTAGCCTCTGCTACTTCCCAGCCACAG 400 

Qy 2821 

GCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAAACCGATAATGCAT 2880

II II III I I I I I I I I I I I I I I III 

I 

Db 399 GCCAGGGG CAT GTCAGCTGA CCTCCCTAAGAC 

TGCCT 363 

Qy 2881 

GAAGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTCAGTTT 2940

I I I I I I I I I I I I I I I 

Db 362 TCTGCATT 

TGTTCACAAA 3 45 

Qy 2941 

TCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACATAA 3000

II III III II II I I I I I I I I I I 

I I 

Db 3 44 CCCCACTC CAT — CC 

TAGGCTGTTGTAGGGGGAGTCACAGAA 3 05 



Qy 3001 AC AAT TCATGCCTTT C AT AGT T TAT TAT T — 



AT T AAAG T C T AAAC AAAAT TGC AAT T T C T 3 058 



Db 304 GC — TTC CCCAACTAGGGGTGCTCCTGGATGCTGGACAGGGCAG 

G GAG AG ACT 254 

Qy 3059 

TAGGTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGAAGCTGTCTAGG 3118 



Db 2 53 TGGAGGGCAGTG ATGCAGTC GGGTGTGTCCCTC— TTCCCAGTAG- 

AGTATATG 2 03 

Qy 3119 TCCGTCCGGT 

GTGT C AG AT T T C C T C AG AT TAG AT G TGCC AAT AACC AAGT T TAT T C A 3175 



Db 2 02 CAAAACCTGTTGGGTGGGAGG CAGGTTGGG- 

G AG C AG G AAG GCTGTTTC G AAC A 150 

Qy 3176 

GTAAACAACTTGTACTTGTTTCATCTGGTTTATTACTCTCACCCATAAACAGTAATGACT 3235 
I II I I I I I I I I I I I I I I I I I I I I I I 

Db 149 G-CAGCTCCTTGTCCTTGGCCGCTCTG 

TAGGGGCCATCAACA C 108 

Qy 3236 CTCTGACCCT CTGGAAATATGTAATGCTTCCAAT- 

CTTGCTT 3276 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 107 

CTCTGCCCCTTCCTACTCCTAGGACTGAACATGCCTTCCCATCCTTGCCAGGGAGCACTA 48

Qy 3277 — TGTGTATCT-CATTTAATT TGTTCCGGTTAAGG 3308 

I I I I I I I I II I I I I III 

Db 4 7 AGAGAGCACCTGCGTGTGCAAGGCCTGGGCCAGT — AGG 11 
Qy 1 GC GGCGCGCCCGAGCCTAGTCCCCACGCCGCGGCGCGCC— 

CGGGCTCCCTG- 50 



Db 1 GCCTACTGGATTCGCC — 

ATCATTTTGCACACTGCGAGGGGCTCCGTGTGTGCGCCCTGT 5 8 



Qy 51 CTGATCC — 

CAGAACAATCAACCATGACGACCGAATCTGGATCAGACTCGGAATCCAAGC 10 8 



Db 59 CTTGTCTGGCCGTCC— TCATCCCTTCCCCCACCCCCTG CCGCCAC 

TCCGACC 109 

Qy 109 CGGACC — AGGAGGCCGAGCCCCAGGAGGCGG- 

CGGGGGCGCAGGGGCGGCGGGGGC — C 163 



Db 110 CGCTCCCAAAGTGGCT TCACAATAGTCGGTC 

GGCGGTGTAGGCTGC 158 

Qy 164 

GTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCC 2 0 0 

I III I I I I II I I I I I I I 

Db 159 

GCACCAGGTCCACACTTAAGCGAAATCAAGGAGACCCCCTTCTACTTCTACCTTTGGGTT 218 

Qy 2 01 TGGAG — CAGTTCGCCGCCGCTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCA — 

CTG 256 



Db 219 TGGTGCTCAATGCGAAGCTGCTGCAACTCAGA-CACGCC 

TAAGTCAACTCATGCAG 2 73 

Qy 257 

ACAAGGAACAGGAGTTTGCTGCCAGGGCTGCAAAACAGCTCGAATATCAGCAATTAGAAG 316 



Db 2 74 AAAAAG G AG AAAAG T T T TGGTATACAAA — TGCTCTCAGTCCAGC- 

CAG 319 

Qy 317 AC G AT AAAC T T T C T C AG AAATC ATC T AGC AG T AAAC — 

CTCGGTCTCCAT 367 

II III II I I I I I I I I I 

Db 32 0 ACACCAAGC 

CGAAAGGTTGTGCTGGCTGCAACCGAAAGATCAAGGACCGGTATCTTC 3 76 

Qy 36 8 TAAAG ATTGTCAAAAAGCCTAAAAGCATGCAG 

AAAGTGATACTTCTCGAT 42 0 

I I I I I I I I I I I II I I I I I I I 

Db 3 77 TAAAGGCACTGGACAAA 

TACTGGCATGAAGACTGCCTGAAGTGTGCCTGC 426 



II I I I I I I III 
CTC— 



TCT 

II I I I I I I 



— TGC — 

I I I I I I I I III 



Qy 



421 GGATCAGAATATACCT — GTGATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTT- 



TGAT 477 



Db 42 7 TGTGACTGTCGCTTGGGGGAGGTGGGCTCCAC- 

CCTGTACACTAAAGCTAATCTTAT 482 

Qy 478 

AAAGTGTGTGAACACTTGAACTTGCTAGAGAAAGACTACTTTGGGCTTACGTATCGAGAT 537



Db 4 83 CCTTTGT CGCAGAGACTATCT GAGGCT-CTTTGG 

TGTAACGGGAA 52 6 

Qy 53 8 GCTG-AAAACCAGAAGAATTGGTTGGACCCTGCTAAGGAAATAAAAAAAC- 

AGG- 589 



Db 52 7 ACTGCGCTGCCTGTAG — 

TAAGCTCATCCCTGCCTTTGAGATGGTGATGCGTGCCAAGGA 5 8 4 

Qy 59 0 TTCGAAGTGG— TGCTTGGCACTTTTCA 

TTTAATGTGAAATTTTATCCA 636 

III III I I I I I I I I I I I I 
Db 585 CAATGTTTACCACCTGGACTGCTTTGCA — 
TGTCAGCTTTGTAATCAGAGATTTTGT 639 

Qy 63 7 CCAGACCCTGCCCAACTAT — 

CTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGC 694 



Db 6 40 G T T GG AG AC AAAT T T T TCC T AAAGA- AT AAC AT GAT 

AC 686 

Qy 695 GAGATGACATCGTGTCCGG 

AAGGCTGCCCTGCTCCTTTGTTACCCTGGCCTT 746 



Db 687 G 

GACTACGAGGAAGGTTTAATGAAAGAAGGTTATGCACCCCAGGTTCGCTGATCT- 7 41 

Qy 747 GCTGGGCTCCTACACTGTCCAGTCAGAGCTCGGAG- 

AC TAT G AC C C AG AT G AAT G T GGG A 8 05 

I I III I I I I I I I I I I I I I I I I 

Db 7 42 ATCAACATCAC — CCC AT T AAGAAT AC AGAGC AC T AC AT T C T — 

TTTATCTTTTTT 793 

Qy 8 06 GC G AT T AC AT TAG T GAG TTCCGCTT T GC AC C AAAC C AC AC T AAAG AAC - 

TGGA 857 



Db 794 GC 

T C C AC AT G T AC AT AAGAAT T G AC AC AGG AAC C T AC TGAAT AGC G T AG AT AT AGG A 85 0 

Qy 858 AG AC AAAG T GAT 

CGAGCTGCACAAGAGCCACAGAGGAATGACGC 901 



Db 851 AGGCAGGATGGTTATATGGAATAAAAGGCGGACTGCATCTGT- 

ATGTAGTGAAATTGCCC 9 09 



III III III 
CCTTTGC CAG— 



Qy 902 

CAGCAGAAGCAGAGATGCATTTCTTGGAAAATGCCAAAAAATTATCAATGTATGGGGTAG 961 



Db 910 CAGTT C AG AGT TGAATGT T TAT TAT T AAAGAAAAAAG T AAT G T AC AT AT - 

GGCTGG 964 

Qy 962 ATTTACAT-CATGCTA AGGACTCAGAAGGGGT- 

AG AAAT T AT G T T AGG AG T - T 1011 



Db 965 

ATTTTTTTGCTTGCTATTCGTTTTTGTGTCACTTGGCATGAGATGTTTATTTTGGACTAT 1024

Qy 1012 TGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGAT 

TTGCCTGG 1068 

III I III I I I I I I I I II I I I I I I 

Db 1025 TGTATATAATGTATTGTAATATTTGAAGCACAAATG 

TAATACAGTTTTATTGTGTTA 1081 

Qy 1069 CCC — 

AAGGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCCGGCCGG 112 6 

II I I I I I I I I I II I I I I I I I I 

Db 10 82 CCATTTGTGTTCCATTTGCTTCTTTGTATTG 

TTGCATTTAGTACAATC 1129 

Qy 112 7 GAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCC 

AAACCATC 1177 

I I I I I I I I I III I I I I I I I I II 

Db 113 0 -AGTGTTTAAACTTACTGTAT 

ATTTATGCTTTCTGTATTTACCAGCTATT 1178 

Qy 1178 

GAGCTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGAC 1231 



Db 1179 TTAAATGAGCTG— TAACTTTCTAGTAAAGAAT- 

TGAAAAGCAAATCTCACTAATGATAC 1235 

Qy 1232 T AC T G T T AC C AG AAGC A CCTCCCAAGAAATTCCTAACCTTGGGTTC 

CAAG 1281 

I I I I I I I I I I II I I I I I I I I I I 

Db 1236 - 

ACAGATAGATAAAGCAAGTCTATCAACATTAAAAATACTAAAAAATAAAGACACACACA 1294

Qy 12 82 TTTCGTTATAGTGGCAGGACACAA GCGCAAACGAGAAGAG 

CCAGTG— C 1328 



Db 1295 G AGC AT T T TAG T G AC A- 

TCCACTACTTATTGCCGCTATGAGTTAGAGTCTATCAGTGTTC 1353 

Qy 132 9 GTTGATAGATCGCCC AGCACCT- 

TACTTTGAAC 136 0 



Db 1354 - 

TTGTTATAACCCCCTATTTTCAGGGGGTTAAAAATCAGCTTTAAAAAAATACATAAAAA 1412 



Qy 1361 GCTCATC 



CAGCAAACGTTATACCATGTCTCGCAGCTTGGATGGAGCATCAGTGAATG 1417 

I I I I I III I I I I I III I I I I I I III III 

I 

Db 1413 TTTCATCTTAAAGC — ACTTT C AT T T TAT AC C AAC G T G AAAAG T G C — 

CATTTTTAG 1465 

Qy 1418 AAAAC CAT G AAAT AT AC AT G AAGG AT T C T AT G T C T G C T G C AG AG G T T G - 

GTACTGGCCAG 14 76 

I I I I I I I I I I I I I I I I II II I I I I I I I 

Db 1466 AAT AAC T T T AAAGC T T AA — CAGGTTTCCTTTTAATATCCTTTTTTTGTGTGCT- 

CTT 1520 

Qy 1477 T AC - GC C AC AAC AAAAGGC ATC TC TC AGAC C AAC T T GAT C A- C C AC T G T G AC T C C 

GG 1531 

III III I I I I I I I I III I I I I I I I I I I I 

I 

Db 1521 TACTTACAC AATGGCTTTGTTTTG CTTTTTCAGCCAC 

ACCCCTTATG 1567 

Qy 1532 AGAAGAAG-GCTGAGGAGGAGCGGGACGAGGAA — 

GAG G AC AAAC G G AG G AAG G G G G AAG 158 8 



Db 156 8 TGAACTAGTGCCTTTGGGTATC 

ACGTAAAATTTTTTCCAAAGGGTTACTTTAAAAAT 1624 

Qy 1589 AAGTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGC — 

GCACGGA 16 46 

I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1625 CTGTTAC- 

CACAATTATGAGATGATTTTTAAGTGATAAATTAAACTTCTTCTTGTATAAA 16 83 

Qy 16 47 CACCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCT 

AG AAAAAAC T C AAG AT G A 1703 

III II I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1684 TTCTGC— CC AGATCTCT-CCAC— 

AAGAGCTGAGGGTTTCATAACTTTATGG 1731 

Qy 170 4 C C T GAT G AAAC AT C AAAC C AAC AT T AGC G AGC T G A- 

AAAGAACCTTCTTAGAAACCTCAA 1762 

I I I I I I I I II II I II III I I I I I I I I I 

I 

Db 1732 CTTAAT-AAATGTATGAC — 

AC T G AAAAG AT T T GAG T G T G AAT C T AC TGAAATC AC TATA 17 8 8 

Qy 1763 CAG- 

ACACTGCCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCCCCGTGCGACTG 1821 

I I I I I I I III III I I I I I I III I I I I I 

Db 1789 

ATGCACATTGAAGCTATGATGGTATTTGAGTAGTGAGGTTACTTTTGATCGGAGCAAC— 18 46 

Qy 1822 

GCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAA 18 75 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 18 47 ATAATGCTC ATAGAA — 

TCTTC TAG AAG AAG AG AAAC AAAGG 18 86 



Qy 18 76 -ATGGAAACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCA- 



CCACCTGCC 1933 



Db 1887 G AT T G AT AAAAT GC T G AG AAC TAG T GAT TATATATTTTTC 

TGTATTTACCTGAC 19 40 

Qy 1934 

GCTTAGCACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCA 1993 



Db 19 41 ATTTA TTTTAATGTTCAAAAAGTAAACACTTT 

AAGTTTGATGTGTTTTACT 1991 

Qy 199 4 CGC GAGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGC — 

AC 2 0 46 



Db 1992 CTCTCATTGTTTTAAGTAATTGCCAACT CAGAATACATC 

ATTCTTAGGCTGAA 2 0 44 

Qy 2 0 47 AGC C C GC AT T C AC AGGC AT T AAAGGGAAAGAGGGC T — C T 

GACGG 2 093 



Db 2045 ATTTGTCTTTC- 

CATTTTTTAAGGTGAAATAGTACTACCTTACGTGATAGCATACAAAGA 2103 

Qy 2 09 4 AGGGGGCTAAAGAGGAAGGAGGGGAGGAGGTCGCT 

AAAGCTGTCCTGGAACAG 2146 



Db 2104 

AGAAAGCTCTAGAAAGAGAAATTATGGAGAATGATTATTTAAATTACAATTAAGGAAATG 2163

Qy 2147 G AAG AG AC AGC C GCTGCTTCC-CGTGAGC G AC AAG — AGGAGCAGAG 

TGCAGCC — 219 7 

I I I I I I I I I I I II I I I I I I 

Db 216 4 AGAATATGATCCCCT- 

CTTCCGAGTTGCCCACAAACTTGCTTCTTTGCTTTTGCTCCCTG 2222 

Qy 219 8 AT C C AC AT T T C AG AAAC T T T 

GG AAC AAAAAC CTCATTTT GAG T 22 40 

I I I I I I I I I I I I I III 

Db 2223 

TAATAGAACTACTTTTCAACAAATCTAATTTTGCACGGCACCGTTAACCATATTTT 

Qy 22 41 C C T C AAC G G T G AAG AC GG AAAC CAT C AG T TTTGGCAGTGTT- 

TCACC 22 86 



Db 2 2 79 -CACTACAGCAAACTTAG- 

TGCTATGGGTTTTCTTTTTCTTTGTTTTTTTCTTGATCACT 2336 

Qy 22 8 7 GGGAGGAG T AAAGC TAG AAAT T T C C A- C AAG AAG TGCCAGTA- 

G T T C AC AC C G AAAC C AA 23 4 4 



Db 233 7 TGTATAGGAAAC AACATTTTCCAGTGTTATTTGCATATATATTTTGTCC — 

TTCCAA 2391 



■-GCTT — 
I I I I 



I I I I I I I I I 

— 2278 



Qy 23 45 AAC CATCACATATGAATCATCACAGGTCGATCCAGG 

CACAGATCTGGAGCC 2395 

I I I I I I I I I I I I I I I I I I III I II 

Db 2392 TATATGCATTACAGATGAA-AATTAAATGTTATACCTGAATTCTTGGGT- 

TGGGGCCAAA 2449 

Qy 2396 AGGC GTGCTGATGAG TG CACAG 

AC GAT C 2 423 



Db 2450 

ATATTAAGCTGAAAATAATGCTGGTGTGGATTTGTTTTAAAACAAAGCTTTATTATGAAC 2509
Qy 2424 A— 

CATCTGAAACCACCAGTACCACCACCACTACCCACATCACCAAAACTGTGAAAGGGG 2 4 81 



Db 2510 ATGCATGTGAATCTG — 

GATATTGCCTCTTATTTTTAAGAAAATGGTTCTGTGAAAAGTG 2 56 7 

Qy 2 482 GCATTT CAGAGAC — AAG AAT T G AG AAGC G AAT AG T — CAT 

CAC- 2521 



Db 2568 

AATGATATGTATTTTTCCAAATGCTTCATGGTTAGGAGTCTTCAAGTTCCATGTTCCCCA 2627

Qy 2522 GGGGGATGC-AGACATTGACCATGACCAGGCGCT GGCTCA- 

GG 2562 

II I I I I I I I I I I II III I I I I I 

Db 2628 

GATTTGAGATATACTAAAGAAAGAAATTCAAAAGTAGCTATTTGGGGCCCACAAAAATAA 2687

Qy 2 563 CAATTAAAGAGGCCAAAGAGCAGCACCC 

TGACATGTCAGTGACCAAAGTAGTGGTCC 2 619 



Db 26 8 8 CTATTATTTTAGCCTTAGAGCCTTACACTTGTTTCATG- 

AAGAGAAAGGAC T TGC AT AAC 2 7 46 

Qy 262 0 ATAAAGAGACAGAGATCACACC 

AGAAGATGGAGAGGATTGACCAGAGGAATAAC 2673 

I I I I I I I I I II I I I I I I I I 

Db 2 747 CAAAATAAACAAAG — 

CAAGACAAATTAAAAATATGTGGGGGAGAGATCAGTGAAAAG — 2 8 02 

Qy 26 74 T T AGC T TGC AC ATGAATGCAGTCATGC 

AAACCGTTAGGAAAACCAGAGCCTATATGG 2 73 0 



Db 2 8 03 -TGGTTTTC TTAATGCAGCCCTGCTGGTCCCCATT 

TTGA 2849 

Qy 2 731 AGTTCCCTC — TTCTAACCCAACTGTACTTGT 

GTCCGTGGAAAATTTCAG 2 782 



Db 2 850 AATTCACATGGATGTAA 

AATTATAATTGTCAGGATCTTATTCAGATGATCTTTTAAG 2906 



-AACAATTGC 



ATCT — 

I I I I III II III 



Qy 2 7 83 — TCCAGAAGAATTGACCTTG ACCATTAATAAAGACACTGGCA- 

GAGAGAT 2 83 0 



Db 2907 

GTTTAACTGGTTTTGCTTTTGTTTATCTATATGTCAAAATACTTGTAAATTGGGAACAAA 2966

Qy 2 831 CTTCCCATAATAAAGCAATCTGA TTCAGCAT CACT 

AAACCGATAA 2 8 75 



Db 2967 CTTCTC 

TCAGCTTCTTGAAGTTGTTCAACTATCCTTGCCACTGGAAGACCAAACA 3 021 

Qy 2 8 76 TGCATGAAGCAAC-GATAAAATTACAAAA GAGCAGCATTTTTAA- 

TTTTC 2923 



Db 3022 

AGGTTTTCACTGCTTTTTCTTTTACATAATATGCTGAGAATTATTTCTTATGCTTTTTAC 3081

Qy 292 4 ACAAAATGTCTCA GTTTTCAG CTATACCTG 

CTCGT 2958 



Db 3082 

TACAAACAAAATTACTCACCTGGATTAAAGATTAAGGCCTTAATCTGTTTAGATTATCTT 3141 

Qy 2959 TCATAACCAACA AT AT AAAC C G T G GTCTCA — TGTAACACA- 

TAAAC 3002 



Db 3142 

TAATCTCCATGAAATCGTGAAATAAGACAAGAATAGTGTTTCAGCTGTAGGCCATTTTAC 3201

Qy 3 0 03 AATTCA-TGCCTTTCA TAG — TTTATTATTATTAAAGTCTAAACAAAATTGC— 

AA 3053 



Db 3202 

AGCTAATTGCCCATAAATTGTAGCATTTATTGACCTGAAGTACTAAGCTAATTGTCTTGA 3261

Qy 3 054 TTTCTTAGGTAACC T T AT AT T T AC AAT AAAT G AAG AT T A- 

CCCTCAAATG 3102 



Db 3262 

CTACTCAAAGCCCCTGAATTGTTGTCAACTTTCCCCTTTGTGTTGTGTAGCCCTAACGT- 3320

Qy 3103 CTAGAAGC TGTCTAGGTCCGTCCGGTGTGTCAGATTTCCTC 

AG AT TAG A- 3151 



Db 3321 CATTTAGCTTGTTGTCT-GATGCCTCCAGTAGGACA- 

CCTCCGATGGAGCTTTGAT 3374 

Qy 3152 TGTGC — CAATAACCAAGTTTATTC — 

AGTAAACAACTTGTACTTGTTTCATCTGGT 3204 



Db 33 75 TTCTGAGCAGCGAAAGC 

TCCCTTCC T AAG AT GC AT C T C GC AT AGGC TGCC T ATGAT 3430 



Qy 32 05 TTATTAC- 

CCCTCTGGAAAT 3253 

I I I 



- T C T - C AC C C AT AAAC AG T AAT G AC T C T C T G A 



II III I 



I I I I I 



I I I I I 



I I I 



Db 



3431 



GAAGGACCGTGCACCTCCACTCCAACAGAGTGCTGAGTTTAAAAGTTGACCTGTG T 3 4 86 



ATCTTGCTTAATAAATATCTGCTGGATTCTTTCAAAAAAAAA 3544 
Qy 2 CGGCGCGCCCGAGC- 

CTAGTCCCCACGCCGCGGCGCGCC 39 

III I I I I I I I I II III 

Db 3589 CTGC-AGCTCGAGCACATGT 

GGCCTTTTTTTTTTTTTTTTTTTTT 3546 

Qy 40 

CGGGCTCCCTGCTGATCCCAGAACAATCAACCATGACGACCGAATCTGG 8 8 



Db 35 45 TTTTTTTTTTGAAAGAATCCAGCAGAT- 



Qy 3254 ATGTAAT 

GCTTCCAATCTTGCTTTGTGTATCTCATTTAATTTGTTCCGGTTAAGGA 33 09 
I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I 



Db 



3487 TTGTAATTTCACTTTC- 



TGA 3495 



Qy 89 A— 

TCAGACTCGGAATCCAAGCCGGACCAGGAGGCCGAGCCCCAGGAGGCGGCGGGGGC 145 

I I I I I I I I I I I I I I I I I III 

I I 

Db 3 49 4 AAT T AC AAAC AC AG GTC- 

AACTTTTAAACTCAGCACTCTGTTGGAGTGGAGGT 3443 

Qy 146 GCAGGGGCGGCGGGGGCCGT — GCCGGAGCCGCCCAAG-GAGGAGCAGC — 

AGCAGGCCC 200 

III III III I I I I I I I I I I I I I I I II 

Db 3442 GCA CGGT 

CCTTCATCATAGGCAGCCTATGCGAGATGCATCTTAGGAAG 3395 

Qy 201 TGGAGCAGTTCGCCGCCGCTGCAGCGCACAGCACCC-CGGTGCGAG 

GGAGGTC 2 52 

I I I I I I I I I I I I III I I I I I I I I I I I I I I 
Db 3394 -GGAGCT-TTCGCTGCT-CAGAAAT-CAAAGCTCCATCGGAG- 

GTGTCCTACTGGAGGCA 33 40 

Qy 2 53 ACTGACAAGGAACAGGAGTTTGCTGCCAGGGCTGCAAAACAGCTCGAA 

TAT 3 03 

I I I I I I II I II I I I I I I I I I I I I I I I 

Db 3339 TCAGACAACAAGCTAAA TGACGTTAGGGCTAC 

ACAACACAAAGGGGAAAGTTGA 32 86 

Qy 304 CAGCAATTAGAAGACGATAAACTTTCTCAGAAATCATCTAGCAGTAAACTCTC- 

TCGG 360 

I I I I I I I II II II II II I I I I I I I I I 

Db 32 85 CAACAATTCAGGGGCTTTGAGTAGTC AAGAC AAT T AGC T TAG T AC T - 

TCAGGTCAA 3231 

Qy 361 T CTCCATTAAAGATTGTCAAAAAGCC-TAAAA— GCATGCAG- 

TGCAAAGTGATA 411 

I I I I I I I I I I I I III I I I I I I I I I I I I I I I 

I I 

Db 323 0 TAAATGCTACAATTT — 

ATGGGCAATTAGCTGTAAAATGGCCTACAGCTGAAACACTATT 3173 
Qy 412 

CTTCTCGATGGAT C AG AAT AT AC C T G T G AT G T AGAGAAACGC T C C AG AGG AC AAG T 467 

I I I I I I III III I I I I I I I I I I II III II 

I 

Db 3172 CTTGTC-TTATTTCACGATTTCATGGAGAT- 

T AAAG AT AAT C T AAAC AG AT T AAGGC C T T 3115 



Qy 46 8 GCTGTTTGATAAAGTGTGTGAACACTTG 

AAC T T GC TAG AG AAAG AC T AC T T T 519 

I I I I I I I I I I I I I III II II I I I I I I I I I 

Db 3114 AATCTTTAATCCAG- 

G T GAG TAATTTTGTTTGTAG T AAAAAGC AT AAGAAAT AAT TC T C A 3 056 

Qy 52 0 G-GGCTTACGTATC — GAGATGC — TGAAAACCAGAAGAATTGGTTGGAC— CCT 

GC 569 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 3 055 GCATATTATGTAAAAGAAAAAGCAGTGAAAACC 

TTGTTTGGTCTTCCAGTGGC 3 0 03 



Qy 570 TAAGGAAATAAAAAAAC AGGTTCGAAGTGGTGCTTGG 

CACTTTTCATTTAATG 622 



Db 3002 

AAGGATAGTTGAACAACTTCAAGAAGCTGAGAGAAGTTTGTTCCCAATTTACAAGTATTT 2 9 43 

Qy 623 TGAAATTTT-ATCCACCAGACCCTGCCCAACTAT — 

C T G AAG AT AT C AC C AGG T AC T AC C 6 79 

I I I I I I II I I I I I III II I I I I I I I I I I 

Db 2 9 42 T G AC AT AT AG AT AAAC AAAAGC AAAAC C AG T T AAAC C T T AAAAG AT C AT C T - 

GAATAAGA 2884 



Qy 680 

TCTGCTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGCCCTGCTCCTTTGTTACCC 739 
I I I I I I I I I I I I I III I I I I I 

Db 2 8 83 T C — C T G AC AAT T AT AAT T T T AC A TCCATGTGAATTTCAAG — 

CAATTGTTAATG 2 833 



Qy 740 TGGCCTTGCTGGGCTCC — 

TACACTGTCCAGTCAGAGCTCGGAGACTAT G AC C C AG AT G A 79 7 

III I I I I I I I I II III I I I I I III 

Db 2 832 GGGACCAGCAGGGCTGCATTAAGAAAACCA — CTTTTCACTGA- 

TCTCTCCCCCACATAT 2776 



Qy 798 

ATGTGGGAGCGATTACATTAGTGAGTTCCGCTTTGCACCAAACCACACTAAAGAACTGGA 85 7 
II I I I I I I I I I I I I I I I I II II 

I 

Db 2775 TTTTAATTTGTCTTGCTTTGTTTATTTTGGTTATGCA — AGTCCTTTCT — 

CTTCA 2724 



Qy 858 AGACA-AAGTGATCGAGCTGCACAAGAGCCACAG AGGAAT- 

GACGCCAGC 9 05 

I I I I I I I I I I I I I I I I I II II 

Db 2 723 T G AAAC AAG T G T AAG 

GCTCTAAGGCTAAAATAATAGTTATTTTTGTGGGCCCCA — 2 6 7 0 



Qy 9 06 AGAAGC-AGAGATGCATTTCTTGGAAAATGCCAAAAAATTATC- 

AATGTATGGGGTAGAT 963 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2669 AATAGCTACTTTTGAATTTCTT 

TCTTTAGTATATCTCAAATCTGGGGAACAT 2618 

Qy 96 4 -TTACATCATG-CTAAGGACTCAGAAGGGGTAGAAATTATGTTAGGAGTT 

TGTGCAA 1018 

I I I I I I I II I I I I I I I I I I II II 

I I 

Db 2617 

GG AAC T T G AAG AC T C C T AAC CAT G AAGC AT T TGGAAAAAT AC AT ATC AT TCACTTTTCAC 2558 



Qy 1019 

GTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTGCCTGGCCCAAGGTTC 10 7 8 

I II I I I III I I I I I I I I I I III II 

I I I I 

Db 2 55 7 AG AAC CATTTTCT T AAAAAT AAGAGGC AATATCCAGATTCACATG CAT- 

GTTC 2506 

Qy 10 79 TAAAGATTT- 

C AT AC AAACGGAAC AAC T T T T AC AT T AAGATCCGGCCGGGAGAGT T 1133 

I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2505 AT AAT AAAGC T T T G T T T T AAAAC AAATCC AC ACC AGC AT TAT - 
TTTCAGC 2457 



Qy 113 4 TGAACAATTTGAAAGCACCATTGGGTTTAAG — CTGCCA 

AACCATCGAGCTGCCAAG 1188 



Db 2 456 TTAATATTTTGGCCCCAACCCAAGAATTCAGGTATAACATTTAATTTTC- 

ATCTGTAATG 2398 

Qy 1189 CGTTTATGGAAAG TATGTGTTGAGCATCATACATT 

TTTC — A 1228 



Db 239 7 CATATATTGGAAGGACAAAATATATAT 

GCAAATAACACTGGAAAATGTTGTTTCCTA 2341 

Qy 1229 

GACTACTGTTACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTT 12 8 8 



Db 23 4 0 T AC AAG T GAT - C AAG AA — AAAAACAAAGAAAAAG AAAAC C CAT AGC AC T AAG T T T - 

GCT 2285 

Qy 12 89 ATAGTGGCA G G AC AC AAGC GC AAAC GAG AAG AG CCAG-TGCGTTGA 

TAGATC 1339 



Db 22 8 4 GTAGTGAAAATATGGTTAACGGTGC 

CGTGCAAAATTAGATTTGTTGAAAAGTAGTTC 2228 

Qy 13 4 0 GCCCAGCACCTTACTTTGAACGCTCATC- 

CAGCAAACGT 13 7 7 



Db 222 7 TATTACAGGGAGCAAAAGCAAAGAAGC- 

AAGTTTGTGGGCAACTCGGAAGAGGGGATCAT 2169 

Qy 1378 

TATACCATGTCTCGCAGCTTGGATGGAGCATCAGTGAATGAAAACCATGAAATATACATG 143 7 



Db 2168 ATTCTCATTTC 

CTTAATTGTAATTTAAATAATCATTCTCCATAATTTCTCTTTC 2115 

Qy 143 8 AAGG ATTCTATGTCTGCT 

GCAGAGGTTGGTACTGGCCAGTACGCCACAACAA 1489 



Db 2114 TAGAGCTTTCTTCTTTGTATGCTATCACGTAAGGTAGTACT 

ATTTCACCTTAAAAA 2059 

Qy 1490 

AAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCGGAGAAGAAGGCTGAGGAGG 15 49 



Db 2 058 ATGGAA AGACAAATT TCAGC-CTAAGAAT- 

GATGTATTCTGAGTTGG 2014 



Qy 1550 AGC- 



GGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAGAAGTCACGCCCATCT 16 03 



Db 2 013 C AAT T AC T T AAAAC AATGAGAGAGT AAAAC AC ATC AAAC T T AAAG T G T T T A 

CTTTT 1958 

Qy 16 0 4 CGGCCATCCAGCACG AGGGAAAGACTGACA 

GTGAGCGCACGGACACCGC- 1652 



Db 1957 

T G AAC AT T AAAAT AAAT G T C AGGT AAAT AC AGAAAAAT AT AT AATC AC TAG T T C T C AGC A 189 8 

Qy 1653 AGCCGACGGGGAGAC C AC T GC C AC TGAGGAGC T AG AAAAAAC T C 

AAGA 1700 



Db 1897 TTTTATCAATC 

CCTTTGTTTCTCTTCTTCTAGAAGATTCTATGAGCATTATGT 18 45 

Qy 1701 TGACCTGAT GAAACATC AAACCAACATTAGC- 

GAGCTGAAA 1740 



Db 18 44 TGCTCCGATCAAAAGTAACCTCACTACTCAAATACCATCA- 

TAGCTTCAATGTGCATTAT 1786 

Qy 1741 AG AAC C T T C T - TAG AAAC - C T C AAC AG AC AC T G C C G T AAC G 

AATGAATGGGAGAAGA 1795 



Db 1785 

AGTGATTTCAGTAGATTCACACTCAAATCTTTTCAGTGTCATACATTTATTAAGCCATAA 1726 

Qy 1796 GGCTTT CCACCTCCCCCGT GCGA-CT-GGCCGCC 

AGGCAGGAGGATGCC 18 42 

III III II II I I I I I I I I I I II 

Db 1725 

AG T T AT G AAAC C C T C AGC T C T T G T GGAGAGATC TGGGC AGAAT T T AT AC AAGAAG AAG T T 1666 

Qy 18 43 C C C AT GAT C G — AAC C AC TTGTCCCT G AAG AG AAAAT GG AAAC C AA 

GACGGAGTC 1895 



Db 1665 T AAT T T AT C AC T T AAAAATC ATC TC AT AAT T G 

T GG T AAC AG AT T T T T AAAG T AAC C 1610 

Qy 1896 CAGTGGATAGAG AC — GGAACCCACCGTGCAC- 

CACCTGCCGCTTAGCACTG 19 44 



Db 16 09 CTTTGGAAAAAATTTTACGTGATACCCAAAG- 

GCACTAGTTCACATAAGGGGTGTGGCTG 1551 

Qy 19 45 AGAAGGTGGTGCAGGAGACC 

GTGTTGGTGGAGGAGC 1980 

I I I I II I II I II I II 

Db 155 0 AAAAAG C AAAAC A — AAGCCATTGTGTAAGT- 

AAAGAGCACACAAAAAAAGGATATTAAA 1494 



Qy 19 81 -GG CGTGTGGTGCACGCGAG TGGGGAT- 

GCTTCTTACTCGGCGGGAGACAG 2 029 



Db 1493 

AGGAAACCTGTTAAGCTTTAAAGTTATTCTAAAAATGGCACTTTTCACGTTGGTATAAAA 143 4 

Qy 2 03 0 CGGGGATGCT GCAGCA CAGCCCGCATT 

CACAG 2 061 



Db 1433 

TGAAAGTGCTTTAAGATGAAATTTTTATGTATTTTTTTAAAGCTGATTTTTAACCCCCTG 13 7 4 

Qy 2 062 GCATTAAAGGG AAAGAGGGCTCTGCTTGACGGAGGGGGCTAA AGAGG- 

AAG 2111 



Db 13 73 AAAATAGGGGGTTATAACAAGAACACTGATAGAC 

TCTAACTCATAGCGGCAAT 1321 

Qy 2112 GAGGGGAGGAGGTCGCTAAA — GC TGTCCT 

GGAACAGGAA — G 2150 

II I I I I I I I I I I I I II I I I I I II 

Db 1320 

AAGTAGTGGATGTCACTAAAATGCTCTGTGTGTGTCTTTATTTTTTAGTATTTTTAATGT 12 61 

Qy 2151 AGACAGCCGCTGCTTCCCGTGAGCGACAAGAGGAGCA- 

GAGTGCAGCCATCCACATTTCA 22 09 



Db 1260 TGATAGAC-TTGCTTT ATCTATCTGTGTATCATTAGTG-AG — 

ATTTGCTTTTCA 1210 

Qy 2210 GAAACTTT GGAACAAAAACCTCATTTTGAGTC- 

CTCAACGGTGAAGACGGAAACCA 226 4 



Db 12 09 - AT T C T T T AC T AG AAAG T T AC AGC T C AT T T AAAAT AGC T 

GGTAAATACAGAAAGCA 1155 

Qy 2265 TCAGTTTTGGCAGT — GTTT CACCGGGAGGAGTAAA-GC — 

T AG AAAT T T C C AC 2313 



Db 1154 T AAAT AT — 

AC AG T AAG T T T AAAC AC T GAT T G T AC T AAATGC AAC AAT AC AAAG AAGC AA 109 7 

Qy 2314 AAG AAG TGCCAGTAGT-T C AC AC C G AAAC — CAAAAC CATCACATAT 

GAA 2360 



Db 1096 

ATGGAACACAAATGGTAACACAATAAAACTGTATTACATTTGTGCTTCAAATATTACAAT 103 7 

Qy 2361 TCAT CACAGGTCGATCCAGGCACAGATCTGGAGCCAGGCGT- 

GCTGATGAGTGC 2 413 

III II III II I I I I I I I I I I I I I 

Db 1036 AC AT T ATATACAATAGTC 

CAAAATAAACATCTCATGCCAAGTGACACAAAAACGAAT 980 



Qy 2 414 A-CAGACGATCACATCTGAAACCACCAGTACCACCAC 

CACTACCCACATCACCA— 2 466 

III II I III I I I I I I I I I II II I I I I I

Db 9 79 AGCAAGCAAAAAAATC — CAGCCATATGTA- 

CATTACTTTTTTCTTTAATAATAAACATT 923 

Qy 2 46 7 AAACTGTGAAAGGGGGC- 

AT T TC AGAGAC AAGAAT TGAGAAGCGAATAGTCATCACGGGG 2 52 5 

I I I I I I I I I I I I I I I I I I III II I I I I I I I I I II 

Db 922 CAACTCTGAACTGGGGCAATTTC ACTA-CATACAGATGC AGTC — 

CGCCTTT 874 

Qy 2 52 6 GATGCAGACAT-TGACCATGACCAGGCG — CTGGCTC- 

AGGCAATTAAAGAGGCCAAAGA 2581 

III I I I I I I I I I I I I I II I I I I I I I I I III 

Db 8 73 TATTC CATATAACCAT — 

CCTGCCTTCCTATATCTACGCTATTCAGTAGG 82 6 

Qy 2582 GCAGCACCCTGACATGTCAGTGACCAAAGTA GTGGT- 

C C AT AAAG AG AC 2629 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 825 TTCCTG TGTCAAT- 

TCTTATGTACATGTGGAGCAAAAAAGATAAAAGAATGT 7 75 

Qy 263 0 AG AG AT C ACACCAGAAGATGGAGAGGATTGACCAGAG 

GAATAACTTAGCT 2 6 79 

I I I I I I I I I I I I I I I I I I I I I III 

I 

Db 774 

AGTGCTCTGTATTCTTAATGGGGTGATGTTGATAGATCAGCGAACCTGGGGTGCATAACC 715

Qy 26 8 0 TGC — ACAT G AAT GC AG T C AT GC AAAC C G T T AGG AAAA CCAG— 

AGCCTATATGGA 2 731 

II I I I I I I I II I I I I I I I I I I I I I I I I I 

I 

Db 714 

TTCTTTCATTAAACCTTCCTCGTAGTCCGTCTGGCAAAGGATCATGTTATTCTTTAGGAA 655 

Qy 2732 

GTTCCCTCTTCTAACCCAACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTCC 2 7 85 

II II III I III I I I I I I I I I I 

I I I I I I 

Db 654 AAATTTGTCTCCAACACAAAATC-TCTGATTACAAAGCTG-ACATGCAAAG 

CAGTCC 600 

Qy 2 786 AG AAGAATTGACCTTGAC CAT T AAT AAAGAC AC TGGC AGAGA- 

GATCTTCCC 2836 

II II I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

I 

Db 599 AGGTGGTAAACATTGTCCTTGGCACGCATCACCATC- 

TCAAAGGCAGGGATGAGCTTACT 541 



Qy 2 83 7 ATAATAAAGCAATCTGATT-CAGCATCACTAAA 

CCGATAATGCATGAAGCAAC- 2888 

II III I I I I I I I I I I I I 

Db 540 ACAG-GCAGCG — 

CAGTTTCCCGTTACACCAAAGAGCCTCAGATAGTCTCTGCGACAAAG 484 



Qy 



2 8 89 GATAAAATTA-CAAAAGAG — CAGCAT TTTTAATTTTCACAA 



AATGTCTCAGTT 2939 

I I I I I I I I I I III III I I I I I I I I 

Db 483 

GATAAGATTAGCTTTAGTGTACAGGGTGGAGCCCACCTCCCCCAAGCGACAGTCACAGCA 42 4 
Qy 29 40 TTCAGCTATACCTGC — 

TCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACA 2 99 7 



Db 423 GGCA-CACTTCAGGCAGTC-TTCAT-GCCAG-TATTTGTCCAGTG 

CCTTTAGAAGA 3 72 

Qy 299 8 TAAACAATTCATGC- 

CTTTCATAGTTTATTATTATTAAAGTCTAAACAAAATTGCAATT- 3 055 



Db 3 71 T-ACCGGTCCTTGATCTTTCG 

GTTGCAGCCAGCACAACCTTTCGGCTT 325 

Qy 3056 TCT TAG 

GTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTA 3105 

III II I I I I I I I I I I I I I I III 

Db 32 4 GGTGTCTGGCTGGACTGAGAGCATTTGTATACCAAAACT 

TTTCTCCTTTTTCTGCAT 268 

Qy 3106 GAAGCTGTCTAGGTCCGTCCGGTGTGTCAGA-TTTCCTCAGATTAG — ATGTGC- 

CAATA 3161 



Db 26 7 G -AG T T G AC T T AGGC 

GTGTCTGAGTTGCAGCAGCTTCGCATTGAGCACCAAA 217 

Qy 3162 AC C AAG T T T AT T C AG T AAA CAACTTGTACTTGTTTC- 

ATCTGGTTTA 3207 

I I I I II I I I I I I I I I I I I I I 

Db 216 CCCAAAGGTA- 

GAAGTAGAAGGGGGTCTCCTTGATTTCGCTTAAGTGTGGACCTGGTGCG 15 8 

Qy 32 0 8 TTACTCTCAC CC AT AAAC AG T AAT G AC T C T C T 

GAC C 32 43 



Db 157 

CAGCCTACACCGCCGAGGACCGACTATTGTGAAGCCACTTTGGGAGCGGGTCGGAGTGGC 9 8 

Qy 32 44 CTCTGGAAAT ATGTAATG CTTCCAATCTTG 

CTTTGTGTA 3282 

III I I I I I I I I I I I I 

Db 9 7 

GGCAGGGGGTGGGGGAAGGGATGAGGACGGCCAGACAAGACAGGGCGCACACACGGAGCC 3 8 

Qy 3283 TCTC ATTTAATTTG TTCCGGTTAAGG 3308 

III I II II I I I I I III 

Db 3 7 CCTCGCAGTGTGCAAAATGATGGCGAATCCAGT — AGG 2 
Qy 1 GCGGCGCGCC-CGAGCCTAGTC — CCCAC GCCGCGGCGC—

GCCCGGGCTCC 47 

I I I I I I I I I I I II I I I I I I I I I I I 

I I I I 

Db 22 

GCGGAATTCCTCGAGCACTGTTGGCCTACTGGGTGTACTGTGGAGACTGTCAAAGTCTCC 8 1 

Qy 4 8 CTGCTGATCCCAG-AACAATCAACCATGACGACCGAATCTGGATCAGACTCGGAA 

TC 103 



Db 82 — 

CGGAGCCCAATTTCCGGAAGCGGTGAGTTCTGAAAGAAGTTCCTGCACCGTAGTTTC 13 8 

Qy 104 CAAGCCGGACCAGGAGGCCGAGCC — 
CCAGGAGGCGGCGGGGGCGCAGGGG C 153 



Db 139 C CAAGTCTGCGAATCC CCAACCATGA- 

GCGCCTCGGGCGTACTGTCCTTTACCC 191 

Qy 154 GGCGGGGGC-CGTGCCGGAGCCGCCCAAGGAGGAGC-AGCAGCAG — GCCCTGGA- 

GCAG 208 



Db 192 

AGCAAGGATGGGAGCAGGTGCTGGCCAAAGTGAAACGGGCTGTGGTTTACCTGGACGCCG 2 51 

Qy 209 — TTCGCC GCC GCTGCAG 

CGCACAGCACCCCGGTGCGAGGGAGG 250 



Db 252 

CCTGCGCCGAGAGCCTGCACTGGGGCTGCGGATCCACCCGTCTCCTGGAGGCGGTGGGGG 311 



Qy 251 

TCACTGACAAGGAACAGGAGTTTGCTGCCAGGGCTGCAAAACAGCTCGAATATCAGCAAT 310 

I I I I I I II I I I I I I I I I I I I I I I 

I I I I I 

Db 312 GCCCTGAC TGTCACCTGCGAGAGTT-CG AGCCCGA 

CGCAAT 351 

Qy 311 

TAGAAGACGATAAACTTTCTCAGAAATCATCTAGCAGTAAACTCTCTCGGTCTCCATTAA 3 7 0 

I I I I I I I I I I I I I I I I I 

Db 352 TGGTGGTGGA GCCAAGCAG CCCAAGG- 

CAGTGTTTG 386 

Qy 371 

AGATTGTCAAAAAGCCTAAAAGCATGCAGTGCAAAGTGATACTTCTCGATGGATCAGAAT 43 0 

II I I I I I I I I III I I I I I I I I I I I 

I I 

Db 387 TGCTGAGCTGCCTGCTGAAAGGC CGGACCGTGGAGATCCTAC 

GGGACATCAT 43 8 

Qy 431 

ATACCTGTGATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAAC 49 0 

III III III I I I I I I I I I III 

I 

Db 439 CTGCC GCAGT CACT TCCAGTATTGTGTG 

GTGGTC 472 

Qy 491 

ACTTGAACTTGCTAGAGAAAGACTACTTTGGGCTTACGTATCGAGATGCTGAAAACCAGA 550 

II III I I I I III I I I I III 

I 

Db 4 73 AC AAC CGTGAG CCAC 

GCTGTCCACCTCA 50 0 



Qy 551 AGAATTGGTTGGACCCTGCTAAGGAAATAAAAAAACAG- 

GTTCGAAGTGGTGCTTGGCAC 6 09 

I I I I I I I II I I I I I I I I I I I 

I I I I 

Db 501 CAGCTAATCATGTCCCAGCGGCGG CAGCGGCCGAGATGGAG — 

GGGCA- 5 46 

Qy 610 TTTTCATTTAATGTGAAATTTTATCCACCAGACCCT 

GCCCAACTATCTGAAGATAT 665 

II III I I I I I I II I I I I I I I I I 

I I 

Db 547 GCAGCCGGTGT TCGAGCAG 

CTGGAGGAGAAGCTGTGTGAATGGAT 591 

Qy 666 CACCAGGTACTACCTCTGCTTGCAGTTGCGAGATGACATCGTGTCCGG — 

AAGGCTG 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 592 GGGCAACATGAACTA-CACGGC CGAGGTGT- 

TCCATGTCCCGTTATTGCT- 639 

Qy 721 CCCTGCTCCT-TTGTTACCCTGGCCTTG- 

CTGGGCTCCTACACTGTCCAGTCAGAGCTCG 778 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 640 TGCCCCTGTTGCTCCCC — 



ACTTTGCCTTGACTCCAGCTTTTGCATCCCTTTTCCC- 693 



Qy 779 

GAGACTATGACCCAGATGAATGTGGGAGCGATTACATTAGTGAGTTCCGCTTTGCACCAA 83 8 

III I I I I I I I I I II I I I I I I I I I 

Db 69 4 ACTGCTACCC — CAGGATGT GCACCTCCTTAA 

TAGCGCCCG 732 

Qy 839 AC C AC AC T AAAG — AAC T - GGAAGAC AAAG T GAT - C G AGC T GC AC AA 

GAGCCACAGA 891 

III I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I I I I 

Db 733 

ACCGGACAAGAGGAAGCTGGGAAGCCTGGGTGATGTGGACTCCACTACGCTAACCCCAGA 792 

Qy 892 GGAATGACGCCAGCAGAAGCAG 

AGATGCATTTCTTGGAAAATGCCAAAAAATTATCA 948 

I I I I I I I I I I I I I I I III I I I I I 

I I 

Db 793 G CTGCTGCTGCAGATCAGATGC CTAGTGTCAGGCCTCAG 

TTCT 835 

Qy 9 49 ATGT ATGGGGT AGAT T T ACATCATGC T AAGGAC T - 

CAGAAGGGGTAGAAATTATGTTAGG 1007 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 836 CTGTGTG AACAT — T T AGG AG T AC GGGAGGAGT 

GTTTTGCT 874 

Qy 1008 

AGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTGCCTG 106 7 

II III I I I I I I I I I I I I I I I 

I I I 

Db 8 75 -GTAGGTTCCTTAAGTCAGGT CATC— GCTGCGGAT 

CTG 910 

Qy 106 8 GCCCAAGGTTCTAAAGATTTCATACAAACGGAACA — 

ACTTTTACATTAAGATCCGGCCG 1125 

I I I I II I I I I I I I I I I I I I I I I I III 

I 

Db 911 GCCAATTATGCCCCTG CAAAGAACAGGAAGAAGACTGCTGCA 

GGCAG 957 

Qy 1126 GGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCA- 

AACCATCGAGCTGC 1184 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 958 G GCATCAGT- 

GGTTTTTGTGGACAGAACCCTGGATCTCA 995 

Qy 1185 

C AAGC G T T T AT GG AAAG TATGTGTT GAGC ATC AT AC AT T T T TC AGAC T AC T G T T AC C AG A 12 4 4 
I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 996 CAGGAGC AGTTGGACATCATG 

G AG AC AAC T TAG TAG AG A 103 4 

Qy 1245 

AGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTTATAGTGGCAGGACACA 13 0 4 

I I I I I I I I I I I I II I I I I I I I I 

I I I I 

Db 1035 AGATCATTTC-AGCACTTCC CC AGCTCC 



CAGGCCACA 10 70 



Qy 1305 

AGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCACCTTACTTTGAACGCTC 136 4 

I I I I I I II I I I I I I I I I I I 

I I I I I 

Db 10 71 CAAATGA TG TGATGGTT AACATGAT 

AGCGCTC 1102 

Qy 1365 ATCCAGCAAACGTTATACCATGTCTCGCAGCTTGGATGGAGCATCA 

GTGAATGAAA 142 0 

I I I I I III I I I I I I I I I I I I I I 

Db 1103 A— CTGCA 

CTCCATACTGAGGAGGAAAATTATAATGTGGTTG — C 1143 

Qy 1421 ACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACTGGCCAGT- 

AC 1479 

I I I I I I I I I I I I II I I I I 

I I 

Db 1144 ACCAGG CTGTCT — TTC 

AC AAT T C AG T G AC 1171 

Qy 148 0 GCCACAACAAAAGGCATCTCTCAGACCAACTTGA- 

T C AC C AC T G T G AC T C C GG AG AAGAA 153 8 

I I I I I I I I I I III I I I I I I I I I I I I III II 

I 

Db 1172 ACCACAGCCAAAG CCCTATGGGAAGCTTTACTGAACACTAAG-CAC 

AAAGA 1221 

Qy 1539 

GGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAGAAGTCACGCC 159 8 
I I I I I III II I I I I I I I I I I I 

Db 1222 GGCAGTGATGGA 

AGTTCGGAGACATC TAG T G G AAG 12 56 

Qy 1599 

CATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGCGCACGGACACCGCAGCCGA 165 8 

I I I I I I I I I I I I I I I I I I II I I 

I 

Db 1257 CGGCAAGC AGAGAAAACCTGCCAAT CA- 

AGATGAGTA 1292 

Qy 1659 

CGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCTGATGAAACATCA 1718 

I I I I I I I I I I I II I I I I I I I I I I I I 

I I I 

Db 12 93 TGGGGAGA GTCACACCGGGAC AGCTC ATGTCCT 

ATATTCA 1332 

Qy 1719 

AACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCTCAACAGACACTGCCGTAAC 17 7 8 

II I I I I I I I I I I I I I I II 

I I I 

Db 1333 GCTCTTC AAGAAC AACCTCAA 

AGCTCTAAT 1362 

Qy 1779 

GAATGAATGGGAGAAGAGGCTTTCCACCTCCCCCGTGCGACTGGCCGCCAGGCAGGAGGA 183 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1363 GAATCATTGTGG CCTCCTCCAGCTTG- 



GACTGGCCAC- 



1398 



Qy 1839 

TGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAAATGGAAACCAAGACGGAGTCCAG 189 8 
III I I I I I I I I I I I I I I I I I I III 

Db 1399 AGCTCAAACGTT-GAAACAC CCACAGACTGCCAAGTGG 

GACAACTTTC — 1445 

Qy 1899 

TGGATAGAGACGGAACCCACCGTGCACCACCTGCCGCTTAGCACTGAGAAGGTGGTGCAG 195 8 



Db 1446 TGGCTTTTGAAAG GCTCCTTCTTCAG AGCATTGGGGAGTC- 

AGCA- 1489 

Qy 1959 

GAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACGCGAGTGGGGATGCTTCTTACTCG 2 018 



Db 149 0 ATGTCCGTTGTGTTAAATCAGCTGC TGCCCATGATT 

AAGCCTGTAACCC- 1538 

Qy 2019 

GCGGGAGACAGCGGGGATGCTGCAGCACAGCCCGCATTCACAGGCATTAAAGGGAAAGAG 2 0 78 

III I I I I I I I I I I I I I I I 
Db 1539 -AGAGAACCAACGAGGA — 
CTACAGCCCTG A 1566 

Qy 2 0 79 GGCTCTGCTTGACGGAGGGGGCTAAAGAGGAAGGAGGGGAGGAGGTCGC 

TAAAGCTG 2135 



Db 1567 GGAACTGCTGATC CT 

TCTCATATATATTTA 1596 

Qy 2136 

TCCTGGAACAGGAAGAGACAGCCGCTGCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAG 2195 



Db 1597 TTCTGTCACTGGAG AGC TCACGGTA — GACA — 

AAGACCTGTGTGAAG 16 40 

Qy 2196 C— CATCC AC ATTTC AGAAA-

CTTTGGAACAAAAACCTCATTTTGAGTCCTCAACGGTGA 2 2 52 



Db 16 41 CAGAAGAAAAAGTCAAGAAAGCATTGG- 

GTGA 1683 

Qy 2253 

AGACGGAAACCATCAGTTTTGGCAGTGTTTCACCGGGAGGAGTAAAGCTAGAAATTTCCA 2312 



Db 1684 GGA- 

ATCTGGATTGTCACCTTTGCTGCAAAAAATTACGGACTGGGACTCTTCA 1735 

Qy 2313 CAAGAAGTGCCAGTAGTTCACACCGAAACCAAAACCATCACATATGAA- 

TCATCACAGGT 23 71 



I I I I I I I I I 

-CTCA GGTCTTCT- 



Db 1736 ATTAATCTGAC ATTTCACA 

AATCCAAAATTGCCGTGGATGAACTCTTTACT — T 1787 



Qy 2372 

CGATCCAGGCACAGATCTGGAGCCAGGCGTGCTGATGAGTGCACAGACGATCACATCTGA 2 431 



Db 178 8 CACTTCGGG-ATATTGCTGGAGCTCGGAGT-CTCCTGA AACA 

GTTTAAGTCTGT 1839 

Qy 2432 AAC — 

CACCAGTACCACCACCACTACCCACATCACCAAAACTGTGAAAGGGGGCATTTCA 2 4 89 
I I I I I I III I I I I III 

Db 18 40 ATATGTTCCTGGAAATCATACCCACCAGGCATCTTATAAGC 

CATTGTT 1887 

Qy 2 49 0 GAGACAAGAATTGAGAAGCGAATA-GTCATCACGGGGGATGCAGA 

CATTGACCATGA 2 5 45 



Db 1888 G AAG C AAG — TTGTGGAG G AAAT AT T T C AT C C C G AG AG G C - 

CAGATTCCGTTGATATTGA 19 44 

Qy 2546 CCAGGCGCTGGC- 

TCAGGCAATTAAAGAGGCCAAAGAGCAGCACCCTGACATGTCAGTGA 2 6 0 4 
II I I I I I I I I I I 

Db 19 45 ACA — C-ATGTCTTCAGGC 

TC 1970 

Qy 2605 

CCAAAGTAGTGGTCCATAAAGAGACAGAGATCACACCAGAAGATGGAGAGGATTGACCAG 266 4 
I I I I I I I II II 

Db 19 71 TCCTTAAA AC 

TGGAT 19 85 

Qy 2665 

AGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAAACCGTTAGGAAAACCAGAGCCT 2 72 4 
I I I I I I I I I I I I I I II I I I I 

I I I 

Db 19 86 TTAGCATGTTCATGAAGG TG — AGCCG 

GCCTCATCCT 2020 

Qy 2725 

ATATGGAGTTCCCTCTTCTAACCCAACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTC 2 7 8 4 



Db 2 021 AGT — GACTACCCCCTCCTGATCC TCTTTGTG GTAGGTGG — 

GGTCACAGTC 2 06 8 

Qy 2 785 - C AG AAG AAT T G AC C T T G AC CAT T AAT AAAG AC AC T G G C A 

GAGAGATCTTCCCA 2 83 7 



Db 2 069 TCTGAAG TGAAAATGGTCAAAGATCTTG 

TGGCATCGTTGAAGCCAGGAACCCA 2121 

Qy 2838 — 

TAATAAAGCAATCTGATTCAGCATCACTAAACCGATAATGCATGAAGCAACGATAAAA 2 895 



I I I I I I 
-CTCACTGA- 



Db 2122 GGTAATCGTGCTGTC- 

TTAACA 2162 



-CA-CACGACTCC- 



-TGAAGCCAC- 



Qy 2896 

TTACAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTCAGTTTTCAGCTATACCT — G 2 953 

III I I I I I I I I I I I I I I I II I I I I I I I I I 

I 

Db 2163 TTCC TGAGCTG TTATTTGC AACTG — 

ACCGACTGCATCCAGACCTTGG 22 0 8 

Qy 2954 

CTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACATAAACAATTCATGCCT 3 013 
I I I I I I I I I I I II II I I I I I 

I I I I I 

Db 22 09 C TTCTGAGCATCCGCTAAGAA GAT 

AAGACCTACTCAAGCTGGAAATGCCG 2258 

Qy 3014 

TTCATAGTTTATTATTATTAAAGTCTAAACAAAATTGCAATTTCTTAGGTAACCTTATAT 3 0 73 

I I I I I I II II I I I I I I I III 

Db 2259 ATGCAATTTTCT GCCACCAC TCCAAATACT 

CCTCC— 2293 

Qy 3 0 74 TTACAATAAATGAAGATTACCCTCAAATGCTAGAA- 

GCTGTCTAGGTCCGTCCGGTGTGT 3132 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 229 4 — ACAACCAGCGTCCCTGTCACT-AATTGCGAGAATGATG — 

GAATTCTGCCTGAAGGGT 23 48 

Qy 3133 

CAGATTTCCTCAGAT TAG AT G T GC C AAT AAC C AAG T T T AT T C AG T AAAC AAC TTGTACTT 3192 

I I I I I III I I I I I I I I I I I I I 

I I I 

Db 2349 C TTGAT ACC TACTCAGTGAGGTACTT- 

TGCTT 2379 

Qy 3193 

GTTTCATCTGGTTTATTACTCTCACCCATAAACAGTAATGACTCTCTGACCCTCTGGAAA 32 52 

I I I I I I I I I I I I I I I I I III II 

Db 2380 G 

GATTGCTGTGATTCTTAAAAAAAAAAAAAAAAAAGGCCACATG 2 423 

Qy 3253 TATGTAATGCTTCCAATCTTGC 3274 

I I I I I I I I I I 
Db 2424 TGCT — CGAGC-TGC 2435 



RESULT 6 

US-10-478-914-61/C 

; Sequence 61, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 
; PRIOR APPLICATION NUMBER: PCT/JP02/05294 
; PRIOR FILING DATE: 2001-05-30 



; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 61
; LENGTH: 2437
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-478-914-61 

Query Match 29.4%; Score 5713; DB 1; Length 2437; 

Best Local Similarity 42.4%; Pred. No. 0; 

Matches 1432; Conservative 0; Mismatches 863; Indels 1080; 
Gaps 220;

Qy 2 CGGCGCGCCCGAGC-CTAGTCCCCACGCCGCGGCGCGCC 

CGG 42 



Db 2 43 7 CTGC-AGCTCGAGCACATGT- 

GGCCTTTTTTTTTTTTTTTTTTAAG 2394 

Qy 43 GCTCCCTGCTGATCCCAGAACAATCAACCATGACGACCGAATCTGGATC- 

AGACTCGGAA 101 

I I I I I I I I I I I III III I I I I I I I I I I I I I 

Db 2393 AATCACAGC-AATCCAAG — CAAAGTACC TC- 

ACTGAGTAGGTATCAAGACCC 2345 

Qy 102 TCCAAGCCGGA — CCAGGAGGCCGAGCCC-CAGGAGGCGGCGGGGGCGCAGG- 

GGCGGCG 157 



Db 23 44 TTCAGGCAGAATTCCA- 

TCATTCTCGCAATTAGTGACAGGGACGCTGGTTGTGGAG 2 2 9 0 

Qy 158 GGGGCCGTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCCTG 

GAGCAGTTCGCC 214 

II II I I II II I I I I I I I I III 

Db 22 89 GAGTATTTGGAGTGGTGGCAGAA — 

AATTGCATCGGCATTTCCAGCTTGAGTAGGTCTTA 2232 

Qy 215 GCCGCT — GCAGC-GCACAGCACCCCGGTGC 

GAGGGAGGTCACTGACAAGGAACA 266 



Db 2231 

TCTTCTTAGCGGATGCTCAGAAGCCAAGGTCTGGATGCAGTCGGTCAGTTGCAAATAACA 2172 

Qy 26 7 GGAGTTTGCTGCCAGGGCTGCAAAACAGCTCGAATATCAGCAATT- 

AGAAGACGATAAAC 325 

III I I I I II I I II I I I I I I I I III I 

Db 2171 GCT — CAGGAATGTTAAGTGGCT 

TCAGGAGTCGTGTGGACAGCACGA 212 7 

Qy 326 TTTCTCAGAAATCATCTAGCAGTAAAC 

TCTCTCGGTCTCCATTAAAGATTGTCAAA 381 

I I I I I II I I I I I I I I II I I I I III I I I I I I 
Db 2126 TTAC-CTGGGTTC — CTGGCT-TCAACGATGCCACAAGATCT 






TTGACCATTTTCA — 2 0 76 



Qy 382 

AAGCCTAAAAGCATGCAGTGCAAAGTGATACTTCTCGATGGATCAGAATATACCTGTGAT 4 41 
I I I I I I I I I III I I I I I I I I I 

Db 2 0 75 CTTCAGAG AC T G T G AC C C C AC C T AC C AC AAAG AG GATCAGGA 

GGGG 2 03 0 

Qy 442 

GTAGAGAAACGCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACTTG 5 01 



Db 2 029 GTAG TCACT AGGATGAG-GCCG GCTCACCT 

TCATGAACATG 199 0 

Qy 502 C TAG AG AAAG AC T AC TTTGGGCT T AC G T AT C G AG AT GC T G - 

AAAACCAGAAGAATTGGTT 56 0 

II III I III I I I I I I I I I I I I I I I I I 

Db 19 89 CT AAATCCAGTTTTAAG GAGAT- 

CAGTGAGGCCTGAAGACATG 19 48 

Qy 561 

GGACCCTGCTAAGGAAATAAAAAAACAGGTTCGAAGTGGTGCTTGGCACTTTTCATTTAA 62 0 

I I I I I I I I I I I I I I I I I 

Db 19 47 TGTTCA ATATCAACGGAATC TGGC 

CTCTCGG 1917 

Qy 621 TGTGAAATTTTATCCACCAGACCCTGC-CCAACTAT — CTGAAGATATCACC 

AGGT 6 73 



Db 1916 GATGAAATATT-TCCTCCACAACTTGCTTCAACAATGGCTTATAAGAT- 

GCCTGGTGGGT 1859 

Qy 674 

ACTACCTCTGCTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGCCCTGCTCCTTTG 733 

I I II II II III I I I I I I I I I I I I III 

Db 1858 AT GAT T T C C AGG AAC AT AT AC AG AC T T AAAC T G T T T C AGG - AG AC T - 

CCGAGCTCCAGCA 1801 

Qy 73 4 TTACCCTG 

GCCTTGCTGGGCTCCTACACTGTCCAGTCAGAGCTCGGAGACTATGAC 7 89 

I I I I I I I I I I I I I I I I III II III 

Db 1800 ATATCCCGAAGTGAAGTAAAGAGTTCATCCACGG — CAATTTTGGATTTGTGA- 

AATGT- 1745 

Qy 79 0 CCAGATGAATGTGGGAG — CGATTACATTAGTGAGTTCCGCTTTG CA— 

CCAAA — 839 

I I I I I I I I III I I I I I I I I I I I I II I 

Db 1744 - 

C AG AT T AAT T G AAG AG T C C C AG T C C G T AAT T T T T TGC AGC AAAGGTGAC AAT C C AG AT T 16 86 

Qy 8 40 C C AC AC T AAAG AAC T GG AAGAC AAAG T GAT C — G AGC T GC AC AAG AGC — 

CACAGAGGAA 895 

I I I I I I I I I I I I I I I I I I II II I I II I I I I 

Db 16 85 C C T C AC AG AAG AC C T G — 

AGCCAATGCTTTCTTGACTTTTTCTTCTGCTTCACACAGGTC 162 8 



Qy 896 TGACGCCAGCAGAAGCAGAGATGCATTTCTTGGAAAATGCCA- 

AAAAATTATCAATGTAT 954 



Db 162 7 TTTGTCTACCGTGAGC TCT 

CCAGTGACAGAATAAATATATATGAGA 1582 

Qy 955 GGGGT-AGATTTACATCA-TGCT-AAGGACTCAGAAGGGGTAGAAATTAT — 

GTTAGGAG 10 09 



Db 1581 AGGATCAGCAGTTCCTCAGGGCTGTAGTCCTC 

GTTGGTTCTCTGGGTTACAGG 1529 

Qy 1010 TTTGTGCAAGTG-GTCTGTTGATATATCGCGACCGGC — 

TGCGAATAAACAGATTTGCCT 1066 



Db 152 8 CTTAATCATGGGCAGCAGCTGATTTAACACAACGGACATTGC- 

TGACT 1482 

Qy 106 7 GGCCCAAGGTTCTAAAGA 

T T T C AT AC AAAC GG AAC AAC T T T T AC AT T AAG ATC 1119 



Db 1481 -CCCCAATGCTCTGAAGAAGGAGCCTTTCA-AAAGCCAGAA- 

AGTTGTCCCACTTGG— C 142 7 

Qy 112 0 CGGCCGGGAGAGTTTGAACAATTTGAAAGC ACCATTGGGTTTAAGCTGCC — 

AAAC 1173 



Db 1426 AGTCTGTGGGTGTTTCAAC-GTTTG — AGCTGTGGCCA 

GTCCAAGCTGGAGGAGGC 13 74 

Qy 1174 CATCGAGCTGCC- 

AAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGAC- 1231 



Db 13 73 CACAATGATTCATTAGAGCTTTGAGGTTGTTCTTGAAGAGCTGAATA 

TAGGACA 132 0 

Qy 1232 — T AC T G T T AC C AG AAGC AC - 

CTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTT 12 8 8 

I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1319 TGAGCTG-TCCCGGTGTGACTCTCCCCA 

TACTCATCTTGATTGGCAGGTTT 12 70 

Qy 1289 

ATAGTGGCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCAC 13 4 8 



Db 1269 -TCTCTGCTTGCCGCTTCC ACTAGATGTCTCCGAACTT 

CCATCAC 1226 

Qy 13 49 — CTTACTTTGAACGCTCA TCCAGCAAACGTTATACCAT — 

GTCTCGCAGCTTGGAT 1401 



Db 1225 TGCCT-CTTTGT— GCTTAGTGTTCAGTAAA-GCT-TCCCATAGGGCT- 

TTGGCT 1177 



Qy 1402 GGAGCATCA- 



GTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGC 145 4 

I I III I I I I I I I I I I I I I II I I I I I II 

I 

Db 1176 GTGGTGTCACTGAATTGTGAAAGACAGCC-TGGTGCAACCAC 

ATTATAATTTTCC 1123 

Qy 1455 

TGCAGAGGTTGGTACTGGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTGAT 1514 

II II I I I I I I I I I I II I I I I I III 

I I 

Db 1122 TCCTCAGTATGG-AGTG — CAGTGAGC GCTATCATGTTAACC — 

AT 1082 

Qy 1515 CACCAC 

TGTGACTCCGGAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGAC — 1569 

III I I I I I I I I I I I I I I I I I I I I III I I 

Db 1081 CACATCATTTGTGTGGCCTGGGA GCTGGGGAAGTGC 

TGAAATGATCTT 103 4 

Qy 1570 — 

AAACGGAGGAAGGGGGAAGAAGTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGA 162 7 
II I I I I I I I I I I I I I I I I I I 

Db 1033 CTCTAC TAAGTTGTCT CCATGATGTCCA- 

ACTGCTC 999 

Qy 162 8 CTGACAGTGAGCGCACGGACACCGCAGCCGACGGGGAGACCACTG- 

CCACTGAGGAGCTA 16 86 

I I I I I I I I I I I I I I I I I I 

Db 99 8 CTGTGAGATC CAGGGT 

TCTGTCCAC 974 

Qy 1687 

GAAAAAACTCAAGATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACC 17 46 
I I I I I I I I I I I I I I III I I I I 

I 

Db 9 73 — AAAAACCACTGATGCCCTG CCTGC 

AGCAGT C 9 43 

Qy 1747 

TTCTTAGAAACCTCAACAGACACTGCCGTAACGAATGAATGGGAGAAGAGGCTTTCCACC 18 06 

I I I I I III I I I I I I I I I I I I III 

Db 9 42 TTCTT CCT GTTCTTTGCAGGGGC — ATAATTGG 

CCAGA 907 

Qy 1807 

TCCCCCGTGCGACTGGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCT 1866 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

I I I I 

Db 9 06 TCCGC — AGCGA-TG ACCTGACTTAAGGA-ACCTAC 

AGCAAAACACTCCTCCCG 857 

Qy 1867 

GAAGAGAAAATGGAAACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCACC 192 6 
I I I I I I II I I I I I I I I I 

Db 856 TACTCCTAAATGTTCAC 

ACAGAGAACTGAGG 826 

Qy 192 7 ACCTGCCGCTTAGCA- 

CTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGT 19 85 

I I I I I I I I I I I I I I I I I I I I I I I I I 



I 

Db 825 - C C T G AC AC T AGGC AT C T GAT C TGCAG C- 

AGCAGC-T 792 



Qy 19 86 GTGGTGCACGCG- 

AGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGC 2 0 4 4 



Db 791 CTGGGGTTAGCGTAGTGGAG TCCACATCACC CAG 

GCTTC 753 

Qy 2 0 45 ACAGC — CCGCAT-TC ACAGGCATTAAAGGGAAAGAGG- 

GCTCTGCTTGACGGAGGG 2 09 7 

I I I I I I I I I I I I I I II I I I I I I I I I I 

I I 

Db 752 CCAGCTTCCTCTTGTCCGGTCGGGCGCTATT AAGGAGGTGCACATC 

CTGGGGT 700 

Qy 2 09 8 GGCTAAAGAGGAAGGAGGGGAGGAGGTCGCTAAAGCT 

G T C C T G G AAC AG G AAG AG AC 2154 

II I I I I I II I I I I I I I I III I I I I I 
Db 699 AGC 

AGTGGGAAAAGGGATGCAAAAGCTGGAGTCAAGGCAAAG 65 8 

Qy 2155 

AGCCGCTGCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCACATTTCAGAAAC 2 214 



Db 657 TG GGGAGCAAC AGGGGCA AGCAAT-AAC 

GGGAC 626 

Qy 2215 TTTGGAACAAAAACCTCATTTTGAGTCCTCAACGGTGAAG 

AC G G AAAC CAT CAG T T T 22 71 

I I I I I I I I I I I I I I I I I II I 

Db 625 -ATGGAACA 

CCTCGGCCGTGTAGTTCATGTTGCCCATC 589 

Qy 2272 

TGGCAGTGTTTCACCGGGAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGCCAGTAGTTC 2331 
II I I I I I I I I I I I I I I I I I III 

Db 58 8 CA TTCAC ACAGC T TCTCCTCCAGC— 

TGCTCG A 558 

Qy 2332 

ACACCGAAACCAAAACCATCACATATGAATCATCACAGGTCGATCCAGGCACAGATCTGG 2391 



Db 557 ACACCGGCTGC-TGCCCCTC CATCTC-GGCCG CTGCCGCCG— 

CTGG 515 

Qy 2392 AGCCAGGCGTGCTGATGAGTGCACAGAC 

GATCACATCTGAAACCACCAGTACCACCA 2 448 

I I I I I I III III I I I I I I II I I I I I I I

Db 514 GACATGATTAGCTGTGAGGTGGACAG- 
CGTGGCTCACGGTTGTGACCACCA 465 

Qy 2449 

CCACTACCCACATCACCAAAACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGC 2 5 0 8 



Db 464 - 

GATGATGTCCC 42 9 



-CACAATAC- 



-TGGAAGTG- 



-ACTGCGGCA— 



Qy 2509 GAATAGTCATCACGGGGGATGCAGACATTGACCATGACCAGGC-GCT- 

GGCTCAGGCAAT 2 566 

II II I I I I I I I I I I I II I I I I I I I I I I I I II 

I 

Db 42 8 GTAGGATCTCCACGG TCCGGCCTTT CAG 

CAGGCAGCTCAGCACAAACACT 3 79 

Qy 2567 

TAAAGAGGCCAAAGAGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGA 2 62 6 

III I I I I I III I I I I 

Db 378 GCC-TTGGGCTGCT TGGC 

TCCA 358 

Qy 2627 

GACAGAGATCACACCAGAAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACAT 2 6 86 

I I I I I I I I I I II I I I I I I 

Db 357 CCACCA ATTG — CGTCGG 

GCTCGAAC — 33 4 

Qy 2687 

GAATGCAGTCATGCAAACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAAC 2 7 46 
II III II III I I I I I I 

I 

Db 333 TCTCGCAG GT 

GACAGTCAGGGCC C 310 

Qy 2747 

CCAACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCAT 2 8 06 

I I I I I II I I I I I I I I I I I I I 

Db 3 09 CCCACCG CC TCCAGGAGACGGG 

TGGATCC- 281 

Qy 2807 

TAATAAAGACACTGGCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTA 2 866 

I I I I III I I I I I I I I I I 

Db 280 GCAG CCCC AGTGCAGGCT — 

CTCGGC 257 

Qy 2867 

AAC C G AT AAT GC AT G AAGC AAC G AT AAAAT T AC AAAAGAGC AGC AT T T T T AAT T T T C AC A 2 92 6 
I I I I I I I II II 

I I I I 

Db 2 56 GCAGGCGGCGTC CAGGTAAAC 

CACA 232 

Qy 2927 AAATGTCTCAGTTT — 

TCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGT 2 9 8 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 231 GCCCGTTTCACTTTGGCCAGC ACCTGCTC CCATCCTTG 

CTGGGT 188 

Qy 2985 

CTCATGTAACACATAAACAATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACA 3 0 4 4 
I III II I I II I I I I I I I III 

Db 187 AAAGGACA-GTACGCC 

CGAGGCGCTCATGGTTGGGGATTC 149 



Qy 3045 

AAAT T GC AAT T T C T T AGG T AAC CTTATATT T AC AAT AAATGAAGAT T AC C C T C AAAT GC T 310 4 

III I I I I I I I I III I III III I I I I 

I I 

Db 148 GCAG — ACTTGGGAAAC TACGGTGCAGGA— ACTTCTTTCAGA- 



Qy 3105 

AGAAGCTGTCTAGGTCCGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACC 3164

III I I I I I II I I I I I I I I I I 

I I 

Db 106 CACCGCT TCCGGAAATTGGG 

CTCCGGGAGACTTTGACAGTCTCC 63 



Qy 3165 A-AGTTTATTCAGTAAACAACTTGTACTTG 

TTTCATCTGGTTTATTACTCTCACCC 3219 

I I I I I I I I I I II I I I I I III I I I I I 

Db 62 

ACAGTACACCCAGTAGGCCAACAGTGCTCGAGGAATTCCGCAGCTTT 1 

Qy 322 0 ATAAACAGTAATGAC 323 4 

I III I I II 
Db 15 TAGAGCAGAAGTAAC 1 



RESULT 7 

US-10-478-914-20/C 

; Sequence 20, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS: 417

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 20

; LENGTH: 2495

; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-20 

Query Match 28.4%; Score 5535.5; DB 1; Length 2495; 

Best Local Similarity 42.5%; Pred. No. 0; 

Matches 1399; Conservative 0; Mismatches 988; Indels 905; 
Gaps 191; 

Qy 1 

GCGGCGCGCCCGAGCCTAGTCCCCACGCCGCGGCGCGCCCGGGCTCCCTGCTGATCCCAG 60

I I I I I I I I I I I I I I I I I I I I I I III I 

Db 2492 GCGGC CGCGA-CCT GCAGC- 

TCGAGCACATGTGGCCTTTTTTTT 2451



Qy 61 AACAATCAACCATG — 

ACGACCGAATCTGGATCAGACTCGGAATCCAAGCCGGACCAGGA 118 

I I I I I I III III I I I I I I I 

Db 2450 TTTTTTTTTACATGTTTAAATATATTTTATTTCAAGTTAATATAAC-

AGCC A 2 40 0 

Qy 119 

GGCCGAGCCCCAGGAGGCGGCGGGGGCGCAGGGGCGGCGGGGGCCGTGCCGGAGCCGCCC 178

I II II I II II II I I I I I 

Db 2399 TGATTTGTTCATTTAAGT GAAATGCT T C G AT AAAAAC C CCGGTG1 

CAC 2352 

Qy 179 AAGGAGGAGCAGCAGCAGGCCCTGGAGCAGTT- 

CGCCGCCGCTGCAGCGCACAGCACCCC 237

II I I I I I I I I I I I I I I I I I 

I 

Db 2351 T T GG T T AAAAT AT T GG AG AC AGAT T AAATG T T AAGC C AC TAT AG TAG T G 

ACTCT 2298 

Qy 23 8 GGTGCGAGGGAGGTCACT G AC AAGG AAC AG GAGTTTG — 

CTGCCAGGGCTGCAAAACAGC 295 

I I I I II I I I I I I I I II II I I I I I I I I I 

Db 229 7 GGTTCTTTAGA CTATTGCATGTACCTTGTTAATGAACT- 

ACAGGAATGCACA 22 47 

Qy 296 

TCGAATATCAGCAATTAGAAGACGATAAACTTTCTCAGAAATCATCTAGCAGTAAACTCT 355 
III III III II I I I I I I I I I I I I 

I I 

Db 22 46 TTGATATAATGCA CAATGTC-TTCACATGTCTC AATCCTCATTC 

CT 2202 

Qy 356 CTCGGTCTCCATTAAAGATTGTCAAAAAGCCTAAAAGCATGCAGTG — 

C AAAG T GAT AC T 413 

II I I I I I I I I II I I I I I I I I I III I 

I I I 

Db 2201 CT TCCTTTCATGCCT CCCTAATGCCT- 

CAGTATCCAATATAAAACT 2157 

Qy 414 

TCTCGATGGATCAGAATATACCTGTGATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTT 473

I I I I I I I II 

I I 

Db 2156 AC AAAAAC T AC 

TGTTCTTTT 2137 

Qy 474 

TGATAAAGTGTGTGAACACTTGAACTTGCTAGAGAAAGACTACTTTGGGCTTACGTATCG 533 

II I I I I I I I I I I I II I I I I I I III 

Db 2136 CGTTTGAGTG-GTATTTAATTAAAACTTC 

AATTCTATGTTATTTC 2093 

Qy 534 

AGATGCTGAAAACCAGAAGAATTGGTTGGACCCTGCTAAGGAAATAAAAAAACAGGTTCG 593 

II I I I I I I I I II I II I I I I I I II 

I 

Db 2 092 CACAG TCCAGTA-ATTTATTTTAAATTTG — 



AGTAATTTCAAATTCCACAAACA 2 0 42 



Qy 594 AAG-

TGGTGCTTGGCACTTTTCATTTAATGTGAAATTTTATCCACCAGACCCTGCCCAAC 652 

II II I Mil I I I I I I I I I I I I I I I I I I I 

Db 2 0 41 AAAC T G AAGAAC AGC AAT ATTTTGTTTGAATTTCTCTCTTCTGTACACT 

CAGT 1989 

Qy 653 

TATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGCGAGATGACATCGTGTCCG 712 

I I I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 19 8 8 GATC T - AAAACACCACAATATCCAACATACACAAACC — 

TCAGGGAAGGGTTAGTAAAC- 1933 

Qy 713 GAAGGCTGCCCTGCTCCTTTGTTACCCTGGC-CTTGCTGGGCTCCT 

ACACTGTC- 765 

I I III I I I I I II II I I I I I I 

I I I 

Db 1932 ACACACAAGATTGGAATCATGGTGCT — 

CTTTGCTCCTGAATGGAATGGTCC 1883 

Qy 766 — CAGTCAGAGCTCGGAGACTATGACCCAGATGAATGTGGG-AGCGATTACAT 

TAGT 819 

III I I I I I II II III II I I I I I I I I I I I 

I I I 

Db 18 82 CACAGAAAAAGCAC — AGGATACAGCACA 

ACATAAGGGCACCTGTTACATATGAAGT 182 8 

Qy 82 0 GAGTTCCGCTT T GC AC C AAAC C AC AC T A — 

AAGAACTGGAAGACAAAGTGATCGAGCTGC 877 

III I I I I II I I I I I II I I I I I I I 

I 

Db 182 7 GAG C AAAAC AT AC T AGC AT TTTCTATATG - CAT AAT G 

GGGAAAC 1785 

Qy 878 

ACAAGAGCCACAGAGGAATGACGCCAGCAGAAGCAGAGATGCATTTCTTGGAAAATGCC- 936 

I I I I I I I I I I III I I I II III 

I 

Db 1784 

CTGCATAGGTTAGAGGGCCTTTTACGCTCATTTAAAAATCAGGCAAGTTGTCT 1732 
Qy 937 - 

AAAAAATTATCAATGTATGGGGTAGATTTACATCATGCTAAGGACTCAGAAGGGGTAGA 995 
I I I I I I I I I I I I I I I III III III 

I I 

Db 1731 GTAACATTTTTCAATAATCTGGGAAGCACTGCA— ATCCAGTGTAC 

GA 1686 

Qy 996 

AATTATGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAA 1055 
I I I I I I I I I I I I I I I I II III 

I 

Db 1685 TGTGCTGATTAGTGTT GTCT-TTGTT GGC — 

ACTG A 1653 

Qy 1056 CAGATTTGCCTGG-C CCAAGGTT 

C T AAAG AT T T C AT AC AAAC GG AAC AAC T 1105 



Db 1652 

CAGTCTTGCATGGTCATATGCCAGTGTTTTTGCTTTAGCTTTTCTTTGAATAAAACAGAT 1593 



Qy 1106 TT 

TACATTAAGATCCGGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTA 1162 
II I I I I I I I I I II I I I I I I I I I 

I I I I I 

Db 1592 TTAAATGCATTTAGA-CAATACG TATTTGTAAGCT 

GTTTA 1554 

Qy 1163 

AGCTGCCAAACCATCGAGCTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACAT 1222

I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1553 CAT AC AC T AAAT T T AAGAGATT CAATAT 

TAGAGTTTCTTTGTT 1511 

Qy 1223 

TTTTCAGACTACTGTTACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGT 1282

I I I I I I I I III III I III III III 

I I 

Db 1510 TCTTTAAAC-AC TACAGAGTGCAAAT — CAGGTTCTTCACAAC 

AGA 146 8 

Qy 1283 

TTCGTTATAGTGGCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCC 1342

II I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 146 7 TTGAATATTG-AGCAGTTCTTTA 

AAG AAAG AG G G G G AAG AAAAAAAGC C 1420 

Qy 1343 CA— 

GCACCTTACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCTTGG 1399 

II II II I I I I I II III I II I II 

Db 1419 C AAG T G AAT AAAAC AT T G AA- AC T AT T C C — CCTTCGAAAATAAAT 

TCTAAA 1371 

Qy 1400 

ATGGAGCATCAGTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAG 1459 

III I I I I I I II II I I I I I I I I I I I I I 

I 

Db 13 7 0 ATGATGTG GAATGTGAAATAAGGTTTTAACAT-AGGTGATCCAAGTTT 

AT 1322 

Qy 146 0 AGGTTGGTACTGGC — CAGTACGCCAC- 

AAC AAAAGGC AT C T C T C AG AC C AAC T T G 1512 

I I I I II I I I I II II I I I I I I I I I I I II 

Db 1321 AGTTAGAAACAAAAAGAAGTCCTTCATGAAATAAAGGTTAC 

AAGAACACGTTGCCTG 1265 

Qy 1513 -AT C AC C AC T G T G AC T CCGGAGAAGAAGGC TGAGGAGG AG CGGGACGAGGA- 

AGAGG 156 7 

I III I I I I I I I I II II I I I I I I I I I 

I 

Db 1264 

TTTTCCCCCATTATAAACTGAGAAGTGGGTAAAGACGATGTTTCAGTACGAAAATAGGTG 12 05 

Qy 156 8 ACAAACGGAGGAAGGGGGAAGAAGTCA-CGC- 

CCATCTCGGCCATCCAGCACGAGGGAAA 1625 



Ill III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 

12 0 4 ACTACAGGA TCAGCGCTTCATCTC — ACATGCTGTACCC- 



Qy 1626 GACTGACAGTGAGCGCACGGACACCGCAGCCG- 

ACGGGGAGACCACTGCCACTGAGGAGC 1684 

I I I I I I I III I I I I I I I I I I III 

I I 

Db 116 4 GCCAGGCTGTG GCTGTCCATACGGTGGTGCGGTATAACCATAACCA- 

AAAGGTGC 1111 

Qy 1685 

TAGAAAAAACTCAAGATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAA 1744

II I I I I I I I I I I III 

Db 1110 CTGGGGAGGGACGG CAACA CTG- 

GGT 1086 

Qy 17 45 CCTTCTTAGAAACCTCAACAGACACTG-CCGTAACGAATGAAT — 

G G G AG AAG AG G C T — 1799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1085 CCTGC- 

TGTCAGCATCAACTGGGGCTGACCATAAACAATGGGTTGTGTCTCTGTAGCTTG 102 7 

Qy 1800 TTCCACCTC CCCCGTGCGACTG — 

GCCGCCAGGCAGGAGGATGCCCCCATGATC — 1851 

I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1026 TTCTTCTTCTTTTCTCAGTGATTCTGAAGCATCTA 

ATTTATCCACCTTTGTCAA 9 73 



Qy 1852 GAACCACT 

TGTCCCTGAAGAGAAAATGGAAACCAAGACGGAGTCCAGTGGATAGAGA 1908

III II I I I I II I I I I I I I I I I I I I I I I III 

I 

Db 9 72 GTACTCCTTCATGACCTGGATGA-AATAGGG 

CATGGCAAAATCCATGATATTGTG- 919 

Qy 19 09 CGGAACCCACCGTGCACCACCT GCCGCTTAGCACTGAGAAGGTGGT- 

GCAGGAGA 1962 

I I I I I I I I II II I I I I I I I I I I I I I I I I 

I 

Db 918 CCTCCATGCAGTTTCTAGGACGACATCTGGC- 

CTTAAAAGATCGTAACAGGTAA 866 

Qy 1963 CCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACGCGAGTGGGGATGCTTCTTACT — 

CGGC 2020 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 865 AC AGACAAGCTCC AAAGCACTC 

TCTTTTTTCTTCCTGC 828 

Qy 2 021 GGGAG — 

ACAGCGGGGATGCTGCAGCACAGCCCGCATTCACAGGCATTAAAGGGAAAGAG 2 0 7 8 

I I I I I I I I I I I I I I I I I I I I I III II 

I I I 

Db 82 7 AAAAACCACTGCAGGAGTTCTTCAGC-CA ATTCAGTATCTTT — 

AGATTCAGAA 777 



Qy 2079 GGCT- 

CTGCTTGACGGAGGGGGCTAAAGAGGAAGGAGGGGAGGAGGTCGCTAAAGCTGTC 213 7 



Db 7 76 GCATACTGCATTGCATC CTT 

GTAAAGGCTGTC 745 

Qy 213 8 C TGGAAC AGGAAGAGAC AGC -CGC TGCTTCCCGT G A — 

GCGACAAGAGGAGCAGAGT 2191 

I I I I I I I I I I I I I I I I I I I I 

Db 744 TTTCTTGCACAG 

CTCTACACTCTGTTTCCAGCGATTGTTGCCTTTGAAGAGATAA 690 



Qy 2192 

GCAGCCATCCACATTTCAGAAACTTTGGAACAAAAACCTCATTTTGAGTCCTCAACGGTG 2251
I I I I I I I I I I I I I I III I III III 

I I 

Db 6 89 GCAGCAATTCTCCT GAACT CAATGAGTTCATGTT-TTTCC- 

AAACGCTG 6 43 



Qy 2 2 52 AAGACGGAAACCATCAGTTTTGGCAGTGTT- 

TCACCGGGAGGAGTAAAGCTAGAAATTTC 2310 

I I I I I I I I I I I I I I I I I I I 

I 

Db 6 42 A GCAAGCGA— GATATTGTCAAAGTTGTCATAAG- 

CATCTATTGATGTT 59 7 



Qy 2311 C AC AAG AAG T GC C AG TAG T T C - - 

ACACCGAAACCAAAACCATCACATATGAATCATCACA 236 8 

II I I I I I I II I I I I I I I I 

Db 596 CGC AG AG C C T G AT AAT CTTCTTCTG T AAT AAAAA- 

GATTGTTCAAT 552 



Qy 2369 GGTCGATCCAGGCACAGATCTGGAGCCAGGCGTGCTGA — 

T GAG T GC AC AG AC GAT C AC A 2 426 

I I I I I I I I I I I I I I I I I I I I I I I 

Db 551 GATTCATTCACAGATTTGTTGTTATG- 

GTTCTGAACTGAACGCAAATACGGT 501 



Qy 2427 

TCTGAAACCACCAGTACCACCACCACTACCCACATCACCAAAACTGTGAAAGGGGGCATT 2486
I I I I I I I I I I I I I I 

I I 

Db 50 0 TTCACCAGTG GTAGCTGTTTAA- 

CCTT 475 

Qy 2487 

TCAGAGACAAGAATTGAGAAGCGAATAGTCATCACGGGGGATGCAGACATTGACCATGAC 2546
III I I I I I I I I I I I I I I I 

I I I 

Db 474 GCTGA AATAATTGA CTGCACGAGTG 

TGAT 4 46 



Qy 2547 CAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAGAG-CAGCACCCTGA- 

CATGTCAGTGA 26 0 4 

I III II I I I I I I I I I I I I 

I 

Db 4 45 C CAA 

C C G T GG AG AC AGC AC CAT C AGC AAATC AT T T A 410 



Qy 2 6 05 C C AAAG TAG T GG T C C AT AAAG AG AC - 

AGAGATCACACCAGAAGATGGAGAGGATTGACCA 2663 



Db 409 ACA ACAGAGGCTTGAATTCT 

AAGTAGAACTGTATTGCTCT 3 70 

Qy 2664 

GAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAAACCGTTAGGAAAACCAGAGCC 2723
I I I I I I I I I I I I III III I I I I I I 

Db 369 GTAGTATAG TTCCACAT — TGGCAACCTTG 

GTAATGATATC 331 



Qy 2 72 4 TATATGGAGTTCCCTCTTCTAACCCAACTGTACTTGTATCTGTC — 

CGTGGAAAATTT — 2779 

I I I I I I I I I I I I II I I I I I I I II 
Db 330 — TTTGAATTGCC — 

CTTCTTTCCAGGCATCAGTTGGATGATTCATCATGGTAATTATGG 2 75 



Qy 2 78 0 CAGTC- 

C AG AAG AAT T G AC C T T G AC CAT T AAT AAAG AC AC TGGC AGAG AG AT CTTCCCAT 2 83 8 

III III I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 2 74 CATTATCATATTCTTCATACTTGTC ATACAAAAACAC C 

AGTTCTGCCCAA 225 



Qy 2839 

AATAAAGCAATCTGATTCAGCATCACTAAACCGATAATGCATGAAGCAACGATAAAATTA 2898

I II II I I I I I I I I I I I 

Db 22 4 AGATGAG 

CTTGTTCTGCAGCTCTTAGC 19 8 



Qy 2899 

CAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTCAGTTTTCAGCTATACCTGCTCGT 2958 
III I I I I I I I I I I I I I 

Db 19 7 ACCTTGGGAATATTCACT 

CTAGACCAG 171 



Qy 2959 

TCATAACCAACAATATAAACCGTGGTCTCATGTAACACATAAACAATTCATGCCTTTCAT 3018
I I I I I I I I I I I I I I 

Db 170 AACAGC TCCAGGTGCTC CCTCAT 

TTTCTG 142 



Qy 3019 

AGTTTATTATTATTAAAGTCTAAACAAAATTGCAATTTCTTAGGTAACCTTATATTTACA 3078
II III I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 141 AG— GCTTAAATTTAGAGTAT AGAAT AGC T AAT T C AGTAAAC 

ATTCCC 96 



Qy 3079 ATAAATGAAGATTACCCTCAAATGCTAGAAGCTGTCTAGGTCCGTCC-

GGTGTGTCAGAT 3137

II II I I I I I I I I I I I I I I I I III I I I I I 

Db 95 AT GTGAGCTCGCTCAAGTCCCAG-TGCTG-CT TCCAACATGGTG- 

ATCAGC- 48



Qy 3138 TTCCTCAGATTAGATGTGCCAATAACCAAGTT-TATTCAGTAAACAACTTGT 3188
I I I I I I I I I I I I III I II I I I I I II II 

Db 47 -TCTTCAAAATA GCCACGATCCTGATAGTAGCCAGTAGGCCAACAGT 2
US-10-478-914-64

; Sequence 64, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294 

; PRIOR FILING DATE: 2001-05-30

; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64

; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens 
US-10-478-914-64 

Query Match 28.1%; Score 5468.5; DB 1; Length 2120; 

Best Local Similarity 41.6%; Pred. No. 0; 

Matches 1334; Conservative 0; Mismatches 710; Indels 1161; 
Gaps 198;

Qy 3 GGCGCGCCCGAGCCTAGTCCCCACGCCGCGGCGCGCC- 

CGGGCTCCCTGCTGATCCCAGA 61 



Db 3 GGCCTGTACG-GAAGTGTTACTTCTGCTCTAAAAGCTGCGGAATTCCT 

CGAGC 54 

Qy 62 AC AAT C AAC CAT G AC G AC C GAA — 

TCTGGATCAGACTCGGAATCCAAGCCGGACCAGGAG 119 

II I II I I I I I I I I I I I I I I I I III 

Db 55 ACTGTTGGCCT ACTGGAGTGCTGAAGTAGGCGCGGA 

CGTGCC 96 

Qy 120 

GCCGAGCCCCAGGAGGCGGCGGGGGCGCAGGGGCGGCGGGGGCCGTGCCGGAGCCGCCCA 179 



Db 9 7 -CGGTGCC TGGC-GCGTGGTAGCAGGCGC- 

CCGGTGCCCCGGCCGGCGAAGACCA 148 

Qy 180 AGGAGGAGCAGCAGCAGGCCCTGGAGCAGTTCGCCGCC- 

GCTGCAGCGCACAGCACCCC- 23 7 

III I I I I I I I I I I I I I I I I I I I I I I 

Db 149 TGGCGTTCATGGTGAAGACCATGGTG GGCGGCCAGCTGAAGAAC 

CTCACTG 199 

Qy 238 

GGTGCGAGGGAGGTCACTGACAAGGAACAGGAGTTTGCTGCCAGGGCTGCAAAACAGCTC 297



Db 200 GGAGCCTGGGAGG CGGCGAGGATAAGGGAGATGGGGACAAGTCGG- 



CAGC-C 2 49 



Qy 298 

GAATATCAGCAATTAGAAGACGATAAACTTTCTCAGAAATCATCTAGCAGTAAACTCTCT 357

I I I I I I I I I I I I I II I I I I I I 

Db 250 GAAGCTCAG GGC-ATGAGC CGGG 

AGGAGTA 2 78 

Qy 358 

CGGTCTCCATTAAAGATTGTCAAAAAGCCTAAAAGCATGCAGTGCAAAGTGATACTTCTC 417 

II I I I I I I I I I I I I I I I I I I I I 

Db 2 79 CG AGGAGTATC AGAAGC AAC T C G T G - 

GAAGAGA A 311 

Qy 418 

GATGGATCAGAATATACCTGTGATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTTTGAT 477

I I I I I I I I I I II II I I I I I I I I I I I I I I 

Db 312 GATGGAGCGGGATGCACAGTTCACACAGAGGAAGGC AGAGCGG 

GCC 357 

Qy 478 AAAGTGTGTGAACACTTGAACTTGCTAGAGAAAGACTACTTTGGGCTTAC — 

G TAT C GAG 535 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 358 ACACTGCG-GAGC C AC T T C C G AG AC AAAT AC — C 

GGCTACCCAAGAACGAG 405 

Qy 536 A-TGCTGAAAACCAGAAGAATTGGTTGGACCCTGCTAAGGAAATAAAAAAACAGGT- 

TCG 593 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 406 ACAGATGAGAGCCAGAT CCAGAT — GG 

CAGGTGGAG 439 

Qy 594 

AAGTGGTGCTTGGCACTTTTCATTTAATGTGAAATTTTATCCACCAGACCCTGCCCAACT 653 

I I I I I I I I I I I I I I I I I I 

I 

Db 440 ACGTGGAGCT 

GCCCCGGGAGCTGGCCAAGAT 470 

Qy 654 

ATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGCGAGATGACATCGTGTCC — 711 

I I I I III II III I I I I I I I I I I I I I 

Db 471 GATCGAGGAGGACACAGAGGA GGAGGAG - G AG AAG G C C T C - 

AGTCCTT 516 

Qy 712 GGAAGGCTGCCCTGCTCCTTTGTTACCCTGGCCTTG CTGGGCTC-CT 

ACACT 762 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 517 GGGCAGCTGGCCAG — CCTT CCTGG- 

CTTGAACCTGGGCTCACTCAAGGACAAG 56 7 

Qy 763 GTCCAGTCAGAGCTCG GAGA- CTATGACC — 

CAGATGAATGTGGGAGCGATTACATTA-G 818 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 56 8 GCCCAGGCCACACTGGGGGATCTCAAGCAATCAGCTGA GAAGTG- 

TCACGTCATG 621 



Qy 819 TGAGTTCCGCTTTGCACC 

AAACCACACTAAAGAACTGGAAGACAAAGTGATCGAGC 87 4 

III I I I I I III I I I I I I I I I I I I II III 

I I 

Db 622 TGA CCACTT— CCCCGGGGTTACC-CACT GGGCTGG — GCCCCCATGA — 

GGGC 66 8 

Qy 875 

TGCACAAGAGCCACAGAGGAATGACGCCAGCAGAAGCAGAGATGCATTTCTTGGAAAATG 934
I I I I I I I I I I I I I I I I I I I I I 

Db 669 T AAGA GTGTGTCAACTTC CAGGGACCCA- 

TACT 70 0 

Qy 935 CCAAAAAATTATCAATGTATGGGG- 

TAGATTTACATCATGCTAAGGACTCAGAAGGGGTA 993 

III I I I I I I I I I I I I I I I III 

Db 701 CCA TTTGGGGCTTTGTTTCC — CTTGC 

CCCA 72 9 

Qy 994 

GAAATTATGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATA 1053 

I I I I I II II I I I I II III 

Db 73 0 TCCTAGT — TCC-AAGACCTTTCCCATCC — 

ATGC 759 

Qy 1054 

AACAGATTTGCCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATT 1113 
I II I I I I III I I I I I I I 

Db 76 0 — C CCAAGCCTA 

TCTTCTGGTTTCTTCCT 786 

Qy 1114 

AAGATCCGGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAAC 1173 

I I I I I I I I I I III I I I I I III III 

I 

Db 78 7 CTCC-GCTGGGAG TAAAGTCCCCATC 

TTCACTCTAC C 822 

Qy 1174 CATCGAG 

CTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGA 123 0 

III I I I I I I I I II I I I I I I I I I I III 

I I 

Db 823 CTTCAGGACCCTCCCCACC AGCTCAGCCTGTGGAG- 

GCCTCCCAAGATTGTAGGA 8 76 

Qy 1231 

CTACTGTTACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTTAT 1290

I I I I I I I I I I II I I I I I I I I I I I I I 

Db 8 77 AT AGGCCCATCCC TCTCTGGCCAT-GGCCCCAAG- 

TTC 912 

Qy 1291 

AGTGGCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCACCT 1350

I I I I I I I I I I I I I I I I I I I I I I 

Db 913 CTGCACACAGGAGCACCCACAGAGAGACA 

CACACA 9 47 

Qy 1351 

TACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCTTGGATGGAGCATCA 1410 
I I I I I I I I I I I I I 



1 1 1 

Db 9 48 GGACACAAAAC CCCTGGCACG 

TTCA 972 

Qy 1411 

GTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACT 1470
I I I I I I I I I I I I I I I I I 

Db 9 73 GAG AC AG AAGC C AC AG AC AC AT C C 

CGGCA 1001 

Qy 1471 

GGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCG 1530
III I I I I I I I I I I I III 

I I I I 

Db 10 02 C AGAC AGAC AC -ACACGAGGC CAG 

CTCC- 1028 

Qy 1531 

GAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAGAA 1590
II III 

Db 1029 CT 

TGCG T 1035 

Qy 1591 GTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGC — 

GCACGGACA 16 48 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1036 GTCCAGCCCCTC-CAGACA-CCACCACTC- 

AG AAAC T C T G AG AG AG AGC ATGGGC AGAC A 1092 

Qy 1649 - 

CCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCTG 17 0 7 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 1093 CCCTCAGCAGAC-AGGAGGCCT GAGTT 

CCAG 1122 

Qy 1708 

ATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCTCAACAGAC 1767

II I I I I I I I I II I I I I I I I I I III I 

I I 

Db 1123 TC — TCCACCTTTATTGTTCTTGAAAGCCCCTGCT CTC- 

TCTGA- 1163 

Qy 1768 

ACTGCCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCCCCGTGCGACTGGCCGCC 1827
I I I I I I I I I I I I III 

Db 1164 GCCTTA TTTCATCATC 

TGTAA 1184 

Qy 1828 

AGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAAATGGAAACCAAG 1887

I I I I I I I I I I I I I I I I I 

I I I 

Db 1185 AATGGGAATGTCC TGAATG ACTT 

CTAAG 1212 

Qy 1888 ACGGAGTCCAGTGGATAGA 

GACGGAACCCACCGTGCACCACCTGCCGCTTAGCACTG 1944 

I II I I I I I I I I I I I I I I I I I I I I I I 



Db 1213 GCTCTTTC TGGCTTGAACTGTCAGAGCCAAGC CCACATCCCTC 

CT- 1257 

Qy 1945 

AGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACGCGAGTGGGG 2004
I I I I I I I I I I I I I I III 

Db 1258 TGG-GCAGG GCAGCAGC 

TGC 1276 

Qy 2005 

ATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGCACAGCCCGCATTCACAGGCA 2064

I I I I III I I I I I I I I I I III 

Db 12 77 -TGCCACAGCCTC CAGCG GCTGC 

CAC 13 01 

Qy 2 065 TTAAAGGGAAAGAGGGCTCTGCTTGACGGAG- 

GGGGC T AAAGAGGAAGGAGGGGAGGAGG 212 3 

I I I I I I I I I I I I I I I I III I I I I I I 

I I I 

Db 13 02 TGTGGGCTCTGGGAGCCGGAGCGATGCT 

GTGTGAGAGGCAGAGT 13 45 

Qy 2124 TC 

GCTAAAGCTGTCCTGGAACAGGAAGAGACAGCCGCTGCTTCCCGTGAGCGACAA 2179 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 13 46 GCCAAGGATGAAGCTGGCACTGAACAGTAAGCGGCTCCAG— GCCTCCTCTGGGC- 

C— 1399 

Qy 2180 

GAGGAGCAGAGTGCAGCCATCCACATTTCAGAAACTTTGGAACAAAAACCTCATTTTGAG 2239 



Db 140 0 CAGGGCCCAGCCAATTTCTGTTCTGTTCCTGTAGAACG- 

TCTGGAT 1447 

Qy 22 40 TCCTCAACGGTGAAGACGGAAAC- 

CATCAGTTTTGGCAGTGTTTCACCGGGAGGAGTAAA 229 8 



Db 1448 TCCATAGC TGGAATCTCCTC — TCTTAGC TC 

AGTGAA 1482 

Qy 2299 

GCTAGAAATTTCCACAAGAAGTGCCAGTAGTTCACACCGAAACCAAAACCATCACATATG 2358

I I I I I I I I I I I I I I I I I I I I I 

Db 14 83 AAATAAAAATCCCAAATGGTGTGCC 

TACCTTCCCA 1517 

Qy 2359 

AATCATCACAGGTCGATCCAGGCACAGATCTGGAGCCAGGCGTGCTGATGAGTGCACAGA 2418

III I I I I I I I I I I I I II I I I I I I 

I I 

Db 1518 CTTC TTACTGGCTT CCAGGAGTCTTG — 

GAGTTCATAGC 155 4 

Qy 2 419 CGATCACATCTGAAACC — ACCAGTACCACCACCACTACCCACAT- 

CACCAAAACTG— T 2473 

I I III II I I I I I I I I I I I I I I I I I III 

Db 1555 CCCCCGAGCCTG— 



CCTTAAAGGGGTGTCCTCCACCCCCCACCTACAGCTTCACAGGAG 1611 



Qy 2 474 G AAAG GGGGCATTT C AG AG AC AAG AAT T G AG AAGC G AAT 

AGTCATCACGGG — 2 52 4 

I I I I I I I I I I I I I ! I I I I I I I I I I II II 

Db 1612 GGGAGAGGGCA-TCCAGTGCTAGGAGT — 

AGAAGTGTCTCCAGCTCTGTTCTCTTGGGGC 1668 

Qy 2 52 5 GGATGCAGACATTGACCATGAC- 

CAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAG 2580 

I I I I I I II II II I I I I I I I I I I I I I 

Db 1669 CCTGGGTGAAGGTGGGGTCTGGGGCTTATGAAATAGGTCTGGGCTTTGAGGAGG 



Qy 2581 AGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAG — 

AC AG AG AT C AC 2 63 8 

I I I I I I II I I I I I III I I I I I II I I I I 

I I I 

Db 172 7 AGCAGCCTCAT — TATGTGGG 

GAAGATGGGGCCTCTGGGGCGTCACTGAGACCAC 1779 

Qy 2639 

ACCAGAAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCAT 2698

I I I I I I I I I I II I III II 

I 

Db 178 0 A GGTGGGG CCGGGG CTGGAC CGCA 

GCTGTCTT 1811 

Qy 2699 

GCAAACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAACCCAACTGTACTT 2758
I II I I I I I I I I I I I I I I I I I I I 

Db 1812 G GG — TGCCTGTGCCT GCACCC- 

CTCCTCACCC 1841 

Qy 2759 

GTATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATTAATAAAGACAC 2818

I I I I I I I I I I II II I I I I 

Db 18 42 TAGAGA CGGAAGATGTG 

CAAAAAGAAAGA 18 70 

Qy 2819 

TGGCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAAACCGATAATGC 2878
II II I I I I I I I I I I I II I 

Db 18 71 AGGAAGGGCAAC TGCATTC 

CAGCCCCAC 189 8 

Qy 2879 

ATGAAGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTCAGT 2938
I I II I I I I I I I I I I I I I 

Db 1899 ACTGTGATGACTTT GAGCC TGTCCTTTCC 

CTCC— 1931 



Qy 2939 

TTTCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACAT 2998

I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I I 

Db 1932 -TTGAGCCTGTCTTGCTTGTCC CCTGTAAAATGAACAGT — 

CCCCCTCTCCCCCAA 1984 



Qy 2999 A AACAATTCATGCCTTTCATA-GTTTATTATTATTAAAGTCTA-AACAAA — 

ATTG 3050 



I I 



Db 19 85 ATAGTAATAATACATG — 

TTTCAAAGGGTGACCATTTATAAAGCATATGACAAACCATAT 2 0 42 



Qy 3 051 CAATTTCTTAGGTAAC- 

CTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGAAG 3109 



Db 2 0 43 CAATAAAT GTAACTCATTCTTTTAAAAAAAAAAAAAAAAGGCCACATGTGCTCG- 

AG 2098 



Qy 3110 CTGTCTAGGTC-CGTCCGGTGTGTC 3133

III I I I I I I I I I I I II
Db 2099 CTG--CAGGTCGCGGCCGCT-AGAC 2120
Qy 29 CGCGGCGCGCC—

CGGGCTCCCTGCTGATCCCAGAACAATCAACCATGACGACCGAATCT 86



Db 1 CACTGTTGGCCTACTGGCTACTATCAGGATCGTGG- 

CTATTTTGAAGAGCTGATC- 54



Qy 87

GGATCAGACTCGGAATCCAAGCCGGACCAGGAGGCCGAGCCCCAGGAGGCGGCGGGGGCG 146



Db 55 
ACATGGGAA 100 




I I I 



Qy 147 CAGGGGCGGCGGGGGCCGTGCCGGAGCC 

GCCCAAGGAGGAGCAGCAGCAGG 197

II I I I I II I I I I I I I 

Db 101 

TGTTTACTGAATTAGCTATTCTATACTCTAAATTTAAGCCTCAGAAAATGAGGGAGCA — 15 8 



Qy 198 

CCCTGGAGCAGTTCGCCGCCGCTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCACTGA 257
I I I I I I I I I I I I I I I I I I I 

Db 159 -CCTGGAGCTGTT CTGGT- 

C TAG AG T G AAT AT T C C 191 



Qy 258 CAAGGAACAGGAGTTTGCTGCCAGGGCTGCAAAAC- 

AGCTCGAATATCAGCAATTAGAAG 316 

I I I I I I I I I I I I I I I I I I I I I I I I I II III 

I 

Db 192 CAAGG TGCT — AAGAGCTGCAGAACAAGCTCATCTTTGGGCAG- 

AACTG 237 



Qy 317 

ACGATAAACTTTCTCAGAAATCATCTAGCAGTAAACTCTCTCGGTCTCCATTAAAGATTG 376
II I I I I III I I I I I I I I I I 

Db 23 8 GTGTTTTTGTATGACAAGTATGAAGAATATGATAATGC- 

CATAATTACCATGATGAATCA 296 



Qy 3 77 TCAAA AAGCCT— 

AAAAGCATGCAGTGCAAAGTGATACTTCTCGATGGATCAGAATA 431 

I I I I I I I I I II I I I I I I I I I I II II III 

Db 297 

T C C AAC TGATGCCTG G AAAG AAG G G C AAT T C AAAG AT AT CAT T AC CAAGG T 3 4 7 



Qy 432 

TACCTGTGATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACA 491 

III I I I I I I I I I I I I I I I I I I I I I I I II I 

I 

Db 3 48 TGCC AAT G T GG AAC TAT AC T AC AG AG — C AA- 

T AC AG T T C T AC T TAG AAT T C AAGC C 401 



Qy 492 

CTTGAACTTGCTAGAGAAAGACTACTTTGGGCTTACGTATCGAGATGCTGAAAACCAGAA 551
II I I I I I II III I I I I I I I I I I 

Db 402 TCTG TTGTTA AATGATTT— GCTGA TGGTGCTG- 

TCTCCA 43 8 



Qy 552 

GAATTGGTTGGACCCTGCTAAGGAAATAAAAAAACAGGTTCGAAGTGGTGCTTGGCACTT 611 

I I I I I I I I II I I I I I I III III I II 

I 

Db 439 CGGTTGGATCACACTCGTGCAGTCAATTAT TTCAGCAAGGT — 

TAAACAGCT 48 8 

Qy 612 TTCATTTAATGTGAAATTTTATCCAC CAGACCCTGCCCAAC- 

TATCTGAAGA 662 

III I I I I I I III I I I I I I I I I I I I I I I I 

I 

Db 489 ACCACT 

GGTGAAACCGTATTTGCGTTCAGTTCAGAACCATAACAACAAATCTGTGAA 5 45 



Qy 663 TATCACCAGGTACTA- 

CCTCTGCTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGC 721 



I I I 1 1 I I I I 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 

1 1 1 1 

Db 5 46 T G AAT C AT TGAAC AATC T T T T TAT T AC AGAAG- AAGAT TATCAGGCT — 

CTGC 595 

Qy 722 CCTGCTCCTTTGTTACCCTGGCCTTGCTGGGCTCCTACACTGTCCAG 

TCAGAGCT 7 76 

I I I I I I II II I I I I I I I I I I I I 

I 

Db 596 G AAC AT C AAT AGATGC T T ATGAC 

AACTTTGACAATATCTCGCTTGCTCAGCGTT 649 

Qy 777 CGGAGACT- 

ATGACCCAGATGAATGTGGGAGCGATTACATTAGTGAGTTCCGCTTTGCAC 835 

I I I I I I I I I I I I I MM I I I M M I 

I 

Db 650 T G G AAAAAC AT G AAC T C AT T GAG T T C AGG AG 

AATTGCTGCTTATCTC 696 

Qy 836 — CAAA — 

C C AC AC T AAAG AAC T GG AAG AC AAAG T GAT C G AGC TGC AC AAG AGC C AC AG A 891 

MM I M I I I I I I I I I M M M I M M M M M I 

Db 69 7 TTCAAAGGCAACAAT CGCTGGAA-ACAGAGTG- 

TAGAGCTGTGCAAGA 7 42 

Qy 892 GGAATGACGCCAGCAGAAGCAGAGATGCATTTCTTGGAAAATGC 

CAAAAAAT 9 43 

I I M M II I I I I I I I I I MM MM 

I I 

Db 7 43 AAGACAGCCTTTACAAGGATGCAATGC 

AGTATGCTTCTGAATCTAAAGAT 792 

Qy 944 

TATCAATGTATGGGGTAGATTTACATCATGCTAAGGACTCAGAAGGGGTAGAAATTATGT 1003

I M M M I M I I M M I M I II I I I I I I I I 

Db 793 ACTGAATTGGCT-GAAGAAC-TCCTGCAGTGGTTTTTGCA — 

G G AAG AAAAAAG AG 8 44 

Qy 1004 

TAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTG 1063 
I I I I I I I I I I I I I I I II I I I I 

I I 

Db 8 45 AGTGCTTTGGAGC TTGTCTGTTTACCTGTTACGATC 



Qy 1064 

CCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCCGGC 1123 

I I I I I I I I I I I I III 

I I 

Db 884 TAAGGCC AGATGTCGTCC 

TAGAAACTGC 911 

Qy 1124 

CGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCATCGAGCTG 1183 

I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 912 ATGGAG GC AC AAT AT CA-TGGATTT 

TGCCATGCCCTATTTCAT 953 



Qy 1184 CCAAGCGTTTATGGAAAGTATG TGT TGAGC AT CAT AC AT T T T T C AG AC T - 

ACTGTT 1238 



1 1 1 I I I 1 1 1 II III I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

Db 954 

CCAGGTCATGAAGGAGTACTTGACAAAGGTGGATAAATTAGATGCTTCAGAATCACTGAG 1013 
Qy 1239 ACCAGAAGC — 

ACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTTATAGTGGC 12 96 

I I I I I I II I I I I I I I I I I II I I I I I I 

I 

Db 1014 AAAAG AAG AAGAAC AAGC T AC AGAGAC AC AAC C CATTGT 

TTATGGT — C 1060 

Qy 129 7 AGGACACAAG CGCAAACGAGAAGAGCCAGTGCGTTG — 

ATAGATCGCCCAGCACCT 135 0 

II I I I I II II I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 1061 AG — C C C C AG T T GATGC TGAC AGC AGGACC C AG T — 

GTTGCCGTCCCTCCCCAGGCACCT 1116 

Qy 1351 

TACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCTTGGATGGAGCATCA 1410 

I I I I I I I I I I I I I I I III I I I I III 

I I 

Db 1117 TTTG GTTATGGTTATACC — GCACCACCGTATGGAC — 

AGCCACA 1157 

Qy 1411 

GTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACT 1470
I III I I I I 

I I I I I 

Db 1158 G CCTGG CTTTG 

GGTAC- 1173 

Qy 1471 

GGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCG 1530

I I I I I II II I I I I I I I I I I I I I 

I 

Db 1174 AGCATGTGAGATGAAGC GCTGATC-CTG 

TAGTCACCTATTTTCGTACT 122 0 

Qy 1531 

GAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAGAA 1590

III III 

Db 1221 

GAAACATCGTCT 1232 

Qy 1591 

GTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGCGCACGGACACC 1650

I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

I I I 

Db 1233 -TTACCCACTTCTCAG T T T AT AATGGGGGAAA- AC AGGC AAC GTGTTCTTGTA- 

ACC 1286 

Qy 1651 

GCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCTGATG 1710 

I I I I I I II II I I I I I I I I I 

I I 

Db 12 87 TTTATTTCATGAAGGACTTCT TTTTGTTTCT AACT ATAAACT — 

TG 1330 



Qy 1711

AAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCTCAACAGACACT 1770



Db 1331 GATC ACCTATGTT AAAACCTTATT- 

TCACATTCCACA 1366 



Qy 1771 GC C G T AAC G AAT G AATGGGAGAAGAGGC TTTCCA 

CCTCCCCCGTGCGACT 1820 

I I I I I I I I I I I I I I I I I I III III 

I 

Db 1367 

TCATTTTAGAATTTATTTTCGAAGGGGAATAGTTTCAATGTTTTATTCACTTGGGCTTTT 142 6 

Qy 1821 GGCCGCCAGGCAGGAGGATGCCCC CAT GAT CG AACC ACT TGTCC 

CTGAA 1869 

II I I I I I I I I I I I I I III II 

Db 1427 TTTCTTC 

CCCCTCTTTCTTTAAAGAACTGCTCAATATTCAATCTGTT 14 73 

Qy 18 70 GAGAAAATGGAAACCAAGACGGAGTC — 

C AG T GG AT AG AG AC GG AAC C C AC C G T GC AC C A 192 7 

I I I I I I I I I I I I I I I I I I I I I I II II 

I 

Db 1474 GTGAA GAACCTGATTTGCACTCTGTAGTGTTTAAAGA 

AAC AAAG AAAC T 152 2 

Qy 1928 

CCTGCCGCTTAGCACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGT 1987

I I I I I I I I I I I I I I I I II I II 

I 

Db 1523 CTAATATTGAATCTCTTA-AATTTAGTGTATGTAAACAGCTT 

ACAAATACGTAT 1575 

Qy 1988 

GGTGCACGCGAGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGCACA 2047

II II I I I I I I I I I I III I I I I I I I I 
Db 1576 TGT CTA AATGCATTTAAATCTGTTTTATTCA- 

AAGAAAAGC T AAAGC A — 1622 

Qy 2048 

GCCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGGGGCTAAAGAG 2107

I I I I I I I I I III III I I I I I I I I I 

I 

Db 1623 AAAACACTGGCAT 

AT G AC C AT GC AAG AC TGT C AG T GC C AAC AAA 1666 

Qy 2108 G AAG GAGGGGAGGAGGTCGC T AA — 

AGCTGTCCTG G AAC AG G AAG AG AC AG CCGCTGCTT 2165 



I I I I I 

Db 166 7 GAC AAC AC T AAT C AG C — ACATCGTAC ACTGGATTGCAG- 

TGCTT 1708 

Qy 2166 CCC— 

GTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCACATTTCAGAAACTTTGGAACA 2 2 23 

III II I I I I I I I I I III II I I I I II 

I 

Db 1709 CCCAGATTATT G AAAAAT G T T AC AG A CAACTTGCCTGATTTTTAAAT 

GAGCG 176 0 



Qy 2224 

AAAACCTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCAGTTTTGGCAGTGTTTC 2283

III I I I I I I I I I I I I I I 



Db 1761 TAAA AGGCCCTC TAACC TATGCAG- 

GTTTC 1789 

Qy 2284 

ACCGGGAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGCCAGTAGTTCACACCGAAACCA 2343
II II II I I I I I I I I I I II I I I I I 

Db 179 0 CCCATTATGCATA TAGAAAATGCTA 

GTATGTTTTGCTCAC 1829 

Qy 2344 

AAACCATCACATATGAATCATCACAGGTCGATCCAGGCACAGATCTGGAGCCAGGCGTGC 2403

I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 183 0 TTCATATG TAACAGGT GCCCTTAT 

GTTGTGC 1860 

Qy 2404 

TGATGAGTGCACAGACGATCACATCTGAAACCACCAGTACCACCACCACTACCCACATCA 2463
II I I I I 

Db 1861 TGT 

ATCCTGT G 1871 

Qy 2464 

CCAAAACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAATAGTCATCACGG 2523

I I I I I I I I I III I I I I I I I I I I I I I I I I 

Db 18 72 CTTTTTCTGT GGGACCATTC CATTCAGGAGCAAAGAG- 

CACCAT— 1914 

Qy 2524 

GGGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAGAGC 2583
I I I I I III II III III I II 

Db 1915 — GATTCCAATCTTG TGT GTGTTTACTAA — 

CCCTT 1946 

Qy 2584 

AGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGACAGAGATCACACCAG 2643

I I I I I III III III III I II I I I I I I I 

Db 1947 CCCTG — AGGTTTGTG TATGT — 

TGGATATTGTGGTGTTTTAGATCACT 1991 

Qy 2644 

AAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAAA 2703

I I I I I II I I I I I I I I III I I I I I I III 

Db 1992 G AG T G T AC AG AAG AG AG AAAT T 

CAAACAAAATATTGCTGTTCTTCA — 2 03 7 

Qy 2 70 4 CCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAACCCAACTGTAC 

TTGT 2760 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2 03 8 — GTTTTG TTTGTGGAATT 

T G AAAT T AC TC AAAT TTAAAAT AAAT 2080 

Qy 2761 ATCT- 

G T C C G T G G AAAAT T T C AG T C C AG AAG AAT T G AC C T T G AC CAT T AAT AAAG AC AC T 2 819 

I I I I I I I I I I I I II I I I I I I I I II I I I I I I II 



I 

Db 2081 TACTGGACTGTGG-AAATAACA TAGAATTGAAGTT- 

TTAATTAAATACCACTCA 2132 



Qy 2 82 0 GGCAGAGAGATCTTCC C AT AAT AAAGC AAT C T 

GATTCAGCATCACTAAACCGATAA 2 8 75 

I I I I I I I I I I II I I I I I I I I I I 

Db 2133 

AAC G AAAAG AAC AG TAG T T T T T G TAG T T T T AT AT TGGAT AC TGAGGC AT T AGGG AG 218 8 

Qy 2876 

TGCATGAAGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTC 2935

I I I I I I I III I I I I I III I I II II I I I I 
Db 2189 -GCATGAAAGGAAGA- 

GGAATGAGGATTGAGACATGTGAAGACATTGTGCATTATATCAA 2 2 46 

Qy 2936 

AGTTTTCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACA 2995

II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I 

I I 

Db 22 47 TGT GC-ATTCCTG-TAGTTCATTAACAA- 

GGTACATGCAATAGTCTAAAGAACCA 229 8 

Qy 2996 

CATAAACAATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAAATTGCAATT 3055

I I I I III I I I I I III I I I I I I I I I I 



-GAGTCACTACTATAGTGGCTTAACATTTAATCTGTCT — CCAATATT- 



Qy 3 056 T C T T AG G T AAC -CTTATATT T AC AAT AAAT G AAG - 

ATTACCCTCAAATGCTAGAAGCTGT 3113 

I I I I I I III II I I I I I I I I I I I I I I I II 

I 

Db 2 3 46 AACCAAGTGACACCGGGGTTT 

TTATCGAAGCATTTCACTTAAATGAACAAA T 239 7 

Qy 3114 

CTAGGTCCGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAAGTTTATT 3173 

I II I I I I I I I I II I I I I I I I I I I I I 

Db 2398 CATGG C TGTTATATTAACTTGAAATA- 

AAATATATTTAAACATGTAAAAA 2 4 46 

Qy 3174 CAGTAAA 

CAACTTGTACTTGTTTCATCTGGTTTATTACTCTCACCCATAAACA 3226
I III I I I I I I I I I I I I I 

I I 

Db 2 447 AAAAAAAAAAAAGGCCACATGTGCTCG AGCTG 

CA 2480 

Qy 3227 GTAATGACTCTCTGACCCTCTGG 3249 

I I I I I I I I I I 

Db 2481 GG TCGCGG — CCGCTAG 2495 
Qy 47

CCTGCTGATCCCAGAACAATCAACCATGACGACCGAATCTGGATCAGACTCGGAATCCAA 106 

I I I I I I I I I II II I I I I I I I I I 

Db 1 CCTACTGGT-CTTGTGTGG GTGTTTTCCAGCAC-GGAAAAG — 

T C G AG AT AAT T 50 

Qy 107 GCC 

GGACCAGGAGGCCGAGCCCCAGGAGGCGGCGGGGGCGCAGGGGCGGCGGGGGCC 163 



Db 51 GCCAATGATCAGGGAAACCGAACCACT 

CC 79 

Qy 164 

GTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCCTGGAGCAGTTCGCCGCCGCTGCA 223



Db 8 0 AAGCT-ATGTCGCCTT TACGGACACT- 

GAACGGTTGATCGGTGATGC- 12 4 

Qy 224 

GCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAGGAACAGGAGTTTGCTGCCAGGG 



Db 125 - C G C AAAG AAT C AAG T T GC AAT G AAC C C C AC C AAC - 
GTTTTTGATGCC 173 



Qy 284 

CTGCAAAACAGCTCGAATATCAGCAATTAGAAGACGATAAACTTTCTCAGAAATCATCTA 
I I I I I I I I I I I I 

Db 174 AAACGTCT GATTGGA 

CGCAGATTTGATGAT 2 03 

Qy 3 44 GCAGTAAACTCTCTCGGTCT- 

CCATTAAAGATTGTCAAAAAGCCTAAAAGCATGCAGTGC 402 

I I I I I I I I I I I I I I I I I I I I III 

I I I 



283 

I I I I I I I I 



343 

I I I I III 



Db 204 GCTGTTGTC C AG T C T GAT AT G AAAC AT T G G C CCTTTATG- 

GTGG 246 



Qy 403 

AAAGTGATACTTCTCGATGGATCAGAATATACCTGTGATGTAGAGAAACGCTCCAGAGGA 462
I I I I I I I I II II I II I I I I I I I I I 

Db 2 47 TGAATGATGCTG GCAGGCCCAAGGT CC AAGTAGA- 

ATACAAGGGAGAGAC 295 

Qy 463 CAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACTTGCTAGAGAA — 

AGACTACTTTG 52 0 



Db 296 CAAAAGCT 

TCTATCCAGAGGAGGTGTCTTCTATG 329 

Qy 521 GGCTTACGTATCGAGATGCTGAAAAC — 

CAGAAGAATTGGTTGGACCCTGCTAAGGAAAT 578 



Db 33 0 G T T C T G AC AAAG AT G AAGGAAAT TGC AG AAG 

CCTACCTTGGGAAG 3 74 

Qy 579 AAAAAAACAGGTTCGAAGTGGT- 

GCTTGGCACTTTTCATTTAATGTGAAATTTTATCCAC 63 7 

I II II I I I I I I II II I I I I I I I I 

Db 3 75 ACTGTTACCAATGC — TGTGGTCACAGTGCCAGCTTACTTTAATG 

ACTCTCAGCGT 42 8 

Qy 638 

CAGACCCTGCCCAACTATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGCGAG 697 



Db 42 9 CAGG — CTACCAAAGATGCTGGAACTATTG-CTGGT-CTCAATGTACTT-- 

AG 475 

Qy 698 

ATGACATCGTGTCCGGAAGGCTGCCCTGCTCCTTTGTTACCCTGGCCTTGCTGGGCTCCT 757



Db 476 AATTATTAATGAGCCAACTGCTG— CTGCT— ATTGCTTAC— GGCTTAG- 

ACA 522 

Qy 75 8 ACACTGTC CAGTCAGA 

GCTCGGAGACTATGACCCAGATGAATGTGGGAGCG 808 



Db 523 AAAAG G T T G G AG C AG AAAG AAAC G T G C T C 

GAGGTGGCA 5 73 



Qy 809 

ATTACATTAGTGAGTTCCGCTTTGCACCAAACCACACTAAAGAACTGGAAGACAAAGTGA 868



Db 574 CTT TTGATGTGT 

CAATCCTCACTATTGAGGATGGAATCTTTGAGG 618 

Qy 869 

TCGAGCTGCACAAGAGCCACAGAGGAATGACGCCAGCAGAAGCAGAGATGCATTTCTTGG 928

I I I I I I I I I I I I I I I I I I I I I I I III 



I I I I I I I I 
-ATCTTTGACCTGG- 



I 

Db 619 TCAAGT CTACAGCTGGA- 

GACACCCACTTGGGTG G AG AAG AT T T T G 663 



Qy 929 

AAAATGCCAAAAAATTATCAATGTATGGGGTAGATTTACATCATGCTAAGGACTCAGAAG 988

I I I I I I I I I I I I I II III I I I I I I 

I I I I I 

Db 66 4 ACAA — CCGAATGGTCAACCATTTTATTGCTGAGTTT 

AAGCGCAAGCATAAGAAG 716 

Qy 9 89 GGGTAGAAATTATGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCG- 

GCTG 1047 

I I I I I I I I I I I I I I I I II I I I I I I 

I I I 

Db 717 G ACATCAGTGAGAA CAAGAGAGCTG 

TAAGACGCCTCCGTACTG 759 

Qy 10 48 CGAATAAAC- 

AGATTTGCCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACTT 1106 

I I I I I I I I I I I I I III III I I I I I I 

Db 76 0 CTTGTGAACGTGCTAAGCGT-ACCC TCTC 

TTCCAGCACCCAGGCCA 804 

Qy 1107 TTACATTAAGATCCGGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTT- 

TAAGC 1165 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 8 05 — GTATTGAGATC GATTCT CTCTATGAAGGAATC 

GACTTCTATAC 8 47 

Qy 1166 TGCCA- 

AACCATCGAGCTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTT 122 4 

I I I I I I I I I I I I I I I I I I I I I II 

Db 848 CTCCATTACC— CG TGCC CGATT 

TGAAGAACTGAATG 8 82 

Qy 1225 T T C AG AC T AC T G T T AC C AG AAGC AC C TCCCA 

AGAAATTCCTAACCTTGGGTT 1276 

I I I I I I I I I II I I I I I I I I I I I I I I III III 

I 

Db 8 83 — CTGAC — CTGTT — CCGTGGCACCCTGGACCCAGTAGAGAAAGCCCT 

TCGAGAT 932 

Qy 1277 - 

CCAAGTTTCGTTATAGTGGCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATA 1335 

I I I I I I I I I I I I I II III III I I I I 

I 

Db 933 GCCAAACT AGACAAGTCACAGATTCAT— GATATTGTCCTG 

GTTGGTG 978 

Qy 1336 GATCGCCCAG 

CACCTTACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGC 1392 

III I I III II I I I I I I I I I I I I I I I I I I 

Db 9 79 GTTCTACTCGTATCCCCAAGATTCAGAAGCTTCTCCA— AGAC- 

TTCTTCAATGGAAAAG 1035 

Qy 1393 

AGCTTGGATGGAGCATCAGTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCT 1452 
III I I I I I I I I I I I I I I I I I I I 



Db 1036 AACTGAATAAGAGCATC AACCCTG ATGAAG- 

CT 1067 



Qy 1453 

GCTGCAGAGGTTGGTACTGGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTG 1512 

I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 106 8 GTTGCTTA TGGTGCAG — CTGTCC AGGCA 

GCCATCTTG 1103 

Qy 1513 

ATCACCACTGTGACTCCGGAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAA 1572

I I I I I I I I I I I I I I I I I I I 

Db 1104 TCTGGAGAC-AAGTCTGAGAATGTTC 

AAGATTTG 1136 

Qy 1573 

CGGAGGAAGGGGGAAGAAGTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGAC 1632 

II I I I I I I I I I I I I I I I I III 

Db 113 7 CTGCTCTTG GATGTCACTCCTCTTTC— CCTT 

GGTAT 1171 

Qy 1633 

AGTGAGCGCACGGACACCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAA 1692 
I I I I I I I I I I I I I III 

Db 1172 TGAAACTGCTG GTGGAGTC — 

ATGAC 1195 

Qy 1693 

ACTCAAGATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTA 1752 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1196 TGTCCTCATCAAGCGTAATACCACCATT 

CCTACCAA 1231 

Qy 1753 G AAAC C T C AAC AG AC AC T G - 

CCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCCC 1811 



Db 1232 G CAGACACAGACCTTCACTACCTATT 

CTGACAACCAGCCT 12 71 

Qy 1812 

CGTGCGACTGGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGA 1871
I I I I I I I I I I I I I I I I I I I I 

Db 12 72 GGTGTGCTT ATTCAGGTTTATGAAGGC 

GAGCGTGCCA T 13 09 

Qy 18 72 GAAAATGGAAACCAAGACGGAGTCCAG— TGGATAGAGACGGAACCCACCGTGC- 

ACCA 1927 

I I I I I I I I I I I III III II I I I I I I I I I I 

I I I 

Db 1310 GACAAAGGATAACAA CCTGCTTGGCAAGTTT — GAACTCACAG- 

GCATACCT 135 8 

Qy 192 8 CCTGCCGCTTAGCACTGAGAAGGTGGTGC 

AGGAGACCGTGTTGGTGGAGGAGCG 19 81 

I I I I I III I I I I I I I I I I I I I I I I 



Db 1359 CCTGC ACCCCG 

AGGTGTTCCTCAGATTGAAGTCACTTTTGACATTGATGC- 14 0 8 



Qy 1982 

GCGTGTGGTGCACGCGAGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGC 2041



Db 1409 -C-AATGGTATACTCAA TGTCTCTGCTGTGGACA 

AGAGTACGGG 145 0 

Qy 2042 

AGCACAGCCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGGGGCT 2101 



Db 1451 AAAAG AG AAC AAG AT T AC TAT C AC T AAT G ACAAGGG — CCGTTTGA 

GCA 149 7 

Qy 2102 

AAAGAGGAAGGAGGGGAGGAGGTCGCTAAAGCTGTCCTGGAACAGGAAGAGACAGCCGCT 2161
I I I I I I I I I I I I I I I I I I I I 

Db 149 8 A GGAAGACATTGAACGTATGGTCCAGGAA 

GCTGAGAAGTACAAA 1541 

Qy 2162 GCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCAC- 

ATTTCAGAAACTTTGGA 222 0 



Db 1542 GCTGAAGATGAGAAGC-AGAGGGACAAGGT GTCATCCAAGAATTC 

ACTTGAGT 1593 

Qy 2221 ACAAAAACCTCATTTTGAGTCCTCAAC- 

GGTGAAGACGGAAACCATCAGTTTTGGCAGTG 2279 



Db 159 4 CCTATGCCTTCAACATGA — AAGC AAC T G T T G AAG AT G AG AAAC T T C A 

AG 1646 

Qy 22 8 0 TTTCACCGG G AGG AG T AAAGC T AG AAAT T T C C — 

AC AAG AAG T GC C AG TAG T T C AC AC C G 233 7 



Db 16 47 ATTAAC GAT GAG G AC AAAC AG AAG AT T C T G G AC AAG - 

TGTAAT G 16 89 

Qy 2338 

AAACCAAAACCATCACATATGAATCATCACAGGTCGATCCAGGCACAGATCTGGAGCCAG 2397



Db 169 0 A AAT TAT C A ACTGGCT-TGAT- 

AAGAATCAG 1718 

Qy 239 8 GCGTGCTGATGAGTGCACAGACGATCA-CATCTGAAACCACCAGTACCA 

CCACCA 2 451 



Db 1719 AC- 

T G C T GAG AAG G AAG AAT T T G AAC AT C AAC AG AAAG AGC T GG AG AAAG T T T G C AAC C - 17 76 

Qy 2452 

CTACCCACATCACCAAAACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAA 2511



I I I I 
AGGCA- 



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I III 

1 1 1 

Db 1777 C CAT CAT C AC C - AAGC T G T AC C AG AG T G C A GGAGGCATGCCAGGAG- 

GAA 1824 

Qy 2512 

TAGTCATCACGGGGGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAG 2571
I I I I I I I I I II I I I I I I I 

Db 1825 TG CCTGGGGGAT TT 

CCTGGTGGTG 1848 

Qy 2572 

AGGCCAAAGAGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGACAG 2631
I I I I I I I I III I I I I I I I I 

I I I 

Db 1849 GAGCTCCTCCC TCTG GTGGTGCTTCCT 

CAG 1878 

Qy 2 632 AG AT C AC AC C AG AAG AT GGAGAGG AT T G A- 

C C AG AGG AAT AAC T T AGC T T GC AC AT G AAT 2690 

I I I I I I I III I I I I I I I I III II III 

I 

Db 18 79 GG — CCCACCA-TTGAAGAGGTTGATTAAGCC AAC C AAG — 

TGTAGATGTA- 192 4 

Qy 2691 GCAGTCATGCAAACCGTTAGGAAAACCAGAGCCTATATGGAGTTCC- 

CTCTTCTAACCCA 2749 

I I I I I I I I I II I I II I I I I I I I II III 

I 

Db 1925 GCATTGTTCCACACATTT AAAAC 

AT T T G AAGG AC C T AAAT T C G T AGC AA 1973 



Qy 2750 

ACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATTAA 2809
I I I I I I I I I I I I I I I I I I I 

I I 

Db 1974 AT TCTGTGG CAGTTTTAAAAAGTTAAGCTG- 

C TAT AG 2 0 09 

Qy 2810 

TAAAGACACTGGCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAAAC 2869
I I I I I I I I I I I I I I I II 

Db 2010 T-AAGTTACTGG GCATTCT 

CAATACT 2 03 4 

Qy 2870 

CGATAATGCATGAAGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAAAA 2929
I I I I I I I I I I I I I I I I I I 

Db 2 035 TGAAT-ATGGAAC 

ATATGCACAGGGGAAG 2 062 

Qy 2 93 0 TGTCTCAGTTTTCAGCTATACC — 

TGCTCGTTCATAACCAACAATATAAACCGTGGTCTC 2987 

I I I I I I I I I I I I I I I I I I III 

Db 2 063 GAAAT AAC AT TGC AC - T T T AT AAAC - 

ACTGTATTGTAAGTGG 2102 



Qy 29 8 8 AT G T AAC AC AT AAAC AAT TC - 

ATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAA 3046 

I I I I I I I I I I I II 



II II 



Db 2103 

AAAAC T AT T T AAA 213 4 



-AAAATGCAATGTCTTAAAT- 



Qy 3047 A' 

CCTCAAATGC 3103 



3 0 4 7 ATTGCAATTTCTTAGGTAACCTTATATTTACAATAAATGAAGATTAC 



Db 2135 ATT 

GGCACCATAAAAAAAAAAAAAAAAAAAAAAAAAGGCCACATGTGC 2182 



Qy 



3104 TAGAAGCTGTCTAGGTC 3120 

I I I I I I I I I I I I 
2183 TCG-AGCTG — CAGGTC 2196 



Db 
Qy 48 CTGCTGATCCCAGAACAATCAACCATGACGACCGAATCTGGATCAG-

ACTCGGAATCCAA 106 



Db 2336 CAGCT CGAGCACATGTGGCCTTTTTTTTTTTTTTTTTTTTTGTATTC — 

TCAGA 2285 

Qy 107 GCCGGACCAGGAG-GC — 

CGAGCCCCAGGAGGCGGCGGGGGCGCAGGGGCGGCGGGGGCC 163 



Db 22 8 4 GCTG — CCAGGAGTGCATCGAGCC- 



CT 2252 



Qy 



16 4 GTGC-CGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCCTGGAGCAGTTCGCC- 



GCCGC 219 

I I I I I I I I I I I I I I I I I I I I II I III II 

I 

Db 2251 

GTTCTCTGAATCCCCCATCTTTCTGCAGCTCCAAGCTTTGTGTCCCACAGCCTGTGACTC 2192 

Qy 22 0 TGCAGCGCACAGCAC 

CCCGGTGCGAGGGAGGTCACTGACAAGGAACAGGAG-T 2 71 

II II III I I I I I I I I I I III I I I I I II 

Db 2191 TG-TGCTAACAAATCGCTATTGTCCAGTG GGGCGAATGGTGGC- 

TGGAACTAAAGAA 213 7 

Qy 2 72 TTGCTGCCAGGGCTGCAAAACAGCTCGA-ATATC- 

AGC AAT T AG AAG AC G AT AAAC T T T C 329 

I I I I I I I I I I I I II III I I I I I II II II 

I I I I I 

Db 2136 TTGCTGTCT-GGTTTCTATTCAAATCCAGGTAGCGAGATATATGA 

ATGGACTTT- 2084 

Qy 33 0 TCAGAAATCATCTAGCAGTAAAC-TCT- 

CTCGGTCTCCATTAAAGATTGTCAAAAAGCCT 387 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 0 83 TC — GAATCGTCATGTGAATAACGTCTGCTCGG CATGAAGG 

CTCAGAGC — 2 03 7 

Qy 388 AAAAGCATGCAGTGCAAAGTGATACTTCTCGATGGAT 

CAGAATATACCTGTGATGTA 444 

I I I I I I III II I I I I III II I I I I I I 

I I I 

Db 2 036 CATGCTAGG — AAGGATTA — 

ACTCGTAGGCTGACCACTAACATCCTTTGTGGTA 1986 

Qy 445 

GAGAAACGCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACT 499 

I I I I I I I III I I I I I III II I I I I I I I I 

I 

Db 19 85 CGAGGGAGAAACATTCC CAAGTATCATTT TATTCACACTT- 

AATT 1942 

Qy 50 0 TGCTAGAGAAAGACTAC TTTGGGCTTACGTATCGAGATGCTGAAAAC 

CAGA 55 0 

I I I I I II I I I I I I II III I I 

I 

Db 1941 

TTCTATCCCATACCCCCAAAATAAGGCTAGCTATTTAATTAGTTGGCCGCTTTTCTCTTA 18 82 
Qy 551 

AGAATTGGTTGGACCCTGCTAAGGAAATAAAAAAACAGGTTCGAAGTGGTGCTTGGCA-C 6 09 

I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1881 ATTTTTAGT — GTTTCTGTTGATAATGTGTGA 

GTTTGGGAAAATGCTAAGTAGC 183 0 

Qy 610 TTTTCATTTAATG 

T G AAAT T T T AT C C AC C AG AC C C T GC C C AAC T ATC TGAAGAT ATC 666 

I I I I I I I I I II I I I I I I I I I I I I I I 

I I I 

Db 1829 TTTTCACTTAGAACACTGTTATTTTCTCTTTAAAGTTTT — 

C T AC C T T AC AT TTATTATA 17 72 



Qy 66 7 ACCAGGTACTACCTCTGCTTGC — AGTTGCGAGATGACATCGTG 

TCCGG 713 

I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1771 GCATAGT — TATCTTT- 

ATAGCATAGATGCAGAAAGTAAGAGAGAGCTTGTTTTTTCAAG 1715 

Qy 714 AAGGCTGCCCT GCTCCTTTGTTACCCTGGCCTTGCTGGG 

CTCCTACACTGTC 765 

II I I I I I I I I I I II I I I I III I I I I I I 

I 

Db 1714 AAAACAACCCTTTAAAATACTTT 

CCAACCCATGAAGGGAAAAATCCTCCTTTTTT 1660 

Qy 766 

CAGTCAGAGCTCGGAGACTATGACCCAGATGAATGTGGGAGCGATTACATTAGTGAGTTC 825

I I I I I I I I I I I I I I I I 

I I 

Db 1659 C CCCCAAGTGCAT TCTAC- 

TTATTACTTTG 1631 

Qy 826 CGCTTTGCACCAAA 

CCACACTAAAGAACTGGAAGACAAAGTGATCGAGCTGCACAAG 8 82 
I I I I I I I I I I I I I I I I 

Db 1630 

CATTTTTCTCCCAAAGTCCAAATTTATG 1603 

Qy 8 83 AGCCACAGAGGAATGACGCCAG — 

CAGAAGCAGAGATGCATTTCTTGGAAAATGCCAAAA 940 

I I I I I I II III I I I I I I I I I I I I I 

I I I I I 

Db 16 02 CAAAGAAAATAGAAACAAGTTCAAATGCA ATGCATT 

AACCAAAT 1559 

Qy 941 

AATTATCAATGTATGGGGTAGATTTACATCATGCTAAGGACTCAGAAGGGGTAGAAATTA 1000
I I I I I I I I I I I I I I I I I 

Db 155 8 AA — AACAA 

GTCTCCTTCAAATTAGGAAC 1532 

Qy 1001 

TGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGAT 1060

I I I I II I III 

Db 1531 CAACC- 

TAAGCATTTGTAAAGTG 1510 

Qy 1061 

TTGCCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCC 1120

III I I I I I I I I I I I I I I I I I I I I I I 



- AG AAT C AG AAT T C T T T T AAA- ■ 



AATTAGATTTGGAACC 1470 
Qy 1121 

GGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCATCGAG 1180
I I I I I I I I I I II III II 

Db 1469 T G AAC TAT AT - AAT T CAT AAT T CT 

CATTTTT 1440 



Qy 1181 CTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGACTACTGT- 



Db 1439 CTG TG< 

ATTATTTTATCTTTCTCCTGTATA 1407 



Qy 12 40 C C AG AAGC AC C - T C C C AAGAAAT 

TCCTAACCTTGGGTTCCAAGTTTCGTTATAG 1292 

I I I I I I III II I III II I I I I 

Db 1406 CCTGAAAAAATGTCCATAGGCTTAAAGGGTCATGCTTTTACATTCC 

TTC 1358 



Qy 12 93 TGGCAGGACACAAGCGCAA — 

ACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCACCT 135 0 

II I I I I I II II I I I I I 

I 

Db 135 7 CATATCACAGGTACTATGAAGTAAGGAGAC- 

TTTTAGG T 1320 



Qy 1351 

TACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCTTGGATGGAGCATCA 1410 

I I I I I I II I I I I I II I I I I I I I I I I I 

I 

Db 1319 TTCTTT TTGTC TTAAAC TCAGACAGCTTTGT — 

AAGCAGTA 12 81 



Qy 1411 

GTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACT 1470

III I I I I I I I I I I I I I 

Db 12 8 0 GTG T G TAG AT T AC AAG A 

GTTAG 1259 



Qy 1471 

GGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCG 1530

I I I I I I I I I I I I I I I I 

I 

Db 1258 ACAAAAG CAGGC 

GCGACTGAG 123 8 



Qy 1531 GAG AAG AAG GCTGAGGAGGAGC — 

GGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAG 15 8 8 

I I I I II I I I I I I I I I I I I 
Db 123 7 AAG-AGTTGGTGGGGGAGAAGCTTGGGGC- 
ACTTCC 1204 



Qy 1589 AAGTCACGCCCATCTCGGCCATCCAG — 

C AC G AG G G AAAG AC TGACAGTGAGCG C AC GG A 1646 

I I I I I II I I I I I I I I I I II I 

Db 12 03 -TGTCAC T C AAC AC AT T C C AG AT C AC T AAAAAAT T T C - ■ 

CACAC— 1163 



Qy 1647 

CACCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCT 1706

I I I I I I I I I I I I I I I I I I I 

I I 

Db 1162 — CCTCTGCATTC CCCCTTGCC CACTCCAGTT- 

CCC 1130 



Qy 1707 

GATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCTCAACAGA 1766



Db 1129 GGTATTTTCTGATTCCATATGTTGTG 

GTATTTACCATACTTCTCTCCCTCACTAGG 1074 

Qy 176 7 C AC T GC C G T AAC G AAT G AATGGGAGAAGAGGC T T T C C ACCTCCCCCGTGCG— 

ACT 182 0 



Db 1073 CTCTGGC 

AAGACTGCTTCAGAGGGGATGCATTCCTTTAGATTGCACAAAGCGGAGCT 1017 
Qy 1821 

GGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAAATGGA 1880
II I I I I I I I I I I I I I I I I 

Db 1016 GGGAAAATGGCTGGCAGTTTC 

AG AAT C TAG T C AC GAT C GC AC GC AT GAG 96 8 



Qy 1881 

AACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCACCACCTGCCGCTTAGC 1940

III II I I I I I II I I I I I I I I I I I I I I I 

I 

Db 96 7 CACCTC-ACACA-TCCA TGTC CCTACC CGCCCCC— 

CCGCT— C 930 



Qy 1941 

ACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACGCGAGT 2000
I I I I I I I I I I I I I I 

Db 929 CC 

GCCCCTGCAGCTGGCTGACCTG 906 



Qy 2001 

GGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGCACAGCCCGCATTCACA 2060
I I I I I I I I I I I I II I I I I I I I 

I 

Db 905 TCTCACCC ACTGCTGG 

CCTATCGAACGGCCAGGACTGTCT 866 



Qy 2 061 GGCATTAAAGGGAAAGAGGGCTC-TGC — TTGACGGAGG- 

GGGC T AAAGAGGAAGGAGGG 2116 

I I I I I I I II I I I I I I I I 

Db 865 GGTTTT 

GGCTCGTGCCTTTGTCCATGTCTGGCTT 832 



Qy 2117 

GAGGAGGTCGCTAAAGCTGTCCTGGAACAGGAAGAGACAGCCGCTGCTTCCCGTGAGCGA 2176 
I I I I I I I I I I I I I I I I I I I I I 

Db 831 AGTTC-CTCT — CTGTC TATGCTTGCCTCTAC — 

CCCCACCGCCC 792 



Qy 2177 C AAG AG G AG C -AG AG T GC AGC C AT C C AC AT T T C 

AGAAACTTTGGAACAAAAAC 222 8 

I I I I I I I I I I I I I I I I I I I I I I I I III 

I I I I I I 

Db 791 CAGGCGGCACAAGTGTTTGGCCA — CAC AAAAC T AGAGAT AGAAAAGGTGGT - ■ 



Qy 2229 

CTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCAGTTTTGGCAGTGTTTCACCGG 2288

III II I I I I I I I I I I I I I I I I I I 



Db 735 TTCA AACTTTTC — TAAATTC TCCAACAGTT 

TATTTCT 700 

Qy 2289 

GAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGCCAGTAGTTCACACCGAAACCAAAACC 2348
I I I I I I I I II I I I I I 

I I I I I 

Db 699 TGTGAATTTCTTCC 

TTCTTTAAATACTCCATTTTAAGAAAACA 657 

Qy 2349 

ATCACATATGAATCATCACAGGTCGATCCAGGCACAGATCTGGAGCCAGGCGTGCTGATG 2408

I I I I I I I I I I I I I I I I I I I I I I I I III II 

I I 

Db 656 A— AAAAATTAATTATC TAAAGGCAAAGA-ATGGA AAGCAACCT- 

TTG 613 

Qy 2409 

AGTGCACAGACGATCACATCTGAAACCACCAGTACCACCACCACTACCCACATCACCAAA 2468

III I I I I I II I I I I I I I I I I 

I 

Db 612 TGTTC CTTATAATA AC T G AC T T C AT AAC T C 

TCTCC — A 577 

Qy 2469 

ACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAATAGTCATCACGGGGGAT 2528
I I I I I I I I II I I I I I I III 

I 

Db 576 GCTGCGTTATGGGATGTGT 

ATAAAAAGCTTCTGTTC T 540 

Qy 2529 

GCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAGAGCAGCAC 2588

I III I I I I I I I I I I I I I I I I I I I I I 

I 

Db 539 GAGAACA AAGGAGCACGTGCAG AAAT GAG AC G AAAAA — 

ATC-C 499 

Qy 2589 

CCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGACAGAGATCACACCAGAAGAT 2648
I I I I I I I I I I I I I I I I I I 

Db 49 8 ACTGAC AGTATTCCATTA 

CACA 477 

Qy 2649 

GGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAAACCGTT 2708

I I I I I I I I I I I I I I I I I I I II 

I 

Db 4 76 AATTACTT AAAAG AT T T TAG T C AAGC 

CCCT- 447 

Qy 2 709 AGGAAAACCAGAGCCTATATGGAGTTCCCTCTTC-TAACCCAACTGTACTTGTAT 

CT 2764 

I I I I I I I I I I I I I I I II II I I I I I 

I I 

Db 446 

C AAC AG AT T C AAT T T T AAAATGGC T T T T AGT T AAAAAAAAAAAAT TGAAAG T GC T 392 



Qy 2 765 GTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATTAATAAAGAC- 

ACTGGCA 2 823 



II I 1 1 1 II I 1 1 1 1 I I 1 1 1 II III 1 1 1 1 1 

1 1 

Db 391 TACCCAGTAAA AGAACCGAAG — 

TAGTCCTGAACTGTTACGTAAGACTTTTTACA 339 

Qy 2 82 4 GAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAAACCGAT- 

AATGCATGA 2 8 82 

I I I I I I I I I I I I I I I I III 

I 

Db 33 8 GTTGGATCTT — TGTCAAAAGGGGATGGGGGT 

GATGGGAGAAAGC 296 

Qy 2 8 83 AGC AAC G AT AAAAT T AC AAAAGAGC - 

AGCATTTTTAATTTTCACAAAATGTCTCAGTTTT 2941 

I I I I I I I I I I I I I I I I I I I I I II I III III 

I I I I 

Db 295 AGCAACG — 

ACAATCAAAAAAGTTCGAGCTGCTGTGGCTAGAGGACAACTTCTGTGTTTC 23 8 
Qy 2942 CAGCT 

ATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACAT 2 99 8 
I I I I I I I I I I I I I II III I III 

I I I I 

Db 23 7 CAGATAGGATTCTTGCT-GTAGAAATGGAAC — TTCCAGCC 

AGCACA- 19 4 

Qy 2999 AAACAATTCATGCCTTTCATAGT 

T T AT TAT TAT T AAAG TC TAAAC AAAAT TGCAAT 3054 

I I I I I I I I III I I I I I III I I I I I I I I 

I I I 

Db 193 GCAT- 

CCTGTCCCAGTAGAGAAATGAGTTTGTCAGTTAAAACAAAAAAAAAAT 142 

Qy 3055 

TTCTTAGGTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGAAGCTGTC 3114 

II III III I I I I I I I I I I I 

I I 

Db 141 TAGATAC TGGA ACCC 

AGGCTAGACGAGGTA 112 

Qy 3115 

TAGGTCCGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAAGTTTATTC 3174

I I I I I I I I I I I I I I I I III I I I I 

I I 

Db 111 TTGAACCGCGC CAGATTTCCT TGC 

AGCCCTGTCTGCTC 74 

Qy 3175 AGTAAACAACTTGTACTTGTT — 

TCATCTGGTTTATTACTCTCACCCATAAACAGTAATG 3232 

II II I I I I I I I I I I I I I I 

Db 73 AGCTCGCA TTCCAGTAGGCCAACAG 

TGCTCGAGGAATTCCGCAG 3 0 

Qy 3233 ACTCTCTGACCCTCTGGAAATATGTAATGCTTCCAATC 3270 

III III I I I I I I I I I I I I 

Db 2 9 -CTTTTAGAGC AG AAG T AAC AC T T C C G T AC 1 



RESULT 12

US-10-478-914-64/C

; Sequence 64, Application US/10478914
; Patent No. 7335755
; GENERAL INFORMATION:
; APPLICANT: NAKAGAWARA, AKIRA
; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA
; FILE REFERENCE: 7388-80899
; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26
; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-162775
; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226
; PRIOR FILING DATE: 2001-08-24
; NUMBER OF SEQ ID NOS: 417
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 64
; LENGTH: 2120
; TYPE: DNA
; ORGANISM: Homo sapiens

Query Match 26.6%; Score 5184; DB 1; Length 2120;
Best Local Similarity 38.0%; Pred. No. 0;
Matches 1268; Conservative 0; Mismatches 780; Indels 1286; Gaps 196;

Qy 14 GCCTAGTCCCCACGCCGCGGCGCGCCCGGGCTC 

CCTGCTGATCCCAGAACAATC 6 7 

I I I I I III I I I I I I I III I I 

Db 212 0 GTCTAGCGGCCGCGAC-CTGC- 

AGCTCGAGCACATGTGGCCTTTTTTTTTTTTTTTTA-A 2064 

Qy 68 

AACCATGACGACCGAATCTGGATCAGACTCGGAATCCAAGCCGGACCAGGAGGCCGAGCC 127



Db 2 063 AAG AAT GAG T T AC AT T T AT TG AT AT GG T T T G TCATATGCTTTATAAATGGTC — 

ACC 2009 

Qy 12 8 CCAGGAGGCG GC GGGGGCGCAGGGG-CGG 

CGGGGGCC 163 



Db 2008 

CTTTGAAACATGTATTATTACTATTTGGGGGAGAGGGGGACTGTTCATTTTACAGGGGAC 19 49 



Qy 164 

GTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCCTGGAGCAGTTCGCCGCCGCTGCA 223
II II I I I I I I I I I I I I I I I I I I I III 

Db 19 48 AAGCAAGACAGGCTCAAGGAGGGAAAGGA-CAGGCTC 

AAAGT 1908 



Qy 224 

GCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAGGAACAGGAGTTTGCTGCCAGGG 283

I I I I I I I I I I III II II I I I I I 

Db 19 0 7 CATCACAGTGTGGGG CTGGAATG CA 

GTTGCC 1877 



Qy 284

CTGCAAAACAGCTCGAATATCAGCAATTAGAAGACGATAAACTTTCTCAGAAATCATCTA 343
III II I I I I I I 

I I I I I 

Db 1876 CTTC CT 

TCTTTCTTTTTGCACATCTT 1851 

Qy 344 

GCAGTAAACTCTCTCGGTCTCCATTAAAGATTGTCAAAAAGCCTAAAAGCATGCAGTGCA 403

I I I I I I I I III 

Db 1850 CCG TCTCTAGG 

GTGAG 1835 

Qy 404 

AAGTGATACTTCTCGATGGATCAGAATATACCTGTGATGTAGAGAAACGCTCCAGAGGAC 463 

I I I I I I I I I I I I I I I 

I I I 

Db 183 4 GAGGGGTGC AGGCACAG GCACCC- 

AAGAC 1807 

Qy 464 

AAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACTTGCTAGAGAAAGACTACTTTGGGC 523 

I I I I I I I I I I I I I I I I I 

I 

Db 18 06 AGCTGCGGT CCAGCCCCGGCC 

CCACCTGTGGTC 1774 

Qy 524 

TTACGTATCGAGATGCTGAAAACCAGAAGAATTGGTTGGACCCTGCTAAGGAAATAAAAA 583 

II I I I I I I I I I I I I I I I I I 

Db 17 73 TCA GTGACGC CCCAGA GGCCCCATCT 

TC 1746 

Qy 584 

AACAGGTTCGAAGTGGTGCTTGGCACTTTTCATTTAATGTGAAATTTTATCCACCAGACC 643

II I I I I I I I I I I I I I I I I I 

I I 

Db 17 45 CCCA CATAATGAGGCT — GCTCCATCC 

TCCTCAAAGCC 1710 

Qy 644 

CTGCCCAACTATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGCGAGATGACA 703

I I I I I I I I III I I I I I I I I I I I III 

I 

Db 1709 C AG AC C T A- T T T C AT AAG CCCCAGACCCCACCTTCAC 

CCAGGGCCCC 166 4 

Qy 70 4 TCGTGTCCGGAAGGCTG 

CCCTGCTCCTTTGTTACCCTGGCCTTGCTGGGCTCCTAC 759 

II III I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1663 AAG AG AAC AG A — G C T GG AG AC AC T T C T AC T CCTAGC 

ACTGGATGCCCTC 1616 

Qy 760 

ACTGTCCAGTCAGAGCTCGGAGACTATGACCCAGATGAATGTGGGAGCGATTACATTAGT 819 

I II III Mill I I I I I I I I I I I I 

I 

Db 1615 TC CC CTC CTGTGA AGCTGTAGGTGGGGG 

GTGGA 15 83 



Qy 820 

GAGTTCCGCTTTGCACCAAACCACACTAAAGAACTGGAAGACAAAGTGATCGAGCTGCAC 879
I I I I I I I I I I I I I I I I I I I II 

I 

Db 1582 GGACACCCCTTT AAGGCAGGCTC 

GGGGGGCTATGAACTC C 15 43 

Qy 880 AAGAGCCACAGAGGAATGACGCCAG- 

C AGAAGC AGAGATGC AT T TC T TGGAAAATGCCAA 938 

I I I I I II I I I I I I I I I I I I I II I I I I I 

I I I 

Db 15 42 AAGA-CTCCTG G AAGC C AG T AAG AAG T G G G AAG G TAGGCACA— 

CCA- 1499 

Qy 939 

AAAATTATCAATGTATGGGGTAGATTTACATCATGCTAAGGACTCAGAAGGGGTAGAAAT 998
I I I I I I I I I II II I I I I I I 

Db 1498 TTTGG 

GATTTTTATTTTTCACTGAGCTAAGAG 146 7 

Qy 999 

TATGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAG 1058

I I I I I I I I I I I I I I I 

I I I 

Db 1466 AGGAGATT CCAGCTATGGA- 

ATCCAG 14 42 

Qy 1059 

ATTTGCCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGAT 1118
I II I I I I I I I I I I I I I I I I II 

Db 1441 A GAGC GTTCTACAG 

G AAC AG AAC AG AAAT T 1412 

Qy 1119 

CCGGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCATCG 1178

I I I I I I I I I I I II 

I 

Db 1411 — GGCTGGG CCCTGGG 

CCCAGAG 1391 

Qy 1179 

AGCTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGACTACTGTT 1238
I I I I I I I I I I I I I I I I I I I 

I 

Db 1390 -GAGGCC TGGAGCCGCTTAC — 

TGTTCAG T 1364 

Qy 1239 

ACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTTATAGTGGCAG 1298
I I I I III I I I I I I 

Db 1363 GCCAG CTTC 

ATCCTTG 1348 

Qy 1299 

GACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCACCTTACTTTGA 1358

I I I I III I I I I I I I I I II 

I 

Db 1347 GCACTC TGCCT CTCACACAGCATC — 

GCTCCG- 1318 



Qy 1359 ACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCT — 

TGGATGGAGCATCAGTGAAT 1416 

I I I I I I I I I II III I I I I I I I I I I I I I I I 

I 

Db 1317 — GCTC— CCAGAGCCC ACAGTGGC- 

AGCCGCTGGAGGCTGTGGCAGCAGCTGCT 1268 

Qy 1417 

GAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACTGGCCAG 1476

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 126 7 G CCCTGC CCAAGGAGGG ATGTGGGC TTGGCTCT- 

GACAG 123 0 

Qy 1477 TAC — GCCACAA 

CAAAAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCG 153 0 

II I I I I I I I II II I I I I I II I I I I I I 

I 

Db 1229 T T C AAG C C AG AAAG AGC C T T AG AAG TCATTCAGGAC ATTCCCATTTT 

ACAG 1179 

Qy 1531 GAGAAG AAGGCTGAGGAGGAGC- 

GGGACGAG G AAG AG G AC AAAC G G AG G AAG G G G G 1585 

III I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1178 AT GAT G AAAT AAGGC TC AGAGAGAGC AGGGGC T T T C AAG AAC AAT AAAGG TGG A 

GAC 1122 

Qy 1586 

AAGAAGTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGCGCACGG 1645

I I I I I I I I I II III I I I I I I I I I I II III 

Db 1121 TGGAACTCAGGCCT CCTGTCTGCT-GAGGG — 

TGTCTGCCCATGCTCTCTC — 1074 

Qy 16 46 ACACCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCA 

AGAT 1701 

II I I I I I I I I III I I I I I I I I I I I I I II 

I 

Db 10 73 TCAGAGTTTCTGA-GTGGTGGTGTCTG GAGGGGCTGG 

ACACGCAAGGGAGCT 1023 

Qy 1702 

GACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCTCA 1761
I I I I I I I I I I I I I I I I I I I 

I 

Db 1022 GGCCTCGTG TGTGTCTGTCTGTGCCGGGATGTG TCTGTGG 



Qy 1762 ACAGACACTG- 

CCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCCCCGTGCGACT 182 0 

I I I I I I I I I I I I I I I I I III III 

I I 

Db 980 TCTGTCTCTGAACGTGCC AGGGGTTTT — 

GTGTCCTGTGTGTGTCT 937 

Qy 1821 

GGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAAATGGA 1880

II I I I I I I I I I I I I I I I I I 

I I I 

Db 936 CTCT GTGGGTGCTCCTGTG TGCAG- 

GAACTTGGG 904 



Qy 1881 AAC C AAG AC G G AG T C C AG T GG AT AG AG AC GG AAC C C AC 

CGTGCACCACCTGCCGCTT 1937 

I III I I I I III I I I I III I I I I I I 

Db 9 03 GCCATGGCCAG AGAGGGATGGGCCTATTCCTACAATCTTGGGAGGCCT- 

CCAC— 852 

Qy 1938 

AGCACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACGCG 1997

II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 851 AG-GCTGAGCTGGTGGGG — AGGGTCC TGAAGG- 

G TAG AG T G AAG 811 

Qy 1998 

AGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGGGGATGCTGCAGCACAGCCCGCATTC 2057
I I I I I I II I I I I I I I I I I I I I I I III 

Db 810 -ATGGGGA — CT — TTACTC 

CCAGCGGAGAG G AAG AAAC 777 

Qy 2058 

ACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGGGGCTAAAGAGGAAGGAGGGG 2117

III I I I I I I I I I I I I I I I I I I 

I I I 

Db 776 -CAG AAGATAG GCTT GGGGC 

ATGGATGGG 7 49 

Qy 2118 AGGAGGTCGCTAAAGCTGTCCTGGAAC- 

AGGAAGAGACAGCCGCTGCTTCCCGTGAGCGA 2176 

I I I I I I I I I I I I I I I I I I I I I I I III 

Db 748 AAAG — GTCTTGGAACTAGGATGGGGCAAGGGAAAC 

AAAGCCC 70 8 

Qy 2177 

CAAGAGGAGCAGAGTGCAGCCATCCACATTTCAGAAACTTTGGAACAAAAACCTCATTTT 2236

III I I I I I I III I I I I III III I I I I 

Db 70 7 CAAATGGAGTATGG GTCC CTGGAA-GTTGACACA 

CTCTT 670 

Qy 2 23 7 GAGTCCTCAACGGTGAAGACGGAAACCATCAGTTTTGGCAGT — 

GTTTCACCGGGAGGAG 229 4 

I I I I I I I II II I I I I I I I I II I I I I I I I 

I I 

Db 669 -AGCCCTCA TGGGGGC CCAGCC CAGTGGGTAACCCCGGG- 

GAAG 628 

Qy 2295 T AAAGC T AG AAAT T TCC AC AAGAAG T GC C AG TAG T T C AC AC C — 

GAAACCAAAA 23 46 

I I I I I I I I I I I I I I I I I I I I I II 

Db 62 7 TGGTCACATGACGTGACACTTCTCAGCTGATTGCTTG- 

AGATCCCCCAGTGTGGCCTGGG 569 

Qy 23 4 7 CCATCACATATGAATCATCACAGG- 

TCGATCCAGGCACAGATCTGGAGCCAGGCGTGCTG 2 405 

III I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 56 8 CCTTGTCCT-TGAGTGAGCCCAGGTTCAAGCCAG 

GAAGGCTGGCCAGCTG 52 0 



Qy 2406 - 

ATGAGTGCACAGACGATCACATCTGAAACCACCAGTACCACCACCACTACCCACATCAC 2464



Db 519 CCCAAGGACTGAGGCCTTCTCCTC 

CTCCTCCTCTGTGTCCTCCTCGATCATCTTGGC 463 

Qy 2465 

CAAAACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAATAGTCATCACGGG 2524
III I I I I I I I I II II 

Db 462 CAGCTC CCGGGGCAGCTC CA 

CGTCTCCAC 43 4 



Qy 2525 

GGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAGAGCA 2584
I I I I I III I I I I I I I 

I I 

Db 433 CTGCCAT CTGGATCTGGCTCT 

CA 411 

Qy 2585 GC— 

AC C C T G AC AT G T C AG T G AC CAAAG TAG T GG T CC AT AAAG AG AC AG AG AT CACACCA 2642 

I II III I I I I I I I I I I I I I I I I 

I I 

Db 410 TCTGTCTCGTTCTTG — GGTAGCC — GGTATTTGTC 

TCGGAAGTGGCTCC- 365 



Qy 2643 

GAAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAA 2702

I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 36 4 GCAGTGTGGCCCG CTCTGCCTTCCTCTGTGTGAACT- 

GTGCAT 323 



Qy 2703 

ACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAACCCAACTGTACTTGTAT 2762

II I I I I I I I I I I III I III 

I 

Db 322 CCC GCTCCATCTTCTCTTCCACGAG 

TTGCTT 292 

Qy 2763 

CTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATTAATAAAGACACTGGC 2822

III I I I I I I I I I I I III 

I I 

Db 291 CTG ATAC — TCC TCGTACTCCTCC 

CGGCTCATGC 26 0 



Qy 2823 

AGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAAACCGATAATGCATGA 2882

III I I I I I II II II I I I I I II I I I I 

I 

Db 259 CCTGAG — CTTCGGCT GCCGACTTGTCCC-CATCTC CC — 

TTATCCTCGC 215 

Qy 2883 

AGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAAAATGTCTCAGTTTTC 2942

III II II I I I I III 

I I I 

Db 214 CGCCTC CCA GGC TCCCAGTGAGGTTC 



Qy 2943 

AGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACATAAAC 3002

I I I I I I I I II I I I I I I I I I I I I I I I 

I I I 

Db 185 AGCTG-GCC-GC CC ACCATGGTCT TCAC- 

CATGAAC 153 

Qy 3003 

AATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAAATTGCAATTTCTTAGG 3062

I I I I I I I I I II I 

I I 

Db 152 GCCAT GGTCTTCGCCG 

GCCGGGGCACCGG 12 4 

Qy 3063 

TAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGAAGCTGTCTAGGTCCG 3122

III I I I I I I I I I I I II 

I I 

Db 123 GCGCCT GCTACCACGC GCCAGGCACCG 

GGCACG 91 

Qy 3123 

TCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAAGTTTATTCAGTAAACA 3182 

I I I I I I I I I I I III I I I I I 

I 

Db 90 TCCG CGCCTACTTCAG CACTC

CAGTAGGCC 61 

Qy 3183 

ACTTGTACTTGTTTCATCTGGTTTATTACTCTCACCCATAAACAGTAATGACTCTCTGAC 3242

I I I I I I III II III III 

I I 

Db 60 AACAGTGCTCG AGGAATT CC GCAG

CTTTTAGAG 28

Qy 3243 CCTCTGGAAATATGTAATGCTTCCAATCTTGCTT 3276

I I I I I I I I I I I I I III 

Db 27 C AGAAGTAACACTTCCGTACAGGCCT 2



RESULT 13 

US-10-478-914-17/C 

; Sequence 17, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA 

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899

; CURRENT APPLICATION NUMBER: US/10/478,914
; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294
; PRIOR FILING DATE: 2001-05-30 
; PRIOR APPLICATION NUMBER: JP 2001-162775 

; PRIOR FILING DATE: 2001-05-30
; PRIOR APPLICATION NUMBER: JP 2001-255226 
; PRIOR FILING DATE: 2001-08-24 
; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 17 

; LENGTH: 2198

; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-17 



Query Match 26.4%; Score 5132.5; DB 1; Length 2198;

Best Local Similarity 42.1%; Pred. No. 0; 

Matches 1313; Conservative 0; Mismatches 776; Indels 1033; 
Gaps 184; 

Qy 214 

CGCCGCTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAGGAACAGGAGTTT 273

I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2197 CGAC-CTGCAGCTCG - AGC AC AT

GTGGCCTTTTTTTTTTTTTTTTTTTTT 2150 

Qy 274 

GCTGCCAGGGCTGCAAAACAGCTCGAATATCAGCAATTAGAAGACGATAAACTTTCTCAG 333 

I III I I I I I I I I I I I I 

Db 2149 TTTTATGGTGC 

CAATTTTAAATAGTTTTATTT 2118 

Qy 334 

AAATCATCTAGCAGTAAACTCTCTCGGTCTCCATTAAAGATTGTCAAAAAGCCTAAAAGC 393 

II II I I I I I III I I I I I I I III III 

Db 2117 AAGACA — TTGCATTTTCCACT TACAATACAG — 

TGTTTATAA 2079

Qy 39 4 ATGCAGTGCAAAGTGA — 

TACTTCTCGATGGATCAGAATATACCTGTGATGTAGAGAAAC 451 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2078 AGTGCAATGTTATTTCCTTC CCCTGTGC

ATAT 2047 

Qy 452 

GCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACTTGCTAGAGAAAG 511 

I I I I I I I I I I I II I I I I I II I I I I I I I I I I I 

Db 2046 GTTCCATA-TTCAAGT ATTGAGAATG

CCCAGTAACTTACTATAGCAGC 2000 

Qy 512 ACTACTTTGGGCTTACGTATCGAGATGCTGAAAACCAGAAGAATTGG 

TTGGA 563 

I I I I I III I I I I I I I I I I I I I I II 

I 

Db 1999 TTAACTTT TTA AAACTG CCA- 

CAGAATTTGCTACGAATTTAGG 1958 

Qy 564 CCCTGCTAAGGAAATAAAAAAACAGGTTCGAAGTGGTGCTTGGC-ACTT-

TTCATTTAAT 621 

I I I I I I I I I I I II I I I I I I I I I I I I I I I 

I 

Db 1957 TCCTTCAAATGTTTTAAAT GT 

GTGGAACAATGCTACATCTACACTTGGT 1909

Qy 622 GTGAAATTTTATCCACCAGACCCT 

GCCCAACTATCTGAAGATATCACCAGGTAC 675 

I I I I I I I I I I III I I I I I I I I I I I 

Db 1908 TGGC TTAATCAACCTCTTCAATGGTGGGCC CTGAGGA-

AGCACC 1866 



Qy 6 76 TACCTCTGCTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGCCCTGC — 

TCCTTTG 733 

III I I I I I I II I I I I I I I II II III 

Db 1865 -ACCAGAGGGAGGAGCTCC ACCAC 

CAGGAAATCCCCCAGGCATTCCT 1820

Qy 734 

TTACCCTGGCCTTGCTGGGCTCCTACACTGTCCAGTCAGAGCTCGGAGACTATGACCCAG 793 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I III 

Db 1819 CCTGG-CATGC CTCCTGCACT 

CTGGTACAGCTTGGTGATGATG 1778 

Qy 794 

ATGAATGTGGGAGCGATTACATTAGTGAGTTCCGCTTTGCACCAAACCACACTAAAGAAC 853 

III I I I I I I I I I I I I I II 

Db 1777 GGG TTGCA AACTTT- 

CTCCAGCTCTTTCTGTTGATG 1743 

Qy 854 

TGGAAGACAAAGTGATCGAGCTGCACAAGAGCCACAGAGGAATGACGCCAGCAGAAGCAG 913 

I I I I I I II II I I I III I I I I I I I I I II 

Db 1742 T TCAAATTCTTC 

CTTCTCAGCAGTCTGATTCTTATCAAGCCAGTTGAT 1695 

Qy 914 AGATGCATTTCTTGGAAAATG- 

CCAAAAAATTATCAATGTATGGGGTAGATTTACATCAT 972 

I I I I I I I I I I I I I I II I I I I I I I I 

I 

Db 1694 AATTTCATTACACTTGTCCAGAATCTT

CTGTTTGTCCTCATCGTTAATCTT 1644 

Qy 973 GCTAAGGA CTCA GAAGGGGTAGAAATTATGTTAGGAGTTTGTG

CAAGTGG 1022 

II II I I I I I III I I I I I I III 

I I I I I I 

Db 1643 

GCCTTGAAGTTTCTCATCTTCAACAGTTGCTTTCATGTTGAAGGCATAGGACTCAAGTGA 1584

Qy 1023 TCTGTTGATATATCGC GAC 

CGGCTGCGAATAAACAGATTTG 1063 

I I I I II II II I I I I I II I I I I I I I 
Db 1583 ATTCTTGG- 

ATGACACCTTGTCCCTCTGCTTCTCATCTTCAGCTTTGTACTTCTCAGCTT 1525

Qy 1064 CCTGGCCCA-AGGTTCTAAAGATTTCAT

ACAAACGGAACAACTTTTACATTAAGATC 1119

I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I 

Db 1524 CCTGGACCATACGTTC-AATGTCTTCCTTGCTCAAACGGCCC 

TTGTCATTA 1475 

Qy 1120 CGGCCGGGAGAGT

TTGAACAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCA 1175 

I I I I I I III I III III III I III I 

I I 

Db 1474 GTGATAGTAATCTTGTTCTCTTTTCCCGTACTCTTG 

TCCACAGCAGAGACA 1424



Qy 1176 TCGAG- 

CTGCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACATTTTTCAGACTAC 1234



I 1 1 1 I 1 1 1 1 1 1 I III 1 1 1 I I 

Db 1423 TTGAGTATACCA TTGGCATCAATG TCAAAAGT 

GACTTC 1386 

Qy 1235 

TGTTACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGTTTCGTTATAGTG 1294
I I I I I I I I I I I I I I I 

Db 13 85 AAT CTGAGGAAC 

ACCTCGGGGT 1364 

Qy 1295 

GCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCA 13 4 7 

I I I I III I I I I I I I I I I I I I I I I I I I 

Db 1363 GCAG GAGGTATGCCTGTGAGTTCA- 

AACTTGCCAAGCAGGTTGTT 1320 

Qy 1348 — CCTT— ACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCA- 

GCTTGGATG 1402 

I I I I I I I I I I I I I II II I I I I I III I 

I I 

Db 1319 ATCCTTTGTCATGGCACGCTCG-CCTTCA 

TAAACCTGAATAAGCACACCAGGCTG 1266 

Qy 1403 

GAGCATCAGTGAATGAAAACCATGAAATATACATGAAGGATTCTATGTCTGCTGCAGAGG 1462 

I I I I I III I I I I I I I I I I I I I I I I I 

I I 

Db 1265 G T - T G T C AG AAT AGGT A GTGAAGG — TCTGTGTCTGCT- 

TGG 1228 



Qy 1463 TTGGTACTG-GCCAGTACGCC — 

ACAACAAAAGGCATCTCTCAGACCAACTTGATCACCA 1519 

I I I I II I I I I I I I I I I I I I I III I I I I I III 

I 

Db 122 7 TAGGAATGGTGGTATTACGCTTGATGAGGACAGTCATGACTCC- 

ACCAGCAGTTTCAATA 1169 

Qy 1520 

CTGTGACTCCGGAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAACGGAGGA 15 79 

I I I I I I I I I I I I I I I I I 

Db 116 8 C CAAGGGA AAGAGGAGT 

GACATCC 1145 

Qy 158 0 AGGGGGAAGAAGTCACGCCCA-TCTCGGCC AT C C AG - C AC G AGGG AAAG AC T - 

GAC 1632 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1144 AAGAGCAGCAAATCTTGAACATTCTCAGACTTGTCTCCAGACAAGATGG- 

CTGCCTGGAC 10 86 



Qy 1633 AGTGAGCGCACGGACACCGCAGC — 

CGACGGGGAGACCACTGC C AC T G AGG AGC TAG AAA 1690 

II I I I I I I I I I I I I I I I III I III 

Db 10 85 AGC-TGCAC-CATAAGCAACAGCTTCATCAGGGTTGATGCT-CTTATTCAGTTCT — 

TTT 1031 

Qy 1691 

AAACTCAAGATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCT 1750

I I I I I I I I I I I I I I I I II I I I I I II I I I I I I 



I 

Db 103 0 CCATTGAAGAAGTCTTGGAGAAGCTTCTG 

AATCTTGGGGATACGAGTAGAACCACC- 975 



Qy 1751 

TAGAAACCTCAACAGACACTGCCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCC 1810 
I I I I I I I I I I I I I I I I 

Db 9 74 AACC AGGACAATATC 

ATGAAT 954 

Qy 1811 

CCGTGCGACTGGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAG 18 7 0 

I I I I I I I I I II III I I I I I I I I I I I I 

Db 953 -C-TGTGACTTGTC TAGTTTGGCATC TCGAAGGGCTT- 

TCTCT 914 

Qy 1871 

AGAAAATGGAAACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCACCACCT 193 0 

I I I I I I I I I I I I I I I I I I I I 

I 

Db 913 ACTGGGTCCAGGG— TGCCACGGAAC 

AGGTCAGCAT 880 

Qy 1931 GCCGCTTAGCACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGG- 

CGTGTGG 1989 

III II I I I I I I I I I I I I I I I I I 

Db 8 79 TCAGTTCTTCAAATCG GGCACGG 

G T AAT GG AGG T AT AG AAG T C GAT T C C 831 



Qy 1990 TGCA-CGCGAGTGGGGATGCTTCTTACTCGGC — 

GGGAGACAGCGGGGATGCTGCAGCAC 2046 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 83 0 TTCATAGAGAGAATCGAT-CTCAATACT-GGCCTGGGTGCTGGAAGAGAGGGTAC 

GCTT 774 

Qy 2047 

AGCCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGGGGCTAAAGA 2106 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 7 73 AGCACG — TTCACAAGCAGT 

ACGGAGGCGTCTTAC — 741 

Qy 2107 GGAAGGAGGGGAGGAGGTCGCT 

AAAG CTGTCCTG G AAC AG G AAG AG AC AG C C G 2159 

I I I I I I I I I I I I I I I I 

I I 

Db 740 AGCTCTCTTGTTCTCACTGATGTCCTTCTTATG 

CTTGCG 702 

Qy 2160 

CTGCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCACATTTCAGAAACTTTGG 2 219 

II II I I I I I I I I I II I I I I I I I I I I I I 

Db 701 CTTAAACTC AGCAATAAAATG-GTTGA 

CCATTCGGTTGTCAAAATCTT 655 

Qy 2220 

AACAAAAACCTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCAGTTTTGGCAGTG 2 2 79 

I I I I I I I I I I I 



Qy 22 8 0 TTTCACCGG — GAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGC- 

C AG TAG T T C AC AC C 2336 

I I I I I I I III I I II I I I I I I I I I I I I I 

Db 639 TGTCTCCAGCTGTAGACTTGACCTCAAAGATTCCA 

TCCTCAATAGT 59 4 

Qy 233 7 G AAAC C AAAAC CAT C AC AT ATGAATC ATC AC AGG T C GAT C C AG - 

GCACAGATCTGGAGCC 2395 

II I I I I I I I II I I I I I I I I I I I I I I I III 

Db 593 

GAGGATTGACACATCAAAAGTGCCACCTCCCAGGTCAAAGATGAGCACGTTTCT 5 4 0 

Qy 2396 AGGCGTGCTGATGAGTGCACAGACGATCACATCTGAAACC 

ACCAGTACCACCACCAC 2452 

II I I I I II I I I I I I I I I I I I I I I I I 

I I 

Db 539 TTC TGCTCCAACCTTTTTGTCT- 

AAGCCGTAAGCAATAGCAGCAGCAG 493 

Qy 2453 

TACCCACATCACCAAAACTGTGAAAGGGGGCATTTCAGAGACAAGAATTGAGAAGCGAAT 2 512 

I I I I I I II II II I I I I I I I I I I I 

I I I 

Db 492 TTGGCTCAT T AAT AAT T C T AAG T A CATT GAGACCAGC 

AAT 453 

Qy 2513 

AGTCATCACGGGGGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAGA 2572 

III II III I I I I I I I I I I I I I 

I I I 

Db 452 AGT — TC CAGCA-TCTTTG GTAGCCTGACGCTG 

AGA 42 0 

Qy 2573 

GGCCAAAGAGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGACAGA 2 632 

II I III I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 419 GTCATTA AAGTAAGCTGGCA CTGTGACCACAGCATTGGT 

AACAG- 3 76 

Qy 2633 

GATCACACCAGAAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGC 2 692 

II II I I I I II I I I II III I III II 

Db 3 75 — TCTTCCC — AAGGT AGGCTT CTGC — AATTTC 

CTTCATCTT 339 

Qy 2 693 AGTCATGCAAACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCT — 

TCTAACCC — 2748 

I I I I II I I I I I I I I I I I II III III III 

Db 33 8 TGTCA G AAC CAT AG AAG AC AC C T 

CCTCTGGATAGAAGCTTTTGGTCTCTCCCTT 2 85 

Qy 2749 - 

AACTGTACTTGTATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCATT 2 8 0 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 284 GTATTCTACTTG GACCTTGGG CCTGCCAGCATCATTCACC 

ACC— 242 



Qy 2808 



AATAAAGACACTGGCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTAA 2 86 7 

I I I I I I III I III I I I I I I I I I I I I I I I I 

I I 

Db 2 41 -ATAAAG GGCCAATGTTTCATATCAGACTGGA-CAA 

CAGCATCATCAA 196 

Qy 2868 

ACCGATAATGCATGAAGCAACGATAAAATTACAAAAGAGCAGCATTTTTAATTTTCACAA 2927

II I I I I III I I I I III III 

I I 

Db 195 ATC TGCGT CCAAT CAGACGTTTGGCA 

TCAAAA 164 

Qy 2928 

AATGTCTCAGTTTTCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTC 2 9 8 7 

I I I I I I I I I I I I I I I I I I II III 

I I 

Db 163 ACTGTGTTGGTG GG GTTCATTG- 

CAACTTGATTCTTTGCGGCATC 12 0 

Qy 2988 

ATGTAACACATAAACAATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAAA 3 0 4 7 
II III I I I I I I 

Db 119 ACCGATCA- 

ACCGTTCA 104 

Qy 3048 

TTGCAATTTCTTAGGTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGA 310 7 

I I I I I I III II I I I I 

I 

Db 103 GTGTCCGTA AAGGCGAC 

ATAGCTTGG 7 8 

Qy 3108 

AGCTGTCTAGGTCCGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAAG 316 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 77 AG-TG GTTCGG TTTCC-CTGATCA 

TTGGCAATTATC 4 4 

Qy 3168 

TTTATTCAGTAAACAACTTGTACTTGTTTCATCTGGTTTATTACTCTCACCCATAAACAG 32 2 7 

I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 43 TCGACTTTTCCGTG CTGG 

AAAAC AC C C AC AC AAG AC C AG 5 



Qy 3228 TA 3229 

I I 

Db 4 TA 3 
Qy 4 GCGCGCCCGAG CCTAGTCCCCACGCCGCGGCGCGCC-CGGGCTC

CCTGC 51 



Db 1 

GTACGGAAGTGTTACTTCTGCTCTAAAAGCTGCGGAATTCCTCGAGCACTGTTGGCCTAC 60



Qy 52 TGATCCCAGAAC-AATCAACCATGACGACC

GAATCTGGATCAGACTCGGAATCCAAG 107 

II III III I I I I I I I I I I I I II II I II 

I 

Db 61 TGGAATGCGAGCTGAGCAGACAGGGCTGCAAGGAAATCTGGCGCGGT-TC- 

AATACCTCG 118 



Qy 108 CCGGACCAGGAGGCCGAGCCCCAG 

GAGGCGGCGGGGGCGCAGGGGCGGCG 157 

I I I I I I I II I II 

Db 119 TCTAGCCTGGGTTCCAGTATCTAATTTTTTTTTTGTTTTAACTG 

ACAAACTCATT- 173 



Qy 158 

GGGGCCGTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCCCTGGAGCAGTTCGCCGCC 217 

I I I I I II I III I I I I I I I I I I I I I 

Db 174 TCTCTACTGGGACAG GATGCTG-TGCTGG — CTGGA — AGTT — 

CCATT 215 



Qy 218 

GCTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAGGAACAGGAGTTTGCTG 277

I I I I I I I I I I I I III II I I I I I I I I 

Db 216 TCTACAGC AAGAATCC TATCTGGAAA — 

CACAGAAGTTGTCCT 256



Qy 278 CCAGGGCTGCAAAACAGCTCGAATATCAGCAATTAGAAGACGAT-

AAACTTTCTCAGAAA 336 

II II I I I I I I I I I I I I I II I I I I 

Db 257 CTA — GC — CACAGCAGCTCGAACTT 

TTTTGATTGTCGTTGCTGCTTTCTCC 304 



Qy 337 

TCATCTAGCAGTAAACTCTCTCGGTCTCCATTAAAGATTGTCAAAAAGCCTAAAAGCATG 396 



Db 3 05 -CATC ACCCCC ATCCCCTT- 

T T G AC AAAG AT C C AA 33 7 



Qy 397 

CAGTGCAAAGTGATACTTCTCGATGGATCAGAATATACCTGTGATGTAGAGAAACGCTCC 456 
III I I I I I I I I I I I I I I I I 

I 

Db 33 8 CTGTAAAAAGTCTTACGTAAC AGTTCAGGA 

CTAC 3 71 



Qy 457 

AGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACTTGCTAGAGAAAGACTAC 516 

I I I I I I I I I I I I I I I I I I I 

I 

Db 3 72 TTCG GTTCTTTT ACTGGGTAAGCACTT 

TCAAT 4 03 

Qy 517 

TTTGGGCTTACGTATCGAGATGCTGAAAACCAGAAGAATTGGTTGGACCCTGCTAAGGAA 5 76 
III II II I I I I I I I 

Db 404 TTT TTTTTTTT 

AACTAAAAGCC 42 5 

Qy 577 ATAAAAAAACAGGTTC-GAAGTGGTGCTTGGCACTTTTCATTTAATGTGAAATTT- 

TATC 63 4 

II I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 426 ATTTTAAAATTGAATCTGTTGAGGGGCTTGACTAAAATCTTTTAA 

GTAATTTGTGT- 481 



Qy 635 

CACCAGACCCTGCCCAACTATCTGAAGATATCACCAGGTACTACCTCTGCTTGCAGTTGC 69 4 
II I I I I I I III I I I I I I 

Db 482 AATGGAATACTGTCAGTGGAT TTTTTCGTCT 

CATTT — 517 



Qy 695 GAGATGACATCGTGTCCGGAAGGCTGCCC-TGCTCCTTTGTTACCCTGGCC- 

TTGCTGG 751 

I I I I I I I I I I I I I I I I I I I I I 

Db 518 CTGCACGTGCTCCTTTGTT- 

CTCAGAACAGAAGCT — 551 



Qy 752 

GCTCCTACACTGTCCAGTCAGAGCTCGGAGACTATGACCCAGATGAATGTGGGAGCGATT 811 

I I I I I I II I I I I I I I I I I I I I I 

Db 552 TTTTATACACATCCC ATAACGCAGCTG 

GAGAGAG— 585 



Qy 812 

ACATTAGTGAGTTCCGCTTTGCACCAAACCACACTAAAGAACTGGAAGACAAAGTGATCG 8 71 
I I I I I I I I I I I I I I I I II 

Db 586 TTATGAAGTCAGTTAT TATAAGGAAC 

ACAAAG 617 



Qy 872 

AGCTGCACAAGAGCCACAGAGGAATGACGCCAGCAGAAGCAGAGATGCATTTCTTGGAAA 931 
I I I I III III III I I I I 

Db 618 -GTTGCTTT CCA TTCTTTGCCTTTAGA 

TAATTAATTTTTTTG 658 



Qy 932 ATGCCAAAAAATTATCAATGTATGG — 

GG T AG AT T T AC AT C AT GC T AAGGAC TC AGAAGG 989 



Db 659 T T T T C T T AAAATGGAGT AT T T AAAGAAGGAAGAAAT T C A- C AAG AAAT AAAC T — 

GTTGG 715 

Qy 990 GG 

TAGAAATTATGTTAGGAGTTTGTGCAAGTGGTCTGTTGATATATCGCGACCGGC 10 45 



Db 716 AGAATTTAGAA AAGTTTGAAGTTTTTACCA 

CCTTTTCTATCTCTAGTTTTG- 766 

Qy 1046 

TGCGAATAAACAGATTTGCCTGGCCCAAGGTTCTAAAGATTTCATACAAACGGAACAACT 1105 



Db 76 7 TGTGGCCAAACACTTGTGCC — GCCTGGGGCGGTGGGGGT AGAGGCA- 

AGCA 815 

Qy 1106 TTTACATTAAGATCC — 

GGCCGGGAGAGTTTGAACAATTTGAAAGCACCATTGGGTTTAA 1163 



Db 816 TAGACAGAGAGGAACTAAGCCAGAC ATGGACA 

GAGCCAAA 862 

Qy 116 4 GCTGCCAAACCATCGAGCT- 

GCCAAGCGTTTATGGAAAGTATGTGTTGAGCATCATACAT 1222 



Db 863 AC C AG AC AG T C CTGGCCGTTCGAT — AGGCCAGCAGTGGGTGAG- 

ACA- 907 

Qy 1223 

TTTTCAGACTACTGTTACCAGAAGCACCTCCCAAGAAATTCCTAACCTTGGGTTCCAAGT 12 82 



Db 908 -GGTCAGCCAGCTG CAGGGGC- 

GGGAGCGGGGG 93 8 

Qy 1283 

TTCGTTATAGTGGCAGGACACAAGCGCAAACGAGAAGAGCCAGTGCGTTGATAGATCGCC 13 42 
II III I I I I I II I I I I I I I I I I I 

Db 939 GGCG GGTAG — GGACA TGGATGTGTGAGGTGCTCATGCGT 

GCGAT 981 

Qy 1343 

CAGCACCTTACTTTGAACGCTCATCCAGCAAACGTTATACCATGTCTCGCAGCTTGGATG 14 02 



Db 9 82 CGTGACTAGATTCTGAA ACTGCCAGC CATTTTCCCA GCTCCGCTTTG- 

TG 1030 

Qy 14 03 GAGCATCAGTGAATGAAAACC — 

ATGAAATATACATGAAGGATTCTATGTCTGCTGCAGA 1460 

I II I I I I I I II I I I I I III II III II I 

I I I 



I I I I I I II 
-AAGGCAC 



Db 1031 CAATCTAAAGGAATGCATCCCCTCTGAAGCAGTCTTGCCAGAGCCTAG 

T GAG G GAGA 1087 



Qy 1461 

GGTTGGTACTGGCCAGTACGCCACAACAAAAGGCATCTCTCAGACCAACTTGATCACCAC 152 0 
I I I I I I I III I I I I I I I I I I I I I I I I I I 

Db 1088 G — AAGTA-TGGTAAATA — CCACAACATATGGAA 

TCAGAAAAT 1126 

Qy 1521 

TGTGACTCCGGAGAAGAAGGCTGAGGAGGAGCGGGACGAGGAAGAGGACAAACGGAGGAA 15 8 0 

I I I I III II I I I I I I I I I I I II I I I I I I 

Db 112 7 ACCGG GAA — CT GGAGTGGG-CAAGG 

GGGAATGCAGAGGGT 116 4 

Qy 1581 

GGGGGAAGAAGTCACGCCCATCTCGGCCATCCAGCACGAGGGAAAGACTGACAGTGAGCG 16 4 0 

I I I I I II I III I I I I I I I I I I I I II 

I 

Db 1165 GTGGAAA TTTTTTAGTGATCTGGAATGTG 

TTGAGTGACAG G AAG T G 1210 

Qy 16 41 CACGGA- 

CACCGCAGCCGACGGGGAGACCACTGCCACTGAGGAGCTAGAAAAAACTCAAG 1699 

I I I I I I I I I I I I III I I I I 

Db 1211 CCCCAAGCTTCTC— CC CCACCAACTCT 

TCTCA— 1241 

Qy 1700 

ATGACCTGATGAAACATCAAACCAACATTAGCGAGCTGAAAAGAACCTTCTTAGAAACCT 1759 

II II I I I I II III I I I I II 

I 

Db 1242 GTCGCGCCTGCTTTTGT CT AAC— 

TCTTGTAA T 1272 

Qy 176 0 CAACAGAC- 

ACTGCCGTAACGAATGAATGGGAGAAGAGGCTTTCCACCTCCCCCGTGCGA 1818 

I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1273 C T AC AC AC T AC T GC — T T AC AAA GCTGTC 

TGAGT 13 0 4 

Qy 1819 

CTGGCCGCCAGGCAGGAGGATGCCCCCATGATCGAACCACTTGTCCCTGAAGAGAAAATG 18 7 8 

I I I I I I I I I I II I I I I I I I I I I 

Db 13 05 TT AAG AC AAAAAG AAAC C T AAAAG TCTCCTT 

ACTTCATAGTA 13 46 

Qy 18 79 GAAACCAAGACGGAGTCCAGTGGATAGAGACGGAACCCACCGTGCACCA- 

CCTGCCGCTT 1937 

I I I I I I I I I I I I I I III I I I I I 

I I I 

Db 13 4 7 CCTGT-GATATGGAAGGAA 

TGTAAAAGCATGACCCTT 13 82 

Qy 1938 — 

AGCACTGAGAAGGTGGTGCAGGAGACCGTGTTGGTGGAGGAGCGGCGTGTGGTGCACG 1995 

I I I I I I I I I I I I I I I I 

I I 

Db 13 83 TAAGC-CTATG GACATTTTT 



TCAGGTATACA 1412 



Qy 1996 CGAGTGGGGATGCTTCTTACTCGGCGGGAGACAGCGG 

GGATGCTGCAGCACAG — 2 0 48 

III I I I I I I I I I I I I I I I 

Db 1413 

GGAGAAAGATAAAATAATTTTCCACAGAAAAATGAGAATTATGAATTATATAGTTCAGGT 14 72 
Qy 2049 

CCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCTTGACGGAGGGGGCTAAAGAGG 210 8 
II II I I I I I III I I I I I I I II 

Db 14 73 TCCAAAT CTAATTTTTAAAAGAATTCTGATTCTGCTACAC 

TTTACAAA — 1520 



Qy 2109 AAGGAGGGGAGGAGGTCGCTAAAGCTG — 

T C C T G G AAC AG G AAG AG AC AG CCGCTGCTTC 2166 



Db 1521 TGCTTAGGTTGGTTCCT-AATTTGAAGGAGACT- 

TGTTTT 1558 

Qy 216 7 CCGTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCACATTTC- 

AGAAACTTTGGAACAAA 2225 

II III I I I I I I I I I I I I I I 

Db 1559 ATTTG GTTAA TGC — 

ATTGCATTTGAACTTGTTTCTATTTTCTTTG 16 02 



Qy 2226 

AACCTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCAGTTTTGGCAGTGTTTCAC 22 85 
I I I I I I I I I I I I I I I I III III I 

I I I 

Db 16 03 — CATAAATTTG-GACTTTG — GGAGAAAAATGCAA 

AGTAATAAGTAGAATGCAC 1652 



Qy 22 86 C G G G AGG AG T AAAGC T AG AAAT T T C C — 

ACAAGAAGTGCCAGTAGTTCACACCGAAACCA 2343 

I I I I I I I I I II I I I I I III II 
Db 1653 TTGGGGGAAAAAAGGAGGATTTTTCCCTTCATGGGTTG- 

GAAA 1694 



Qy 2344 

AAACCATCACATATGAATCATCACAGGTCGATCCAGGCACAGATCTGGAGCCAGGCGTGC 2 4 03 
I I I I I I I I I I III 

Db 1695 GTATTTTAAAGGGTTGTT 

TTCT 1716 

Qy 2404 

TGATGAGTGCACAGACGATCACATCTGAAACCACCAGTACCACCACCACTACCCACATCA 2 463 

I I I I I I I I I I I I II II III III III 

I 

Db 1717 TGAAAAAACAAGCTC-TCTCTTACTTTCTG 

CATCTATGCTATAAAGAT-A 176 4 

Qy 2 46 4 C C AAAAC T G T G AAAGGGGGC AT T TC AGAGAC AAG AAT T G AG AAGC G AAT AGT C AT C - 

ACG 2522 

II I I I I I I I I III II I I I I I I I I I I III 

I 

Db 1765 AC T AT G C TAT AAT AAAT G TAAGG TAG AAAAC T T T AA- 

AG AG AAAAT AAC AG T G T T C T AAG 1823 



Qy 2523 

GGGGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATTAAAGAGGCCAAAGAG 2 5 82 



Db 182 4 TGAAAAGCT-ACTTAG-CATTTTCC CAAACTCACACATTAT 

CAACAGA- 1869 

Qy 2583 

CAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAGACAGAGATCACACCA 2 6 42 



Db 18 70 AACACTAAAAATTAAGAGA — AAAGCGG CCAACTAATTAAATAG — 

CTAGCCT 1918 



Qy 2643 

GAAGATGGAGAGGATTGACCAGAGGAATAACTTAGCTTGCACATGAATGCAGTCATGCAA 2 7 02 
I I I I I I I I I III I I I I I I I I I I I 

Db 1919 TATTTTGG-GGGTATGGGATAGAAAATTAAGT 

GTGAATAAAATGATAC— 1965 



Qy 2703 

ACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAACCCAACTGTACTTGTAT 2 762 
I I I I I III I I I I I I I I I I I I I 

Db 1966 TTGGGA ATGTTTCTCCCTCG — 

TACCACAAAGG 1996 



Qy 2 763 CTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGA- 

C C T T G AC CAT T AAT AAAG AC AC TGG 2 821 

III I I I I I I I I I I I I I I I I I I I I 

Db 199 7 ATGTTAGTGG TCAG-CCT-ACGAGTTAATCCTT- 

GGCTCT — 2041 

Qy 2 822 CAGAGAGATCTTCCCATAATAAAGCAATC- 

TGATTCAGCATCACTAAACCGATAATGCAT 2880 

III I I I I I I I I I I I I I I I I 

I I I 

Db 2 0 42 — GAGCCTTCATGCC GAGC AG AC GTTATTCA 

CAT 2 0 73 



Qy 2881 

G AAGC AAC G AT AAAAT T AC AAAAG AGC AGC AT T T T T AAT T T TC AC AAAAT G T C T C AG T T T 2 9 4 0 

I I I I I I I I II I I I I I II I I I I I II 

I I I 

Db 2 0 74 G ACGA TTCGAAAAGTC — CATTCATATATCTCGC 

TACCTGGATTT 2116 



Qy 2941 

TCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCTCATGTAACACATAA 3 0 0 0 
I I I I I I I I I I I I III II 

Db 2117 G AAT AG AAAC C AG ACAGCAAT TCTTTAGTT — 

CCAGCC 2152 



Qy 3001 

ACAATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAAATTGCAATTTCTTA 3 06 0 

I I I I I I I I I II I I I I I I I I I I I I I 

I I I 

Db 2153 ACCATTC — GCC — CCA CTGGAC — 

AATAGCGATTTGTTA 2186 



Qy 3061 



GGTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAGAAGCTGTCTAGGTC 312 0 



Db 2187 G CACAGAGTCACAGGCTGTGGG ACAC 

AAAGCTTGGAGCTG C 222 7 

Qy 3121 

CGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAAGTTTATTCAGTAAA 318 0 
I I I I I I I I I II I I I I 

Db 222 8 AGAAAGATGGG GGATT 

C AGAGAAC AGGAAAT T AC AG G 226 4 

Qy 3181 CAACTTGTACTTGTTTCATCTGGTTTATTACTCTCA 

CCCATAAACAGTAATGACTC 3236 

I I I I I I I I I I I I I I I I I I I I I 

Db 2265 CTCGATGCACT CCTGG 

CAGCTCTGAGAATACAAAAAAAAAAAAAAAAAA 2313 

Qy 3237 TCTGACCCTCTGGAAATATGTAATGCTTCCAATCT 3271 

III II I I I I I I I I 

Db 2314 AAAGGCCACATG TGCT — CGAGCT 2335 



RESULT 15 
US-10-478-914-3/C 

; Sequence 3, Application US/10478914 

; Patent No. 7335755 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA, AKIRA

; TITLE OF INVENTION: NUCLEIC ACIDS ISOLATED IN NEUROBLASTOMA 
; FILE REFERENCE: 7388-80899 

; CURRENT APPLICATION NUMBER: US/10/478,914 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: PCT/JP02/05294

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-162775

; PRIOR FILING DATE: 2001-05-30 

; PRIOR APPLICATION NUMBER: JP 2001-255226 

; PRIOR FILING DATE: 2001-08-24 

; NUMBER OF SEQ ID NOS : 417 

; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 3

; LENGTH: 2083

; TYPE: DNA

; ORGANISM: Homo sapiens
US-10-478-914-3 

Query Match 24.2%; Score 4714; DB 1; Length 2083; 

Best Local Similarity 39.2%; Pred. No. 0; 

Matches 1256; Conservative 0; Mismatches 770; Indels 1176;
Gaps 181; 

Qy 152 GCGGCGGGGGCCGTGCCGGAGCCGCCCAAGGAGGAGCAGCAGCAGGCC 

CT 201 



Db 2080 GCGGCCGCGACC TGCAGCTC- 

CATGTGGCCTTTTTTTTTTTT 2035 



Qy 



202 



GGAGCAGTTCGCCGCCGCTGCAGCGCACAGCACCCCGGTGCGAGGGAGGTCACTGACAAG 261

I I I I I I I I I I I I I I II I I I I I I 

I 

Db 2034 TTTTTATTTTGCCTTC-TTGCT CTGGT 

GGTTAGTCTCCTACTAT 1992 

Qy 262 GAACAGGAGTTTGCTGCCAGGGCTGCAAAACAGC- 

TCGAATATCAGCAATTAGAAGACGA 320

I I I I I I I I I I I III III II I I I I II II 

Db 1991 AAACATGATTTTCCT TCTTCCCTTCAGAGTC — CTGTCAATCTTT 

ACTT 1945 

Qy 321 TAAACTTTCTCAGAAATCATCTAGCAG- 

TAAACTCTCTCGGTCTCCATTAAAGATTGTCA 379 

I I I I I I I I III I I I I I I I I I I II I I I I I I I I I 

Db 19 44 TT TTTCTCAT T AAT AT AC - AGAAGT T AT AC T ACAG- 

ATACATTCAAGCT 189 8 

Qy 380 

AAAAGCCTAAAAGCATGCAGTGCAAAGTGATACTTCTCGATGGATCAGAATATACCTGTG 439 

I I I I I I I I I I I I I I I I II III III 

I 

Db 189 7 CATG-ACTGCTAA AAACTTCCTGA AATGAGCCT- 

TA 1864 

Qy 440 

ATGTAGAGAAACGCTCCAGAGGACAAGTGCTGTTTGATAAAGTGTGTGAACACTTGAACT 499 

II II I I I I I I I I I I I III 

I 

Db 1863 AT TCATACGAATCATTTCTATTTGTT 

CTTTTTTT 1830 

Qy 500 

TGCTAGAGAAAGACTACTTTGGGCTTACGTATCGAGATGCTGAAAACCAGAAGAATTGGT 559 

I I I I I I I I I I I I I I I I III III 

I I 

Db 1829 T — T T GAG AC AG AC T CTTAC TCTGTCGCCCA 

GGC 1798 

Qy 560 

TGGACCCTGCTAAGGAAATAAAAAAACAGGTTCGAAGTGGTGCTTGGCACTTTTCATTTA 619 
I I I I III II I I I I I I I I I I 

Db 179 7 TGGA — GTGCACTGG CATGAT 

CTCGGCTC 1771 

Qy 62 0 ATGTGAAATTTTATCCACCAGACCCTGCCCAACTATCTGAAGATATCACCAGG- 

T ACT AC 6 78 

I II I I I I I I III I III II III 

Db 1770 ACTGCAACCTCTGCCTCACGAGTT — 

AGGTCAAGCAATTCTTCTGC 172 7 

Qy 6 79 CTCTGCTTGCAGTTGCGAGATGACATCGTGTCCGGAAGGCTGCC- 

CTGCTCCTTTGTTAC 737 

I I I I I I I I II III I I I I I I I I I I I III II 

I 

Db 172 6 CTCAGCCTCCCGAGGAGCTAGGATTACAGGTCC 

CTGCCACCATGCCTGAGTAAT 16 73 

Qy 73 8 CCTGGCCTT GCTGGGCTCCTACACTGT 



C C AG T C AG AGC T C GG AG AC T A 785 

II II I I I I I I II III III II I I I I I I 

Db 1672 

TTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCCAAGCTGGTCTCGAATTC-C 1614 

Qy 786 TGACC-CAGATGAATGTGGGAGCGATTACATTAGTGAGTTCCGCTTTGCACC- 

AAACCAC 8 43 

I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1613 TGACCTCAGGTGA TCCGCC 

CACCTCAGCCTC 1583 

Qy 844 

ACTAAAGAACTGGAAGACAAAGTGATCGAGCTGCACAAGAGCCACAGAGGAATGACGCCA 9 03 
I I I I I I I I I I III I I I I I I I I I 

I I I 

Db 1582 CC-AAAGTGCTGG GATTACGGG CGTGAGCCACGGT 

GCCT 1545 

Qy 904 GCAGAAGCAGAGATGCATTTCTTGGAAAA 

TGCCAAAAAATTATCAATGTATGGGG 958 

I II II II II III I I I I II I I I I I I I 

Db 1544 G GCCTCTATTTTTTATTTTAAATACCTTTTGCCCTGAA 

ATCTATGTCC T 1496 

Qy 959 TAGATTTACATCA- 

TGCTAAGGACTCAGAAGGGGTAGAAATTATGTTAGGAGTTTGTGCA 1017 

I I I I I I I I I I I I I III I I I I I I I I I I 

Db 1495 CTGATTATCAACACTGCTTA CCAG TTTTATT 

TTTGT 1460 

Qy 1018 

AGTGGTCTGTTGATATATCGCGACCGGCTGCGAATAAACAGATTTGCCTGGCCCAAGGTT 10 7 7 
I I I I I I I II I I I I I I I I I I I I I I 

I 

Db 1459 — TGGTATTTATTTCATT C AAT C C C AGC TCTACA- 

ACTTACCAG T 1418 

Qy 1078 

CTAAAGATTTCATACAAACGGAACAACTTTTACATTAAGATCCGGCCGGGAGAGTTTGAA 113 7 

I I I I I I I I I I I I I I I I I I III II I I I I I 

Db 1417 ATAAAAATTT GGATCAG TTACTT AATCTTGC 

AGTTTCTT 1379 

Qy 1138 

CAATTTGAAAGCACCATTGGGTTTAAGCTGCCAAACCATCGAGCTGCCAAGCGTTTATGG 119 7 

I I I I I I I II I I I I I I I I I I I I I I I 

Db 13 7 8 CCTTTTAAA AATGGAATAATATTATCTATCATATAAAATGCCCAG 

TTAA 1330 

Qy 1198 

AAAGTATGTGTTGAGCATCATACATTTTTCAGACTACTGTTACCAGAAGCACCTCCCAAG 12 5 7 

I I I I I I I I I I I III I I I I I I III I 

I I 

Db 132 9 CAAATATGAACT- 

AACACCTACAATGTGCCAGGCACCATTCTAGGGAATAGAATGTAAAT 12 71 

Qy 1258 AAATTCCTAACCTTGGGTTCC AAGTTT- 

CGTTATAGTGGCAGGACACAAGCGCAA 1311 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I 



Db 1270 AAGTTCAAGTTCTTG 

AC C T GAATAAG TTTACATTTTAGT G AG AAG AC - 



1224 



Qy 1312 

ACGAGAAGAGCCAGTGCGTTGATAGATCGCCCAGCACCTTACTTTGAACGCTCATCCAGC 13 71 
I I I I I III! Ill I I I I 

I I I I 

Db 12 23 AG AC AAT AAAT C AAT AAC AT G T C C ATTATG 

CCAG- 119 0 

Qy 13 72 AAACGTTATACCATGTCTCGCAG-CTTGGATGGAGC- 

AT C AG T G AAT G AAAAC CAT G AAA 142 9 

I I I I I I I I I I I I II I I I I I I I I I I I III I 

I I 

Db 1189 -ATAGTTTTA-AATGTCACAAAGAAAAAGGGGGAGCAATCAGATAGTG 

ATGGAA 113 8 

Qy 1430 

TATACATGAAGGATTCTATGTCTGCTGCAGAGGTTGGTACTGGCCAGTACGCCACAACAA 14 89 
I I I I I I I I I I I I I I I I I I I 

I I 

Db 113 7 GTTACA CT-CTAGAGAGGG TGGACAG 

AA 1111 

Qy 1490 

AAGGCATCTCTCAGACCAACTTGATCACCACTGTGACTCCGGAGAAGAAGGCTGAGGAGG 1549 

III I I I I I I I I I I I I III I I I I I III 

Db 1110 AAAGTTTCTCTTAGG AG AT G AC AT C T G AACAGAGAT 

CTAA 1071 

Qy 1550 

AGCGGGACGAGGAAGAGGACAAACGGAGGAAGGGGGAAGAAGTCACGCCCATCTCGGCCA 16 09 
III I I I I I I I I I I I I I 

Db 10 70 ATGAC T GAG T AC AAAC 

CCCA 1051 

Qy 1610 TCCAGCACGAGGGAAAGACTGACAGTGAGCGCACGGACACCGCA — 

GCCGACGGGGAGAC 166 7 

I I I I I I III I I I I I I I I I I I I I I I 

I I I 

Db 1050 ATGGAAAAG — TGATTG- 

GTGTGTTGTGAAACAGCAACACAGCCAGTTTGAC 1002 

Qy 1668 

CACTGCCACTGAGGAGCTAGAAAAAACTCAAGATGACCTGATGAAACATCAAACCAACAT 172 7 
I I I I I I I I II I I I I I I III I I I I I I I 

Db 1001 TA GAT GAG TAG AT C AG AAT G AC T G AGG T GGG AC T G T G G — 

TCATCAA 957 

Qy 1728 

TAGCGAGCT G AAAAG AAC C T T C T TAG AAAC C T C AAC AG AC AC T GC C G T AAC G AAT G AAT G 17 8 7 
I I I I I I I I I I III 

I I 

Db 956 -AGAGATCAGAGAGGTA GCC 

TG 936 



Qy 1788 

GGAGAAGAGGCTTTCCACCTCCCCCGTGCGACTGGCCGCCAGGCAGGAGGATGCCCCCAT 18 4 7 

III III I I I I I I I I I I I I I I 

Db 935 GGA AGG CCACATCAC 



AGAATGTCC 912 



Qy 1848 

GATCGAACCACTTGTCCCTGAAGAGAAAATGGAAACCAAGACGGAGTCCAGTGGATAGAG 19 0 7 

II I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 911 -TTTGGGCCTC-TGT AAGAGCGTTTTGAA AAAAC — 

AGTTCTGAGCAAAGGT 86 4 

Qy 1908 

ACGGAACCCACCGTGCACCACCTGCCGCTTAGCACTGAGAAGGTGGTGCAGGAGACCGTG 196 7 

II I I I I I I I I I I I I I I I III 

Db 863 TAGTCATTTAACTTG ACTTACTCTTT AGAAG 

GATCACT 826 

Qy 1968 

TTGGTGGAGGAGCGGCGTGTGGTGCACGCGAGTGGGGATGCTTCTTACTCGGCGGGAGAC 2 02 7 

I I I I I I I I I I III 

I I I I I 

Db 825 TTGGCTATTGAGTGGC 

AAACAACTGGAGA- 79 7 

Qy 2028 

AGCGGGGATGCTGCAGCACAGCCCGCATTCACAGGCATTAAAGGGAAAGAGGGCTCTGCT 2 0 8 7 

I I I I I I I I I I I I I I I 

Db 796 AGTGG TTAGGGGA 

GCACTGT- 777 

Qy 2088 

TGACGGAGGGGGCTAAAGAGGAAGGAGGGGAGGAGGTCGCTAAAGCTGTCCTGGAACAGG 214 7 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 7 76 GGGCAGGAATAGAAGGAGG AAAAGCAG- 

ACTGGAGCC — 741 

Qy 2148 

AAGAGACAGCCGCTGCTTCCCGTGAGCGACAAGAGGAGCAGAGTGCAGCCATCCACATTT 2 2 0 7 
III I I I I I III I I I I I I I I I I III 

Db 740 T AAT C C AG T AC T T C AG AT G A — GAAATGA CAGATT— 

TGCC 704 

Qy 2208 

CAGAAACTTTGGAACAAAAACCTCATTTTGAGTCCTCAACGGTGAAGACGGAAACCATCA 2 2 6 7 

I I I I I I I I I I I I I I I I I I I 

I 

Db 703 TGGTACA TCATT TCCT 

ATCCCTTTA 679 

Qy 2268 

GTTTTGGCAGTGTTTCACCGGGAGGAGTAAAGCTAGAAATTTCCACAAGAAGTGCCAGTA 232 7 

I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 6 78 CTTATAACT-TATT GGTGT 

ATTTTTATTTTAAGTGTAAATT 639 



Qy 2328 

GTTCACACCGAAACCAAAACCATCACATATGAATCATCACAGGTCGATCCAGGCACAGAT 23 8 7 
II I I I I I I I I I I I I I I I 

I I I 

Db 63 8 TTT ATAAATGAAGCATAATTTGAAGAT 

TGAT 608 



Qy 2388 

CTGGAGCCAGGCGTGCTGATGAGTGCACAGACGATCACATCTGAAACCACCAGTACCACC 2 4 4 7 

II I I I I I I I I III III III 

I 

Db 607 ATGTT — CGGACTTG-TGTT TTCTATTATCA- 

TGGTGTCT-C 571 

Qy 2448 

AC C AC T AC C C AC AT C AC C AAAAC TGTGAAAGGGGGC AT T TC AGAGAC AAGAAT T G AG AAG 2 5 0 7 

I I I I II III I I I I I I I I I I I 

Db 570 TCCACATACCTTTTCCCTTTAG TTCTGCTTCT — 

TATTGTGC — 531 

Qy 2508 

CGAATAGTCATCACGGGGGATGCAGACATTGACCATGACCAGGCGCTGGCTCAGGCAATT 2 56 7 

I I I I I I I I I I I I I I I I I I I I 

I I 

Db 53 0 TCCTCAAG TCTATTTTTGTTCTTTTCC CTCT- 

ACTTCTACTGTT 488 

Qy 2568 

AAAGAGGCCAAAGAGCAGCACCCTGACATGTCAGTGACCAAAGTAGTGGTCCATAAAGAG 2 62 7 

III III II III I I I I 

I 

Db 487 T TGAACTGTATATGC TATT — TCTATTT — 

TG 460 

Qy 2628 

AC AG AG AT C AC AC C AG AAGATGGAGAGGAT TGACC AGAGGAAT AAC T T AGC T TGC AC AT G 26 8 7 

III I I I I I II II I I I I I II 

Db 459 CCAGGAGTTACCCC CTT CA 

ATTTTCGTTGTACTCTCT — 423 

Qy 2688 

AATGCAGTCATGCAAACCGTTAGGAAAACCAGAGCCTATATGGAGTTCCCTCTTCTAACC 2 7 4 7 

I I I I I I I I I I I I I I I III I I I I I I I I 

I 

Db 422 CAGTCT — CAAAC ACATACCC ATG 

CTCTCTTCTGT-C 389 

Qy 2 748 CAACTGTACTTGT- 

ATCTGTCCGTGGAAAATTTCAGTCCAGAAGAATTGACCTTGACCAT 2 8 06 

I I I I I I I I I I II II I I I I III III I I I 

I I 

Db 388 CTTCTCTTCATATCATC — TTCTAGAATTAGTTCTACCCCG ATTTTC — 

TACCAAT 33 7 

Qy 2807 

TAATAAAGACACTGGCAGAGAGATCTTCCCATAATAAAGCAATCTGATTCAGCATCACTA 2 866 
III I I I I I I III I I I I I I I I 

I 

Db 336 TTATTTTAT TTCCAAT C AG T AC AT T GAT T C AG 

ATT 302 

Qy 2867 

AAC C G AT AAT GC AT G AAGC AACGAT AAAAT T AC AAAAGAGC AGC AT T T T T AAT T T T C AC A 2 92 6 

I I I I III I I I I II I I I I I I I I 

I 

Db 301 TATCAAT ATGTTTTACCAATTTTAT 

CATTTTTTTTGGAGGCT 260 



Qy 2927 

AAATGTCTCAGTTTTCAGCTATACCTGCTCGTTCATAACCAACAATATAAACCGTGGTCT 2 9 86 
I I I I I I I I I I ! II I I I I I II II I 

I I I 

Db 259 TGTTATTTAAG-GTTCA — TTTTTCTTGGAGTACAT — CCTTCAGAA 

GTTCT 213 

Qy 2987 

CATGTAACACATAAACAATTCATGCCTTTCATAGTTTATTATTATTAAAGTCTAAACAAA 3 0 46 
I II III I I I I I I I I I I I I I I I I I 

Db 212 TTCAGCAAGAAC TCATGG- 

TTGTTAATCTTTGAGTC C 177 

Qy 3047 

ATTGCAATTTCTTAGGTAACCTTATATTTACAATAAATGAAGATTACCCTCAAATGCTAG 3106 
III I I I I I II I I I I I I I I I I I I I I I 



Db 176 TTTG TTTCT— GG TGTTTTAACTTT 

TTCCCCTCACTCTTGAG 13 7 

Qy 3107 

AAGCTGTCTAGGTCCGTCCGGTGTGTCAGATTTCCTCAGATTAGATGTGCCAATAACCAA 3166 
I I I I I I III I I I I I I I I I I I I I 

Db 136 TAATAGTTTAG CTGGGCATAAAATT 

TCAGATTGG 103 

Qy 3167 

GTTTATTCAGTAAACAACTTGTACTTGTTTCATCTGGTTTATTACTCTCACCCATAAACA 3226 
I I I I I I I I I I I I I III 



Db 102 CAGT TACTTTTCCTTG GCACT 

ACG 79 

Qy 3227 

GTAATGACTCTCTGACCCTCTGGAAATATGTAATGCTTCCAATCTTGCTTTGTGTATCTC 32 86 

I I I I I I I I III I I I I I I I I I I I I I I I I I 

I I 

Db 7 8 AAGATGCCGCTCCGT-TATCT TAT T AC AGC C AAT AT T AC T AT - 

TGTTCCTT 30 

Qy 3287 ATTTAATTTGTTCCGGTTAAGG 3308 

I I I I I I I III 

Db 2 9 TGTAGGTAATCTCCAGT — AGG 10 